Now on ScienceBlogs: Another contender for the worst reporting ever: "Coma man"

Seed Media Group

Collective Imagination

Discovering Biology in a Digital World

My thoughts on biology, teaching, life, and exploring the living world via the digital one. Only my opinions are represented by these postings, they do not represent the viewpoints of any funding agency or Geospiza, Inc.

Profile

Sandra Porter I am a microbiologist and molecular biologist turned tenured biotech faculty turned bioinformatics scientist turned entrepreneur. My passion is developing instructional materials for 21st century biology (Digital World Biology).

Search

Digital World Biology

Discover Biology with Bioinformatics


Subscribe to our newsletter


e-mail digitalbio at scienceblogs.com

use 'Digital World Biology' news as the subject

DigitalBio Favorites

Science Blogs School Fundraiser


link_donorschoose_small.gif


Recent Posts

Recent Comments

Categories

Blogroll

Science Education Groups

Keep up to date

Awards

Red Orbit

Digital Bio at Blogged

Wikio - Top Blogs - Sciences
Add Digital Bio to your Technorati Favorites!





Follow me on Twitter

When you need to laugh

Interesting places

The Tangled Bank
MicrobeWorld Radio

Locations of visitors to this page

Archives

« Open science, peer review and the flu | Main | BLASTing through the flu: activity 5, how similar is similar? »

Why would we be able to detect more genetic variation by blasting with nucleotide sequences?

Category: BioinformaticsGenomeInfluenza resourcesScience educationclassroom activitiessequence analysisweb resources
Posted on: April 30, 2009 11:00 PM, by Sandra Porter

We'll have a blast, I promise! But there's one little thing we need to discuss first...

I want to explain why I'm going to use nucleotide sequences for the blast search. (I used protein the other day). It's not just because someone told me too, there is a solid rational reason for this.

The reason is the redundancy in the genetic code.

Okay, that probably didn't make any sense to those of you who didn't already know the answer. Here it is.

standard genetic code.png  The picture above shows the human genetic code (there are at least 16 variations on this, but that's another story). Each middle cell in the table shows the codons. Those are the groups of three bases in the left most column. Then, reading from left to right, we have the three letter and one letter abbreviations for the amino acids they encode. In every case, except for tryptophan (W), an amino acid can be encoded by multiple codons. (That's what we mean when we say the code is redundant. Oops, I said it again!)

Well, this means that we can have the same amino acid in a protein, but different codons in the mRNA. 

So a protein sequence like this: FLAKEY

Could be encoded by the DNA sequence: TTTCTTGCCAAATAT
                               or the DNA sequenceTTCCTAGCAAAGTAC

These two sequences are only 70% identical but they code for amino acid sequences that are 100% identical.

Thus, you can see more variation at the level of the nucleotides. 

One other thing, you might be wondering why there are T's in these sequences when the virus is made of RNA.  Well, one reason is we usually make a DNA copy of RNA before we do any sequencing.  The other reason, is that we store almost all sequences in the form of a DNA sequence, even when the sequences really did come from RNA. 


Share this: Stumbleupon Reddit Email + More

Post a Comment

(Email is required for authentication purposes only. On some blogs, comments are moderated for spam, so your comment may not appear immediately.)





ScienceBlogs

Search ScienceBlogs:

Go to:

Advertisement
Enter to win a free copy of The Monty Hall Problem
Visit the Collective Imagination blog
Advertisement
Collective Imagination

© 2006-2009 Seed Media Group LLC. ScienceBlogs is a registered trademark of Seed Media Group. All rights reserved.

Sites by Seed Media Group: Seed Media Group | ScienceBlogs | SEEDMAGAZINE.COM