Seed Media Group

Discovering Biology in a Digital World

My thoughts on biology, teaching, life, and exploring the living world via the digital one. Only my opinions are represented by these postings, they do not represent the viewpoints of any funding agency or Geospiza, Inc.

Profile

Sandra Porter I am a microbiologist and molecular biologist turned tenured biotech faculty turned bioinformatics scientist turned entrepreneur. My passion is developing instructional materials for 21st century biology (Geospiza Education).

Search this blog

Learn about DNA with molecular models

Exploring DNA Structure


Subscribe to Geospiza Education News


e-mail digitalbio at gmail.com


DigitalBio Favorites

Molecular Momentos


Recent Posts

Recent Comments

Archives

Categories

Rotating Blogroll

Science Education Groups

Science Blogs School Fundraiser



Keep up to date

Awards

Red Orbit

Digital Bio at Blogged


Add Digital Bio to your Technorati Favorites!

Interesting places

  • xkcd
  • The Tangled Bank
    MicrobeWorld Radio

    « Hot plants and viruses: the story continues | Main | MeSH part I. Where can you find the meaning of "life"? »

    What are hypothetical and putative proteins?

    Category: Ask Dr. ScienceBasicsBioinformaticssequence analysis
    Posted on: February 8, 2007 3:17 PM, by Sandra Porter

    "Beware the Jabberwock, my son! The jaws that bite, the claws that catch! Beware the Jubjub bird, and shun The frumious Bandersnatch!"

    - from Jabberwocky, by Lewis Carroll

    I'm certain that if we ever sequenced DNA from the frumious Bandersnatch it would match hypothetical and putative proteins.

    Why?

    Because we always (well, almost always) get matches to hypothetical and putative proteins when we do a database search with a protein sequence.

    Why?

    Because many of the protein sequences in GenBank (at the NCBI) are a result of conceptual translations.

    What? !!

    A conceptual translation is where we feed a DNA sequence (or often an mRNA sequence) to a translation program. The program "decodes" the sequence and determines the possible amino acid sequences that could be produced from that nucleic acid.

    If we have a DNA sequence, we could have six potential amino acid sequences, since we could decode either strand.

    If we have an mRNA sequence, we have three potential amino acid sequences.

    We make our best guess which sequence is right, and submit that to the NCBI as a hypothetical or putative protein. The protein remains hypothetical or putative until there are other data to show that it really exists.

    How do we do that?
    With the genes that I sequenced, I did this by cloning the sequences into E. coli, producing the protein, making antibodies to the protein, and using the antibodies to demonstrate that the protein could be found in the organism.

    I think mass spectrometry is one of the additional methods being used today.

    Comments

    #1

    Thanks Sandra,

    I'm taking my first graduate courses (although I really will not start GS tell next fall) and one of them is an on-line class in genetics and molecular biology. This post somehow clarified what I was reading on the bus home from school today.

    Posted by: KevinC | February 8, 2007 4:49 PM

    Post a Comment

    (Email is required for authentication purposes only. Comments are moderated for spam, your comment may not appear immediately. Thanks for waiting.)





    Having problems commenting? (UPDATED)

    Search All Blogs

    Blogs in the Network

    Top Five: Most German

    Top Science Stories

    powered by SEED - seedmagazine.com