What are hypothetical and putative proteins?

"Beware the Jabberwock, my son!
The jaws that bite, the claws that catch!
Beware the Jubjub bird, and shun
The frumious Bandersnatch!"

- from Jabberwocky, by Lewis Carroll

I'm certain that if we ever sequenced DNA from the frumious Bandersnatch it would match hypothetical and putative proteins.

Why?

Because we always (well, almost always) get matches to hypothetical and putative proteins when we do a database search with a protein sequence.

Why?

Because many of the protein sequences in GenBank (at the NCBI) are a result of conceptual translations.

What? !!

A conceptual translation is where we feed a DNA sequence (or often an mRNA sequence) to a translation program. The program "decodes" the sequence and determines the possible amino acid sequences that could be produced from that nucleic acid.

If we have a DNA sequence, we could have six potential amino acid sequences, since we could decode either strand.

If we have an mRNA sequence, we have three potential amino acid sequences.

We make our best guess which sequence is right, and submit that to the NCBI as a hypothetical or putative protein. The protein remains hypothetical or putative until there are other data to show that it really exists.

How do we do that?
With the genes that I sequenced, I did this by cloning the sequences into E. coli, producing the protein, making antibodies to the protein, and using the antibodies to demonstrate that the protein could be found in the organism.

I think mass spectrometry is one of the additional methods being used today.

More like this

tags: plants, bioinformatics, sequence analysis, viruses, fungi Quick synopsis: A type of grass grows in Yellowstone National Park in hot (65° C), unfriendly soil. How the plant manages this feat is a mystery. What we do know, is that the grass can only tolerate high temperatures if it's been…
Part II. What do mumps proteins do? And how do we find out? This is the second in a five part series on an unexpected discovery of a paramyxovirus in mosquitoes, and a general method for finding interesting things. I. The back story from the genome record II. What do the mumps proteins do? And…
In which we search for Elvis, using blastp, and find out how old we would have to be to see Elvis in a Las Vegas club. Introduction Once you're acquainted with proteins, amino acids, and the kinds of bonds that hold proteins together, we can talk about using this information to evaluate the…
Here is the third BIO101 lecture (from May 08, 2006). Again, I'd appreciate comments on the correctness as well as suggestions for improvement. -------------------------------------------------- BIO101 - Bora Zivkovic - Lecture 1 - Part 3 The DNA code DNA is a long double-stranded molecule…

Thanks Sandra,

I'm taking my first graduate courses (although I really will not start GS tell next fall) and one of them is an on-line class in genetics and molecular biology. This post somehow clarified what I was reading on the bus home from school today.

Sir/Madam,i have a project on homology modelling of Protein,i hve heard of GPRC which have function but no structure,playing a role in Huntington disease.Can you please help me out?