More genes please!

Targeted discovery of novel human exons by comparative genomics:

Here we describe a genome-wide effort, carried out as part of the Mammalian Gene Collection (MGC) project, to identify human genes not yet in the gene catalogs. Our approach was to produce gene predictions by algorithms that rely on comparative sequence data but do not require direct cDNA evidence, then to test predicted novel genes by RT-PCR. We have identified 734 novel gene fragments (NGFs) containing 2188 exons with, at most, weak prior cDNA support. These NGFs correspond to an estimated 563 distinct genes, of which >160 are completely absent from the major gene catalogs, while hundreds of others represent significant extensions of known genes. The NGFs appear to be predominantly protein-coding genes rather than noncoding RNAs, unlike novel transcribed sequences identified by technologies such as tiling arrays and CAGE. They tend to be expressed at low levels and in a tissue-specific manner, and they are enriched for roles in motor activity, cell adhesion, connective tissue, and central nervous system development. Our results demonstrate that many important genes and gene fragments have been missed by traditional approaches to gene discovery but can be identified by their evolutionary signatures using comparative sequence data. However, they suggest that hundreds--not thousands--of protein-coding genes are completely missing from the current gene catalogs.

ScienceDaily makes it intelligible.

Tags

More like this

The ENCODE project made a big splash a couple of years ago — it is a huge project to not only ask what the sequence of a strand of human DNA was, but to analyzed and annotate and try to figure out what it was doing. One of the very surprising results was that in the sections of DNA analyzed,…
If you missed it, today's NY Times Science section has been dedicated to "The Gene" a concept invented 99 years ago by Wilhelm Johanssen. Overall, the articles were very good, however as a scientist who wants to explain basic concepts of molecular biology to the masses, I have a few problems. First…
PLoS Biology, Medicine, Neglected Tropical Diseases and ONE publish on Tuesday. What's new today? As always, you should rate the articles, post notes and comments and send trackbacks when you blog about the papers. You can now also easily place articles on various social services (CiteULike,…
You know that organisms develop, grow, and function in part because genes code for proteins that form the building blocks of life or that function as working bioactive molecules (like enzymes). You also know that most DNA is junk, only a couple percent actually coding for anything useful. Most…