Standards for personal genomics information apply to everything but the data

Genome Web's Daily Scan noted an interesting blog post today from John D. Halamka, one of the people who had his genome sequenced through the Personal Genome Project.

I was interested to see his post because Genome Web wrote that he was discussing data standards, and we have been writing quite a bit ourselves about data measurements for Next Gen sequencing (e.g. Next Gen-Omics) on our company blog, FinchTalk.

But Halamka didn't write about standards for data.

He wrote about standards for metadata, like family histories, and the things that are done with data after it's been collected.

[Figure: data_life.gif]

All of those issues are important, but as you can see from my drawing, the regulations that Halamka describes only cover part of the picture. How will you know that your genome sequence data are correct? With several different platforms for Next Gen sequencing, all measuring information in different ways, it may be some time before the real data standards emerge.

That's because standards for how closely raw data match a gold standard--what is known as analytical validity--generally exist as part of being a CLIA-licensed lab. If genomic data are eventually going to be used for explicit medical or diagnostic purposes, labs producing them will have to be licensed through CLIA.

Standards for clinical validity--how well analytically valid data predict a particular outcome--are much softer and often change from condition to condition. The CDC only has clinical validity reports on five conditions, and those are generally limited to mutations known to be involved in familial forms of disease (but not sporadic forms of e.g. breast/ovarian cancer and colorectal cancer).

By neandrothal (not verified) on 19 Nov 2008