Standards for personal genomics information apply to everything but the data

Genome Web's Daily Scan noted an interesting blog post today from John D. Halamka, one of the people to get his genome sequenced through the personal genome project.

I was interested to see his post since Genome Web wrote that he was discussing data standards and we have been writing quite a bit, ourselves, about data measurements for Next Gen sequencing (e.g. Next Gen-Omics) on our company blog, FinchTalk.

But Halamka didn't write about standards for data.

He wrote about standards for metadata, like family histories, and the things that are done with data after it's been collected.

i-b5d8b2c8ab6af9745dc815645de3bec0-data_life.gifAll of those issues are important, but as you can see from my drawing, the regulations that Halamka describes only cover part of the picture. How will you know that your genome sequence data are correct? With several different platforms for Next Gen sequencing, all measuring information in different ways, it may be some time before the real data standards emerge.

More like this

"How much do I love you? I'll tell you no lie. How deep is the ocean? How high is the sky?" - Irving Berlin The other installments are here:Part I: IntroductionPart II: Sequencing strategiesPart III: Reads and chromatsPart V: checking out the library We all know that sequencing a genome must be…
One of my colleagues has a two part series on FinchTalk (starting today) that discusses uncertainty in measurement and what that uncertainty means for the present and Next Generation DNA sequencing technologies. I've been running into this uncertainty myself lately. I have always known that DNA…
A couple of years ago, I answered a reader's question about the cost of genome sequencing. One of my readers had asked why the cost of sequencing a human genome was so high. At that time, I used some of the prices advertised by core labs on the web and the reported coverage to estimate the cost…
Well, someone at ScienceBlogs had to draw down on Scientopia, and it might as well be the Mad Biologist. I was going to respond to this post by proflikesubstance about genomics and data release in a calm, serious, and respectful manner, and, then, I thought, "Fuck that. I'm the Mad Biologist. I…

That's because standards for raw data being similar/identical to a gold standard--what is known as analytical validity--generally exist as part of being a CLIA-licensed lab. If genomic data is eventually going to be used for explicit medical or diagnostic purposes, labs producing them will have to be licensed through CLIA.

Standards for clinical validity--how well analytically valid data predict a particular outcome--are much softer and often change from condition to condition. The CDC only has clinical validity reports on five conditions, and those are generally limited to mutations known to be involved in familial forms of disease (but not sporadic forms of e.g. breast/ovarian cancer and colorectal cancer).

By neandrothal (not verified) on 19 Nov 2008 #permalink