What can you learn from a whole genome sequence?

By dgmacarthur on April 29, 2010.

A paper just released in the Lancet describes a thorough and integrated approach to squeezing as much clinically relevant information as possible out of a genome sequence. However, despite a state-of-the-art clinical interpretation pipeline, the major message from the paper is just how far we still have to go before we can make full use of our genetic information.

The paper is based on the genome of Stephen Quake (right), which was sequenced using the single-molecule platform developed by Helicos (I wrote about Quake's genome publication at the time). This is a rather curious choice: of all of the genome sequences currently available for analysis, Quake's is one of the least complete and accurate due to the very short reads and high error rates of the Heliscope. It's also interesting to note that at least one of the other authors on the paper - George Church - has a substantially better-quality sequence of his own genome (generated by Complete Genomics) in the public domain.

Nonetheless, Quake's genome it is. The authors throw everything they can at the sequence, bringing in information from databases of both common and rare disease-associated variants and variants affecting drug metabolism, as well as family history and various clinical tests.

There are some genuinely intriguing results: three independent rare mutations in genes associated with sudden cardiac death (although one of these is later shown to be probably benign), and - integrated across the full available set of common risk markers - high lifetime risks for three actionable conditions, myocardial infarction, type 2 diabetes and obesity. Based on Quake's risk predictions, his physicians decided to recommend a lipid-lowering drug (which, incidentally, he would be predicted to respond positively to based on variants in drug-metabolising genes).

That's a tantalising glimpse into the future of personalised medicine using genetic information. Still, it's important to remember what this analysis is missing. Firstly, there's the more than 10% of the genome that can't be accessed at all using Helicos' short-read technology; some of this can't currently be analysed by any of the currently available sequencing technologies (although most would do a substantially better job than Helicos, it must be said).

But more importantly, there are the variants that simply can't be interpreted. This includes virtually everything seen outside protein-coding regions, and the majority of even those variants found inside coding regions. We simply don't understand the biology of most genes well enough yet to be able to predict with confidence whether a novel variant will have a major impact on how that gene operates; and we have an even less complete picture of how genes work together to affect the risk of disease.

That means that the real benefit of whole-genome sequencing over other assays - the uncovering of truly novel or rare genetic variants - has much less of an impact than it should, because in most cases it's impossible to assign function to such variants. Indeed, it's striking in this study that the really compelling, actionable findings - the increased risk of myocardial infarction and metabolic diseases, and the drug metabolism effects - come largely from common variants, most of which would be captured by chip-based assays such as that used by 23andMe. (The two rare variants potentially linked to sudden cardiac death are intriguing, and warrant extra surveillance, but don't yet appear to be compelling risk factors.)

The authors should be commended for their efforts in bringing a wealth of functional annotation and clinical interpretation together for this study, but it's clear we have a lot further to go before we can extract everything of value from a genome sequence.

More like this

Daniel,
Hmmmm. This will be the quandry for 10 years, 20 max. Time to dig into EnSembl and Zebrafish.....hmmm, conserved region....hmmmm ok, maybe that's bad.....hmmm, what's your cholesterol?

-Steve

Wow, I had no idea we were that far along in this. I figured all the stuff about certain health risks being genetically conditioned was simply based on large-scale studies, not knowledge of the actual specific genes involved. Seems like we'll live to see the day when newborn kids can get some serious medical-prediction stuff (if I'm not mistaken). Neat!

It's an interesting paper. Whole genome sequencing will be interesting for many reasons (particularly I think for human migration history) but it will be the variation scans that will be clinicaly useful whether for SNPs, indels, CNVs etc. Technology is steaming ahead but the biology can't move so quickly. Over 5 years of GWAS we have gone from a few $million per genome to a few $thousand but have not YET made much dent in gene based disease prediction (and I don't believe - re MCK - that 5 years of deep sequencing rare SNP search will be the answer either) - so you're final paragraph is spot on

I agree! I seriously doubt we will garner much more information about disease risk from genome sequencing than from GWAS studies. Until we have a better understanding of all the epigenetic and other genetic mechanisms (which are likely influenced by environmental exposures) which affect gene expression/translation etc over time, I suspect we will only be making little baby steps in finding specific factors involved in causing high disease risk. Great summary!

bring it on !

I'll bet you he has a malaria susceptability gene and if you treated him for lymes disease with doxycyline. He would get fat and have a heart attack from chronic babesiosis. Wilson and Chowning saw this in the Bitter Root Valley in 1905 ---- human piroplasmosis. They got run out of town and the cattle got treated. Good history --real history. Allergy and Immunology? No -- people dying from endemic disease. They saw a tick borne version of malaria back in 05 that still hasn't gotten proper recognition today. Babesia Microti, WA-1, MO-1, CA-1 --- we have very specific tests for those today --- but to convince your gatekeeper to look? You'll get thorazine before doxy. and guess what the nobel prize in medicine was awarded for in 1927? Creating thorazine to treat people with malaria and syphylis.

Thank God for Zyrtec!

I see about 5 patients per week who do not know at least half of their family history. Genetic testing TODAY is cost effective for them; in the future, it will be ILLEGAL not to.

I'm tired of the nay-sayers who want to live in the past: genetic testing is here to stay. Period.

And yes, the marginal utility of a WGS does not merit the massive increase in cost relative to a SNP chip. But that is always subject to revision as the cost curve progresses.

Hi Daniel,

Any comment on the MS twin study published in Nature?

@Steve "....hmmm, what's your cholesterol?"

What Steve advocates is an "occam's razor" of clinical application.

A genetic risk factor is not relevant when the factor itself can be directly tested. For example, if a patient complains of hypertension-like symptoms, you confirm a diagnosis with a sphygmomanometer, and then you treat the hypertension. A genetic test for a risk of hypertension would be redundant as the probability that the patient has hypertension is already "1.0".

However, if the genetic test informed us at some higher resolution about some malfunctioning biochemical pathway which caused the hypertension symptoms, and that information was necessary in some optimal clinical decision ---that would be useful. I'm not aware of any such application in this case.

People hope that DNA will have all the answers to good health. The problem is that those answers are not well translated into the language of medicine ---or for that matter, humans. "Hypertension" in medicine is about as descriptive as "rocky" is in astronomy. ("ICD-9 401.9: Unspecified essential hypertension" is about as descriptive as "a solid, rock-like mass")

@Afterthought "I see about 5 patients per week who do not know at least half of their family history. Genetic testing TODAY is cost effective for them; in the future, it will be ILLEGAL not to."

*Illegal* not to? This is both terrifying and terrifying. By this same logic, it's cost effective to get your blood pressure. Should it also be illegal not get a regular blood pressure?

The didn't perform a physical exam, they didn't do a HGBA1c, no PSA, looks like the didn't do a clinical assessment, just played with genome browsers and sequencers. How is that medicine? http://thegenesherpa.blogspot.com/2010/05/personal-genomes-in-clinical-…

The answer, it is not.

-Steve

@afterthought

You actually see patients and had a comment like that?
It is truly scary a clinician would suggest it illegal to refuse a test. What's next, Death Panels? Nice.......

-Steve

@ afterthought,
The possibility of using genetic testing to obtain a family history from a patient who can't is something to look forward to someday, but unless I am mistaken, here in the US we can't force anyone to be tested. As it is right now, most people would find the cost prohibitive, and good luck trying to get your HMO to pay for it.

Advertisment

Donate

ScienceBlogs is where scientists communicate directly with the public. We are part of Science 2.0, a science education nonprofit operating under Section 501(c)(3) of the Internal Revenue Code. Please make a tax-deductible donation if you value independent science communication, collaboration, participation, and open access.

You can also shop using Amazon Smile and though you pay nothing more we get a tiny something.

Science 2.0

Science Codex

More by this author

Genetic Future is moving

January 18, 2011

After a semi-hiatus due to various distractions, I'm about to restart blogging in earnest again over at the new home of Genetic Future on Wired Science. Please update your RSS feed: my new one is here. And a reminder: you can always keep track of new posts here as well as other nuggets of…

One more step towards the end of recessive diseases

January 13, 2011

In the last century infant mortality has declined precipitously in the Western world, thanks in large part to the development of antibiotics and vaccination. Yet as the suffering and death from infectious disease has reduced, the burden from genetic disease has become proportionately greater:…

New FireFox plugin for 23andMe customers

January 11, 2011

Software company 5AM Solutions has just launched a neat little FireFox plug-in for customers of consumer genomics company 23andMe. The idea is very simple: Download your raw data from 23andMe (or use one of the files from me or my colleagues at Genomes Unzipped); Install the plug-in from here…

Why you CAN have your $1000 genome - so long as you learn what to do with it

January 7, 2011

As part of his Gene Week celebration over at Forbes, Matthew Herper has a provocative post titled "Why you can't have your $1000 genome". In this post I'll explain why, while Herper's pessimism is absolutely justified for genomes produced in a medical setting, I'm confident that I'll be obtaining…

Bioscience Resource Project critique of modern genomics: a missed opportunity

December 15, 2010

Late last week I stumbled across a press release with an attention-grabbing headline ("The Causes of Common Diseases are Not Genetic Concludes a New Analysis") linking to a lengthy blog post at the Bioscience Resource Project, a website devoted to food and agriculture. The post, written by two…

What can you learn from a whole genome sequence?

More like this

Genetic Future is moving

One more step towards the end of recessive diseases

New FireFox plugin for 23andMe customers

Why you CAN have your $1000 genome - so long as you learn what to do with it

Bioscience Resource Project critique of modern genomics: a missed opportunity

Comments of the Week #71: From the Universe's age to the science of the CMB

The Math of the Fastest Human Alive

The Ice Cap is Melting and You Can Help