Large-scale differences in human genomes

Razib points me to a great plain-language article reviewing our current scientific understanding of human genetic variation.

The major focus is on copy-number variants (CNVs) - genetic variants involving the insertion or deletion of large chunks of DNA, sometimes spanning over a million bases. These large-scale variants lurked essentially unknown within the human genome until the recent advent of chip-based platforms, which make it possible to very rapidly assay almost the entire genome for their presence. Such surveys have revealed that large-scale CNVs are far more common than anyone expected - 65 to 80 percent of the population carries copy number variants affecting over 100,000 bases.

From my point of view, one of the more striking findings of CNV research over the last couple of years has been a somewhat disappointing one: there have been surprisingly few convincing, well-replicated associations between CNVs and common disease. Sure, a few signals from genome-wide association studies for obesity and auto-immune diseases have turned out to be linked to common CNVs, and there have been a rash of papers proposing links between very rare CNVs and psychiatric diseases such as schizophrenia and autism, but the yield has been lower than I would have expected given that these things are large (often affecting one or more genes) and remarkably common.

I suspect to some extent this is due to the relative immaturity of technology for assaying these variants: although platforms for surveying single-base genetic variants are now extremely accurate, current methods still struggle both to find CNVs and to unambiguously determine how many copies of a particular CNV an individual is carrying - particularly in cases of complex genetic rearrangements. That is already changing with the latest generation of chips, and will change even further once cheap long-read DNA sequencing becomes available from emerging third-generation sequencing technologies (e.g. Pacific BioSciences or Oxford Nanopore). No doubt examples of disease-associated CNVs will begin to mount up as technology improves.

Anyway, kudos to Tina Hesman Saey for a very accessible and timely review of an important topic.

 Subscribe to Genetic Future.

More like this

Wellcome Trust Case Control Consortium. (2010). Genome-wide association study of CNVs in 16,000 cases of eight common diseases and 3,000 shared controls Nature, 464 (7289), 713-720 DOI: 10.1038/nature08979 The Wellcome Trust Case Control Consortium has just published the results of a massive…
BioArray News (subscription required) reports that genomic analysis technology provider Illumina has launched a new family of genotyping chips designed to simultaneously assay 4 million sites of variation in the human genome. The chips are a major step up from the 1-million-feature chips that…
This little USB drive represents the current pinnacle of luxury personal genomics. It's the product of Knome (pronounced "know me"), a Cambridge, MA-based biotech start-up fronted by genomics pioneer George Church (recently profiled in Wired). In return for $350,000, Knome's customers receive a…
The successes of genome-wide association studies (GWAS) in identifying genetic risk factors for common diseases have been heavily publicised in the mainstream media - barely a week goes by these days that we don't hear about another genome scan that has identified new risk genes for diabetes, lupus…