Small delays due to Big Genetics

By dgmacarthur on February 15, 2009.

[Added in edit in response to concerned emails: The original title was deliberately provocative, and contrary to the message in the text; I apologise for any misunderstanding. I've largely rewritten the post to make my point more clearly.]

One of the curious and paradoxical effects of Big Genetics projects like the 1000 Genomes Project - which plans to generate low-coverage whole-genome sequences for ~1,500 people by the end of this year, providing a map of human genetic variation of unprecedented resolution - is that while they considerably accelerate research in the long term, they can actually slightly delay some research projects in the short term.

Over the last six months I've heard four separate researchers note in presentations or conversations that they have abandoned large-scale sequencing projects performed as part of a larger association study, since any data generated by their sequencing would be made largely or entirely obsolete by 1000 Genomes. Essentially, the scientists are (understandably, given limited scientific resources) temporarily setting a project aside and waiting for the 1000 Genomes project to discover the variants they need, before resuming their gene discovery efforts.

In some cases 1000 Genomes will actually generate the data faster than the researchers could themselves, thus accelerating their research - but this is not always the case. For some projects, avoiding the duplication of work that is already being done by a Big Genetics consortium will actually delay their project, albeit usually only for a few months.

I gather from informal conversations I had at AGBT last week that previous Big Genetics projects had a similar effect: at least a few research groups held off disease gene mapping studies while they waited for the completion of the Human Genome Project, and some large-scale SNP discovery and genotyping studies were similarly put on hold while labs awaited data from the HapMap project.

Of course, I'm not arguing that this is in any way a reason not to do big genetics projects - there's no question that the overall scientific outcome of all of these projects far, far outweighs any costs due to short-term research delays. In addition, it's hard to see how this short-term effect could possibly be avoided - it's simply an inevitable consequence of any large-scale collaborative project generating a resource for the scientific community that would otherwise be constructed piecemeal by many separate research groups. (Nor am I saying, by the way, that 1000 Genomes could have been completed any faster than it has - I actually think the pace of progress has been astonishing, and its free release of data during the process has substantially reduced the likelihood of research delays.)

Finally, I think I need to emphasise that Big Genetics creates many opportunities for researchers to expand on the details of massive data-sets; I've said previously that "Big Genetics generates far more data than its participants can
ever hope to analyse themselves, and the hefty remainder is fodder for
a plethora of small labs exploring small but important facets of the
bigger picture."

Still, it makes me wonder how many researchers (and especially graduate students) have had their research suddenly change direction, while their main project is put on hold to await results from some collaborative behemoth. How many readers have experienced this themselves?

Subscribe to Genetic Future.

More like this

The promise and challenges of Big Genetics

Olivia Judson's blog has a guest post by Aaron Hirsh that got me thinking about a topic that will be familiar to most scientists: the transition of research towards Big Science. Big Science basically includes any project involving a large consortium of research groups working together on a tightly…

Why do genome-wide scans fail?

The successes of genome-wide association studies (GWAS) in identifying genetic risk factors for common diseases have been heavily publicised in the mainstream media - barely a week goes by these days that we don't hear about another genome scan that has identified new risk genes for diabetes, lupus…

To the barricades in defence of Big Genetics

Over at Gene Expression, p-ter has a post up defending the "big genetics" approach, noting that large-scale hypothesis-free genetics studies have consistently yielded important results for follow-up detailed fine-scale studies. It's a sound argument. I've argued in the past that many of the fears…

Peering into the Genetic Future: trends in human genomics in 2009

Well, it's a little late, but I finally have a list of what I see as some of the major trends that will play out in the human genomics field in 2009 - both in terms of research outcomes, and shifts in the rapidly-evolving consumer genomics industry. For genetics-savvy readers a lot of these…

Why is this a bad thing?
It's not like those researchers would now sit around and do nothing.
The researchers will instead put their resources somewhere else to work where they can hope for a better long term payoff.

While perhaps not the sort of massive project you describe, the PGP10 is certainly high profile. Despite reassurances of 'any day now' its been a long time since that data was produced, and yet most of it is still offline.

Interesting point which highlights the front line of the war between 'big science' and 'small science' advocates. I'm with you, in believing that these large collaborations are worthwhile and even necessary for the advancement of science.

WIth sequencing, in particular, there's something to be said for economies of scale, I have to imagine that the 1000 genomes project will get this data out more cheaply and with uniform standards, as opposed to 20 different labs doing it, each releasing data in a different format, with different quality metrics, etc. As a bioinformatician, I know which data set I'd rather work with.

I honestly think that we're better off having the grunt work (massive sequencing) done by specialized centers.

(Disclaimer: I'm at BCM, a major player in the 1000 genomes, but am not involved with the project)

Hi Christian,

I don't really think it's a bad thing overall (despite my deliberately provocative title).

You are correct that researchers will simply shift their focus onto other topics, although the end result may be substantial delays for individual projects.

Mainly, though, I just think it's an interesting effect, and I'm wondering how common it is.

Advertisment

Donate

ScienceBlogs is where scientists communicate directly with the public. We are part of Science 2.0, a science education nonprofit operating under Section 501(c)(3) of the Internal Revenue Code. Please make a tax-deductible donation if you value independent science communication, collaboration, participation, and open access.

You can also shop using Amazon Smile and though you pay nothing more we get a tiny something.

Science 2.0

Science Codex

EPA Reconsiders Its Biden Ban On Asbestos Everywhere

More by this author

Genetic Future is moving

January 18, 2011

After a semi-hiatus due to various distractions, I'm about to restart blogging in earnest again over at the new home of Genetic Future on Wired Science. Please update your RSS feed: my new one is here. And a reminder: you can always keep track of new posts here as well as other nuggets of…

One more step towards the end of recessive diseases

January 13, 2011

In the last century infant mortality has declined precipitously in the Western world, thanks in large part to the development of antibiotics and vaccination. Yet as the suffering and death from infectious disease has reduced, the burden from genetic disease has become proportionately greater:…

New FireFox plugin for 23andMe customers

January 11, 2011

Software company 5AM Solutions has just launched a neat little FireFox plug-in for customers of consumer genomics company 23andMe. The idea is very simple: Download your raw data from 23andMe (or use one of the files from me or my colleagues at Genomes Unzipped); Install the plug-in from here…

Why you CAN have your $1000 genome - so long as you learn what to do with it

January 7, 2011

As part of his Gene Week celebration over at Forbes, Matthew Herper has a provocative post titled "Why you can't have your $1000 genome". In this post I'll explain why, while Herper's pessimism is absolutely justified for genomes produced in a medical setting, I'm confident that I'll be obtaining…

Bioscience Resource Project critique of modern genomics: a missed opportunity

December 15, 2010

Late last week I stumbled across a press release with an attention-grabbing headline ("The Causes of Common Diseases are Not Genetic Concludes a New Analysis") linking to a lengthy blog post at the Bioscience Resource Project, a website devoted to food and agriculture. The post, written by two…