Now on ScienceBlogs: Q: How do you sex a Smilodon? (A: Very carefully)

Seed Media Group

Genetic Future

Commentary on human genetics and evolution, direct-to-consumer genetic testing, and the personal genomics industry.

Search

Profile

Daniel MacArthur
I write about the genetic and evolutionary basis of human variation, and the companies trying to sell you information about your genome.

Subscribe via RSS.
Follow me on Twitter.

Recent Posts

Recent Comments

Archives

Blogs I read:

Consumer Genomics:

Genomic Science:

Genetics/Evolution Blogs:

General Science:

Corporate Blogs:

Skeptics:

« David Goldstein on the failures of genome-wide association studies | Main | Google co-founder at increased risk of Parkinson's, according to 23andMe »

HapMap phase 3 data available for browsing

Category: big genetics
Posted on: September 19, 2008 9:10 AM, by Daniel MacArthur

hapmap3.jpg

This will probably only be of interest to population genetics afficianados, but I just noticed that the HapMap project has made its phase 3 data available through its browser (the data were previously available for download, but are much more accessible - especially to non-bioinformaticians - through the browser interface).

The HapMap project is a massive international collaboration collecting information on common sites of genetic variation (called single nucleotide polymorphisms, or SNPs) in anonymised individuals from a variety of human populations. Phase 3 has data on about 1.5 million genetic markers for 1,115 individuals from 11 populations. That's substantially fewer markers than in earlier phases of the HapMap project, but on a hugely expanded set of samples (the original HapMap data-set contained information on just 270 individuals from 4 populations). Of particular interest are three additional populations with African ancestry (Luhya and Maasai from Kenya, and African-Americans collected in southwest USA), given the exceptionally high level of genetic diversity in African groups relative to other human populations.

This is still very much a rough draft of the catalogue of human genetic diversity, sampling just a tiny fraction of our species' populations and being restricted only to common genetic variants. Extending the catalogue to include rare variants will require whole-genome sequencing of much larger samples - work that is currently being kick-started by the ambitious 1000 Genomes Project.

The breakdown of the analysed samples in the phase 3 HapMap data-set is below the fold...

Number 	Code	Population
71 ASW African ancestry in Southwest USA
162 CEU Utah residents with European ancestry
82 CHB Han Chinese in Beijing, China
70 CHD Chinese in Metropolitan Denver, Colorado
83 GIH Gujarati Indians in Houston, Texas
82 JPT Japanese in Tokyo, Japan
83 LWK Luhya in Webuye, Kenya
71 MEX Mexican ancestry in Los Angeles, California
171 MKK Maasai in Kinyawa, Kenya
77 TSI Toscani in Italia
163 YRI Yoruba in Ibadan, Nigeria

Share this: Stumbleupon Reddit Email + More

Comments

1

Daniel,
That high risk is strictly for Berbers and Jews! Not Northern Europeans!
We can't keep perpetuationg that risk!
-Steve
www.thegenesherpa.blogspot.com

Posted by: Steven Murphy | September 20, 2008 8:39 AM

Post a Comment

(Email is required for authentication purposes only. On some blogs, comments are moderated for spam, so your comment may not appear immediately.)





ScienceBlogs

Search ScienceBlogs:

Go to:

Advertisement
Follow ScienceBlogs on Twitter
Visit the Collective Imagination blog
Advertisement

© 2006-2009 Seed Media Group LLC. ScienceBlogs is a registered trademark of Seed Media Group. All rights reserved.

Sites by Seed Media Group: Seed Media Group | ScienceBlogs | SEEDMAGAZINE.COM