Ivory & teak, how related are South Asians & Europeans?

The post yesterday about the deletion which results in heart disease later in life had some interesting ancestry related material. This makes sense, the genetic maps which I post on now and then ultimately have a medical rationale behind them; eliminate population structure so that you don't have spurious correlations confusing you when you try and get a fix on the genetic underpinnings of a disease. By example, consider a study with cases & controls, and individuals with the trait or disease have five times the likelihood of carrying a particular allele at a particular gene. But you look more closely, and you see that if you control for race this doesn't hold, in fact you are just picking up the fact that one population has a greater propensity for the trait or disease as well as the reality that populations differ in allele frequencies on many genes. The old chestnut about correlation not equaling causation applies here. But, causation can ascertained using correlation as a precondition. Eliminating cryptic population substructure with ancestrally informative markers (AIMs) is the way you would do this, so that there's nothing in the genetic background confounding the associations you pick up.

In any case, the supplementary material has some graphs that I thought would be of interest.

i-f552ea826082ad1a5278e576571482d3-indiansepI.jpg

As you can see, the North Indians, Central Indians and South Indians overlap, though there are broad differences. Those differences, not surprisingly, follow geography. The genetic data here also aligns with our intuition derived from visual cues; Punjabis and Bengalis, for example, differ in appearance, but not so much that a minority of Punjabis might not be confused for a minority of Bengalis and vice versa. In contrast, a set of ethnic Swedes and south Sudanese will look so different that one can assume perfect sorting just on visual cues alone. While Swedes and Sudanese are disjoint on a host of skin color genes (e.g., SLC24A5) just as they are in complexion, the different South Asian groups overlap. Bengalis are far darker skinned on average than Punjabis, but the two groups overlap physically and genetically on a trait like skin color.

But here's the important figure:

i-129b5155ce44c44a65f5495f108005c2-indiansepII.jpg

CEU represents the northern European sample used tin the HapMap. These are Mormons from Utah, disproportionately British Isles, Scandinavian and German. Unlike the previous maps I've posted which depended on hundreds of thousands of genes to construct a plot of genetic variation, this uses only 50 markers. As you can see 50 genes are all you need to perfectly separate northern Europeans from South Asians. If you had one gene, like SLC24A5, you would have a harder time. Europeans are fixed for the derived variant that results in light skin, but among South Asian populations the frequencies vary from around 50% (in the south and east) to 85% (in the north and west) for the derived variant. In other words, looking at SLC24A5 alone you could only identify a minority of South Asians in a mixed population of South Asians and Europeans (on the order of 1/4). The rest of the population would be a mix of Europeans and South Asians. But, add dozens of more genes which vary between the two populations and they separate out nice & cleanly.

The rank order of genetic affinity matches our intuition, South Asians from the north are closer to Europeans than South Asians from the south. Unfortunately for the egos of many North Indians they are closer to South Indians than they are to white people. It is sometimes difficult to explain to Punjabis and Kashmiris who pride themselves on the European appearance (especially Kashmiris, who admittedly can "pass" for the superior race quite often!) which is more considered more attractive by South Asians that though they may sometimes look more European than the typical South Asian, that's just due to the variance on the small subset of genes which construct physical appearance. On the totality of their genes South Asians without recent European ancestry are more likely to be like other South Asians than Europeans.

Note: The supplementary information has the region and in some cases caste backgrounds for the samples. So of the Kashmiris, all are Pandits. Also, I do see that the South Asians with the lowest values on the horizontal dimension are closer to the Europeans with the highest values than they are to the South Asians with the lowest values.

Update: Some are asking about how South Asians relate to Europeans in the context of East Asians. This is not a trivial question because the slight majority of South Asian mtDNA lineages cluster with those of East as opposed to West Eurasia. But in regards to total genome content I think Li et. al. are close to the mark when they looked at the HGDP populations. The only really representative South Asian population are Sindhis, the other groups are relatively marginal or atypical. But, Sindhis are probably genetically interchangeable with the Punjabi sample which was included above, so I think I can fairly assume that the other populations wouldn't be much further than where I approximately placed them.

i-2d1fa038b37277776608ca29baa1f2ae-populationsmyer.jpg

Also, just a note, the outliers for Central and South Asians are the Hazaras and Makranis. These are two groups with known East Asian and African ancestry respectively.

Tags

More like this

Over the past few weeks I've been looking closely at all the skin color related genes in humans which have been studied over the past few years. A little over two years ago the evolutionary biologist Armand Leroi wrote: We don't know what the differences are between white skin and black skin,…
Why are brown people so many shades of brown? If you were raised in a South Asian family I'm sure that you've had to deal with the "color" issue somehow. This isn't a cultural blog, so I'm not going to go there, but I do think that the salience of complexion in South Asian culture makes this new…
From Different Matrilineal Contributions to Genetic Structure of Ethnic Groups in the Silk Road Region in China: Although our samples were from the same geographic location, a decreasing tendency of the western Eurasian-specific haplogroup frequency was observed, with the highest frequency present…
One of the main facts about American life is hypodescent, "the practice of determining the lineage of a child of mixed-race ancestry by assigning the child the race of his or her more socially subordinate parent." Barack Obama & the Kenyan politician Raila Odinga (who, probably falsely, claims…

My impression is that Indians are closer to Europeans than they are to Chinese, for instance.

I set up an account with 23andME and had my genome test done there. They have an autosomal map arranged by the major continental populations and I fall unsurprisingly within the European cluster.

223andme also has fairly good coverage of India, with the following samples: Burusho, Pathan, Sindhi, Balochi, Brahui, and Makrani.

By Justin Loe (not verified) on 19 Jan 2009 #permalink

You should do one of how closely East Asians are related to South Asians

The last graph seems to validate the classical three races of anthropology as distinct clusters, doesn't it?

Would it be a fair interpretation of the last map -- the map with samples for Africa, Middle_East, Europe, C_S_Asia, Oceania, and America -- to say that Europe has some "East Asian" admixture, given that the European sample spikes towards the "East Asian" clusters?

Or is something like that just due to genetic drift?

(Because it seems to be similar to how the Middle_East cluster spikes towards the African cluster.)

charles,

there's historically attested african admixture in middle eastern groups relatively recently due to the slave trade (as evidenced by mtDNA which is found in muslims, but not xtians or jews, in that region). but for europeans i think the most plausible explanation is that there is a geographical cline. browns are between middle easterners and europeans. no surprise, middle easterners are in the middle. geography is important. history is secondary, though not trivial.

Rotate the map clockwise about 30 degrees and you get a pretty good map of the world. Though America and possibly Oceania (does it include Australia?) are in the wrong place - not surprisingly. I find the huge gap of the Sahara interesting, as well as the sweep of central Asia connecting Europe and East Asia.

The last graph seems to validate the classical three races of anthropology as distinct clusters, doesn't it?

Of course it does - it includes discrete samples from widely separated regions. No surprise that they cluster so strongly.

Make the samples more continuous by sampling more regularly over space and the "clusters" may well merge seamlessly into each other. E.g. including Ethiopians as Africans (rather than Western Africans) would strongly reduce the separation between Africans and Europeans. Just like, as you can see, including a whole host of Central Asians (though markedly biased towards "white" populations, except for Uighurs) has tended to blur the distinction between Asians and Europeans.

Make the samples more continuous by sampling more regularly over space and the "clusters" may well merge seamlessly into each other.

Yep, though even if that is the case, Europeans and South Asian populations (for example) would remain closer to one another than would be expected from the ratio of geographic to genetic distance between, e.g. European and West African populations or Europeans and East Africans. I think it's important to stress this as all too often it seems that that a notion that there is a ratio between genetic and geographic distance that remains constant between any two populations on Earth seems to crop up whenever simplified approximations of population structure that use the idea of race are rejected.

Accessibility is dependent not just on distance, but on travel time and disease as well. Until fairly recently sub-Saharan Africa was hard to reach and home to diseases the outside world had no effective treatments for. In contrast South Asia was relatively easy to reach from Europe, and as a result diseases as well as knowledge and goods were more easily shared starting rather early.

Thus demographic distance and geographic distance don't always correspond.

Looking at the last diagram, would it be fair to conclude that South Asians (SA) and Middle Easterners (ME) are equidistant from Europeans (EU)? That is, is the genetic "distance" between ME and EU groups equal to the genetic "distance" between SA and EU groups?

Do EU, SA, and ME groups then form the "vertices" of a equilateral triangle?

By Leo Caddy (not verified) on 30 Apr 2009 #permalink