Population genomics & the coalescent

Blogger Thomas Mailund is an author on a new paper, Ancestral Population Genomics: The Coalescent Hidden Markov Model Approach:

With incomplete lineage sorting (ILS), the genealogy of closely related species differs along their genomes. The amount of ILS depends on population parameters such as the ancestral effective population sizes and the recombination rate, but also on the number of generations between speciation events. We use a hidden Markov model parametrized according to coalescent theory in order to infer the genealogy along a four-species genome alignment of closely related species, and estimate population parameters. We analyze a basic, panmictic demographic model and study its properties using an extensive set of coalescent simulations. We assess the effect of the model assumptions, and demonstrate that the Markov property provides a good approximation to the ancestral recombination graph. Using a too restricted set of possible genealogies, necessary to reduce the computational load, can bias parameter estimates. We propose a simple correction for this bias, and suggest directions for future extensions of the model. We show that the patterns of ILS along a sequence alignment can be recovered efficiently together with the ancestral recombination rate. Finally, we introduce an extension of the basic model that allows for mutation rate heterogeneity, and reanalyze Human-Chimpanzee-Gorilla-Orangutan alignments using the new models. We expect that this framework will prove useful for population genomics and provide exciting insights into genome evolution.
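
To make the core idea concrete, here is a toy sketch of a hidden Markov model whose hidden states are genealogies along an alignment. It is not the paper's model. The three states (which pair of Human, Chimpanzee, Gorilla coalesces first), the site-pattern symbols, the function names `make_model` and `viterbi`, and every probability are illustrative assumptions. In the paper the parameters are derived from coalescent theory and estimated from data, while here they are simply made up, and decoding uses the standard Viterbi algorithm.

```python
import math

# Toy hidden states: which pair coalesces first at a site (assumed, not from the paper).
STATES = ["HC", "HG", "CG"]
# Toy observations: which pair shares a derived allele, or "-" for an uninformative site.
SYMBOLS = ["HC", "HG", "CG", "-"]


def make_model(stay=0.98, support=0.10, noise=0.01):
    """Build start, transition and emission tables from made-up numbers."""
    switch = (1.0 - stay) / (len(STATES) - 1)
    trans = {a: {b: (stay if a == b else switch) for b in STATES} for a in STATES}
    emit = {}
    for s in STATES:
        # Sites supporting the state's own topology are more likely than the others.
        row = {sym: noise for sym in SYMBOLS if sym != "-"}
        row[s] = support
        row["-"] = 1.0 - sum(row.values())  # remaining mass goes to uninformative sites
        emit[s] = row
    start = {s: 1.0 / len(STATES) for s in STATES}
    return start, trans, emit


def viterbi(obs, start, trans, emit):
    """Return the most probable genealogy path for a sequence of site patterns."""
    score = {s: math.log(start[s]) + math.log(emit[s][obs[0]]) for s in STATES}
    back = []
    for sym in obs[1:]:
        new_score, pointers = {}, {}
        for t in STATES:
            # Best previous state from which to enter state t.
            best = max(STATES, key=lambda s: score[s] + math.log(trans[s][t]))
            new_score[t] = score[best] + math.log(trans[best][t]) + math.log(emit[t][sym])
            pointers[t] = best
        score = new_score
        back.append(pointers)
    # Trace back from the best final state.
    path = [max(STATES, key=lambda s: score[s])]
    for pointers in reversed(back):
        path.append(pointers[path[-1]])
    path.reverse()
    return path
```

Calling `viterbi(obs, *make_model())` on a list of site patterns returns one genealogy label per site. A high `stay` probability plays the role of a low recombination rate in this toy version: the inferred genealogy only switches when several consecutive informative sites support a different topology.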

A model is just a model. Give us some data!