The stochasticity of genetic drift

I have spoken of the probability of extinction and the rate of substitution once past extinction, but now to something more prosaic, genetic drift. My post is based on John Gillespie's treatment in Evolutionary Genetics: Concepts & Case Studies. Like R.A. Fisher he does not think much of this process in evolutionary dynamics, and deemphasizes its salience. The easiest way to think of drift is simply as sample variance over generations, the expected deviation from the mean as one moves through time. In a diallelic model the deviation between generation n and generation n + 1 is:

σ = √(pq)/(2N), where

p = allele 1
q = allele 2 (and is 1 - p)
N = population size

From this, you can see that as N → ∞, σ → 0. The larger the population size the lower the proportionate sample variance across generations. Intuitively this should make sense, you know that the more data points you have from a population the more likely you are to approximate its true character. The more flips of a coin you make the closer your will approach to a 50/50 heads vs. tails rate over all of your tosses.

The formalism above implies that the power of drift is inversely proportional to the size of the population. Sewall Wright showed that the rate of the loss of genetic variation is proportional to 1/(2N), following the equation above. As the population increases the rate of the loss of genetic variation decreases as the gene pool is less significantly buffeted by the stochastic influence of random genetic drift. Gillespie makes much of the fact that the decrease in heterozygosity of a population between generations is:

ΔH = (σ2/N)H

Where σ2 is variance (the square of the deviation above), and N is the population size and H heterozygosity. What is heterozygosity? Simply stated is the fraction of individuals who are heterozygous at a locus. In the model above, that means an individuals who has allele 1 and allele 2, the 2pq in the Hardy-Weinberg Equilibrium. In a two allele system eterozygosity is maximized when p = q, and since it is a diallelic model it follows that p and q are both 0.5.1 Any deviation from this equilibrium reduces heterozygosity, and as we know that genetic drift shifts the frequencies away from their initial state, either toward fixation (100%) or extinction, H is always hurtling downward sans other evolutionary dynamics such as selection (e.g., heterosis) or mutation to maintain or replenish variation.

Compared to the earlier processes Gillespie, like Fisher, maintains that genetic drift driven by population size is a relatively weak long term influence on evolutionary dynamics. The probability of extinction in one generation is 1/3, but the time for a neutral mutation to move to fixation is 4N generations! A lot happens between t = 0 and t = 4N....

Addendum: Please play with this random genetic drift simulator if you haven't before. It'll give you a general "feel" for how powerful (or not) this force is depending upon the parameters you pop into it.

Update: In response to RPM's comment, yes, I'm assuming HWE, random mating and all that. In response to Larry's comment, I do not deny the relevance of the Neutral Theory (and model) on the molecular scale, I think Gillespie's point would be that "boundary process" (i.e., when a mutation is at very low frequency in generation t < 10) is far more important than simply random genetic drift over the long term in terms of stochasticity.

Note: Adapted from chapter 5 of Evolutionary Genetics: Concepts & Case Studies.

1 - Confirm it numerically using a Hardy-Weinberg model, or, just think of it as calclus where f(x) = 2x(1-x), where x = p, and set f(x)' = 2 - 4x, and when the rate of change is 0 you get x = 0.5.

Tags

More like this

Previously I discussed the probability of extinction across one generation for a new mutant allele. To review, there is ~1/3 chance that a new mutant will go extinct within one generation of its origination (i.e., a de novo mutation is not replicated and transmitted to the next generation of…
One issue that has cropped up in the comments a few times here is a conflation between quantitative & population genetics. Though people seem to think they're interchangeable terms, they're distinct fields. That's why population genetics text books have chapters devoted specifically to…
Over the past month or so I've been blogging chapter 5 of Evolutionary Genetics: Concepts & Case Studies. This chapter covers "stochastics processes," basically the random elements in the flux of gene frequencies in biological populations. Now, I'm a selection man for real, but to understand…
One of the classic ways to maintain genetic variation with a population is "overdominance," in short, a state where heterozygotes exhibit greater fitness than the homozygote genotypes. Imagine for example a locus, A, with two alleles, A1 & A2. Now, assume the fitness is distributed like so…

To Chet :)

- thought I'd direct you to a recent New Scientist article that may interest you. "You are what your grandmother ate" http://www.newscientist.com/article/dn10518-you-are-what-your-grandmoth…

"A mother's diet can change the behaviour of a specific gene for at least two subsequent generations, a new study demonstrates for the first time.

Feeding mice an enriched diet during pregnancy silenced a gene for light fur in their pups. And even though these pups ate a standard, un-enriched diet, the gene remained less active in their subsequent offspring. ..."

By dcmichelle (not verified) on 15 Nov 2006 #permalink

What is heterozygosity? Simply stated is the fraction of individuals who are heterozygous at a locus.

Yeah, but what people usually measure is the expected heterozygostiy (H) assuming HWE. In that case H=SUM(pi^2). That's the sum of the squares of all allele frequencies.

When you examine aligned orthologus sequences and compare them with the known three-dimensional structure of a protein is becomes clear that the vast majority of amino acid substitutions are neutral or nearly neutral. In fact, it's hard to find any substitutions that are advantageous.

The implications are obvious. The vast majority of evolutionary events involve the fixation of alleles by random genetic drift and not by natural selection.

How is this possible if random genetic drift is only significant in small populations? It's because large populations have a lot more mutations, most of which are neutral or nearly so. These segregating alleles account for most of the heterogeneity in populations.

While the probability of fixation by random genetic drift of any one allele is quite small in a large population, the frequency of mutation in such a population makes it highly probable that some alleles will be fixed.

Incidently, the title of your post is a bit misleading. I hope you don't mean to imply that chance plays no role in natural selection?

One should also be aware of the fact that random genetic drift covers processes other than just chance segregation of alleles in each generation (of sexual organisms). It can also refer to chance events in the environment and things like the founder effect [Random Genetic Drift]

By Larry Moran (not verified) on 15 Nov 2006 #permalink

Yeah, but what people usually measure is the expected heterozygostiy (H) assuming HWE. In that case H=SUM(pi^2). That's the sum of the squares of all allele frequencies.

tx. i'll add that assumptions in the addendum after work.

You provided a minor Eureka! for moi.

Great job in explaining WHY a smaller population size in a breakaway or pushed away or isolate group is more likely to have more drift.

Knew it was from Cavalli-Sforza e.g. in the History and Geography of Human Genes, but you just did a much better job of explaining why.