Graph theory & selection

By razib on January 6, 2008.

A few days ago I posted about selection and population structure. The basic idea is to imagine demes, or breeding populations, and consider how variation in standard parameters such as the selection coefficient and migration rate might affect the overall frequencies of the alleles. The paper, Fixation Probability and Time in Subdivided Populations, was rather "old school" despite the recourse to simulation. It emerged out of the theoretical population genetic tradition of R.A. Fisher & Haldane, and their successors starting with Kimura. It utilized diffusion equations and rested upon standard evolutionary genetic models of population structure such as the "Stepping-Stone." Today I'm going to take a different tack, standing upon the shoulders of our friend Martin Nowak, and his foray into Graph Theory. This post is derived in large part from his paper Evolutionary Dynamics on Graphs, as well as the equivalent chapter in his book Evolutionary Dynamics.

Nowak isn't focusing on demes per se here; rather, the nodes or vertices within the network can be thought of as individuals, or points from which the mutation might emerge. A background assumption here is that you're reasonably familiar with the Moran process and linear algebra, but if you aren't you can hum through pretty easily I think. There aren't any major algebraic manipulations here anyway.

This is figure 1 b. You see the vertices in blue, while the directional arrows represent "edges." The matrix to the right is a stochastic matrix, so each row holds probabilities which sum to 1. Each element is w_ij, where i is the row and j is the column. The probabilities represent the likelihood that the offspring of individual i will replace individual j. Many of the elements, as you can see, are 0 because i's offspring cannot "replace" itself (so the top leftmost element would be w₀₀), and some vertices do not have edges leaving or entering. If you want to know the relevance of these probability matrices to evolutionary processes over time I suggest you consult the notes (also see Markov process). I just want you to keep in mind that Nowak's paper is focusing on the networks and their concomitant probability matrices.

The above is from figure 2 of the paper, and it shows a number of graphs which I will give a quick overview of. But first, an equation:

ρ₁ = (1 - 1/r)/ (1 - 1/r^N)

This represents the probability of fixation of a new mutant in a population governed by the Moran process, where N remains fixed across generations and during each generation one individual is selected with a probability proportional to its fitness, r, to produce an offspring which will replace a randomly chosen individual (you can see how this relates to the matrix above). The population is also homogeneous. Note that there is a chance of fixation, governed by the nature of r and N; but as in 2s there is a role for both deterministic selection and various stochastic factors (e.g., drift).
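
If you'd rather check the formula than trust it, here is a minimal sketch in Python. It is my own illustration, not code from Nowak's paper: the function names, the trial count and the random seed are all made up for the example, and the simulation assumes the plain well-mixed Moran process described above (the individual to be replaced is drawn uniformly from the whole population).

```python
import random


def rho_1(r, N):
    """Closed-form fixation probability of one mutant of relative fitness r
    in a well-mixed Moran population of constant size N."""
    if r == 1.0:
        return 1.0 / N  # neutral limit of the formula
    return (1 - 1 / r) / (1 - 1 / r**N)


def simulate_moran(r, N, trials=20000, seed=0):
    """Monte Carlo estimate of the same quantity (hypothetical helper)."""
    rng = random.Random(seed)
    fixed = 0
    for _ in range(trials):
        m = 1  # current number of mutants
        while 0 < m < N:
            # The reproducing individual is chosen in proportion to fitness.
            mutant_reproduces = rng.random() < (r * m) / (r * m + (N - m))
            # The individual to be replaced is chosen uniformly at random.
            mutant_replaced = rng.random() < m / N
            m += int(mutant_reproduces) - int(mutant_replaced)
        fixed += (m == N)
    return fixed / trials


if __name__ == "__main__":
    print("formula:   ", rho_1(1.1, 10))
    print("simulation:", simulate_moran(1.1, 10))
```

The two numbers should agree up to sampling noise; setting r = 1 in the formula gives back the neutral 1/N.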

But the equation above applies to more than homogeneous (that is, panmictic) populations. In figure 2, graphs a, b and c have the same fixation probability as a homogeneous population, ρ₁. This is because W, the stochastic matrix, is symmetric. Additionally, if T, 'temperature,' is the same for all the vertices then the probability of fixation is ρ₁ as well. Temperature basically measures the total weight of the edges going into a vertex, or, T_j = Σ_i w_ij. 'Hot' vertices, in orange above, are often replaced, while 'cold' ones, blue, are not. Graphs where all vertices have equal temperature are termed 'isothermal.' Graph d in the figure above shows a non-symmetric, but isothermal, network where the probability of fixation is ρ₁.
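
A small sketch of the temperature bookkeeping, in Python. I'm assuming here that the temperature of vertex j is the sum of the weights on the edges entering it, that is, the j-th column sum of W, which is how I read the paper. The two example graphs (a directed cycle and a small star) and the helper names are my own choices for illustration, not the graphs from figure 2.

```python
def temperatures(W):
    """Temperature of each vertex: sum of the weights of incoming edges,
    i.e. the column sums of the row-stochastic matrix W (assumed definition)."""
    n = len(W)
    return [sum(W[i][j] for i in range(n)) for j in range(n)]


def is_isothermal(W, tol=1e-12):
    """True if every vertex has the same temperature."""
    T = temperatures(W)
    return all(abs(t - T[0]) < tol for t in T)


# Directed cycle 0 -> 1 -> 2 -> 3 -> 0: W is not symmetric, yet every
# vertex receives total weight 1.
cycle = [
    [0, 1, 0, 0],
    [0, 0, 1, 0],
    [0, 0, 0, 1],
    [1, 0, 0, 0],
]

# Small star: vertex 0 is the hub, vertices 1..3 are the leaves.
star = [
    [0, 1 / 3, 1 / 3, 1 / 3],
    [1, 0, 0, 0],
    [1, 0, 0, 0],
    [1, 0, 0, 0],
]

if __name__ == "__main__":
    print(temperatures(cycle), is_isothermal(cycle))
    print(temperatures(star), is_isothermal(star))
```

The cycle comes out isothermal even though its matrix is not symmetric, while in the star the hub is hot and the leaves are cold.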

Obviously this is kind of a boring result, but not all roads lead to ρ₁. Look at graphs f & g; their probability of fixation is 1/N. Why? This is pretty clear verbally: because of the nature of the edges, unless the mutant starts in the cold position it can't sweep through the population. The chance of a mutant occurring in the cold position is...you guessed it, 1/N. This is the familiar probability of fixation of a neutral allele. All you learn here is that some population structures, graphs, can theoretically prevent selection from fixing an allele. Additionally, if there are multiple cold positions upstream of a large number of hot positions then the probability of fixation is 0, since obviously a mutant in one cold position can never penetrate another.
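
The verbal argument can be checked by brute force. Below is a rough Python sketch of a Moran process on an arbitrary weighted graph, run on a one-way line 0 → 1 → ... → N-1. Everything here is my own construction for illustration: the function names, the choice to place the initial mutant uniformly at random, and the convention that a vertex with no outgoing edges simply wastes its turn (the last vertex on the line has nowhere to send offspring) are assumptions, not details taken from the paper.

```python
import random


def directed_line(n):
    """One-way line: vertex i only sends offspring to vertex i + 1."""
    return [[1.0 if j == i + 1 else 0.0 for j in range(n)] for i in range(n)]


def fixation_on_graph(W, r, trials=5000, seed=0):
    """Estimate the fixation probability of a single mutant of fitness r
    placed on a uniformly random vertex of the graph with weights W."""
    rng = random.Random(seed)
    n = len(W)
    fixed = 0
    for _ in range(trials):
        mutant = [False] * n
        mutant[rng.randrange(n)] = True
        count = 1
        while 0 < count < n:
            # Pick the reproducing vertex in proportion to fitness.
            fitness = [r if mutant[i] else 1.0 for i in range(n)]
            i = rng.choices(range(n), weights=fitness)[0]
            if sum(W[i]) == 0:
                continue  # assumed convention: no outgoing edge, nothing happens
            # Pick the vertex to be replaced according to row i of W.
            j = rng.choices(range(n), weights=W[i])[0]
            if mutant[j] != mutant[i]:
                count += 1 if mutant[i] else -1
                mutant[j] = mutant[i]
        fixed += (count == n)
    return fixed / trials


if __name__ == "__main__":
    n = 6
    # Even a mutant with double the resident fitness should fix only
    # when it happens to start at the head of the line.
    print(fixation_on_graph(directed_line(n), r=2.0), "vs 1/N =", 1 / n)
```

Changing r should leave the estimate hovering around 1/N, which is the whole point: on this structure selection has nothing to work with.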

But enough with throwing cold water on selection. How about structures which amplify the probability of fixation? You know you want some of that! Above is figure 3 from Nowak's paper. Pretty, huh?

The fixation probability for graph a, the "star structure," is:

ρ₂ = (1 - 1/r²) / (1 - 1/r^(2N))

Since r spans 0 to ∞, with 1 being population mean fitness, any beneficial mutation is amplified to r². For example, 1.1 is converted to 1.21 as r² (note that everything else remains as in ρ₁). If you look at the network the power of selection to take over this network is pretty obvious, the central node acts as a mediator across the population.

But things really get going when we hit graph b, c and d, the "super star," "funnel" and "meta-funnel." Here's their fixation probability:

ρ_K = (1 - 1/r^K) / (1 - 1/r^(KN))

K is the number of leaves. The star structure has 2, the latter three structures 3. The important thing about these amplifiers is that as N → ∞ the probability of fixation of a beneficial allele converges upon 1 - 1/r^K, which itself approaches 1 as K grows! That means that with large enough K and N a beneficial allele will fix, and a disadvantageous allele be eliminated! OK, back to earth. It's just a model...reality isn't a Moran process or perfectly defined by Graph Theory. But in any case, Nowak observes that these amplifying structures tend to have a few primary nodes which serve as shuttles for beneficial alleles. Good to know.
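
To get a feel for the numbers, here is a quick Python sketch that just evaluates the formula as written above, with K = 1 giving back ρ₁ and K = 2 the star. The function name and the particular values of r, N and K are mine, picked purely for illustration.

```python
def rho_K(r, N, K=1):
    """Evaluate rho_K = (1 - 1/r^K) / (1 - 1/r^(K*N)) as written in the post."""
    if r == 1.0:
        return 1.0 / N  # neutral limit, independent of K
    return (1 - r ** (-K)) / (1 - r ** (-K * N))


if __name__ == "__main__":
    for r in (1.1, 0.9):  # one beneficial and one deleterious mutant
        for K in (1, 2, 3):
            values = [rho_K(r, N, K) for N in (10, 100, 1000)]
            print(f"r={r} K={K}", ["%.3g" % v for v in values])
    # For fixed K and r > 1 the large-N value is 1 - r**(-K).
    print("limit for r=1.1, K=3:", 1 - 1.1 ** (-3))
```

For r > 1 each step up in K raises the fixation probability, and for r < 1 each step lowers it, which is what "amplifier" means here. Note that very large N with r < 1 will overflow a float in this naive version.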

So what does this tell us? The easiest way to imagine this is that there are individuals. But what if it is interdemic competition? In other words, a competition, extinction & replacement meta-population model. And could this apply to gene flow even without replacement (let's break out of the Moran process derived box for a moment)? Perhaps there are particular dynamics at work when there is asymmetrical gene flow, when the structure of demes is irregular, and so forth. Most readers know enough data that they could produce many conjectures trying to fit the data and theory together....

Reference: Erez Lieberman, Christoph Hauert, & Martin A. Nowak, Evolutionary dynamics on graphs, Nature 433, 20 January 2005

Razib, I'm not sure I understand what you are getting from these graphs...

It appears that these graphs represent substructure population advantages not related to genetics. E.g., over time the children of the town's wealthiest family may displace children of the poorer families. (See model F in figure 2.) The link weights represent the advantage of being born into the wealthy family. Then any mutation (not just a beneficial mutation) that originates in the wealthy family is more likely to fixate than one that originates in a poor family.

Other examples of such non-symmetric flow would be noble vs serfs, town vs. countryside, trade-route-nexus village vs. nearby villages, inhabitants of fertile land vs. non-fertile land.

I don't really see how the graph weights could directly represent positive selection. The link weights would change as soon as a highly beneficial mutation occurred, a hot node under the old weights would become a cold node. Once the cold node acquired the mutation then it would in turn become hot.

What if we consider link weights that have a fixed substructure component and a variable "fitness" component. Suppose a beneficial mutation occurs at one of the nodes. The selective advantage of the allele would temporarily alter the link weight, i.e., nodes that have the allele would be slightly more likely to displace the offspring of those that don't have the allele. Even if a node is hot and so is often replaced, the beneficial allele should still pass into a cold node and then sweep that cold node. I do see that when a beneficial mutation occurs in a wealthy family it has a much greater chance of surviving stochastic elimination than if it arose in a poor family.

The more complex "amplifying" graphs don't seem realistic. I doubt substructures with many one-way flows have been common in human history. Even a little reverse flow would allow beneficial alleles to introgress.

These graph models seem more appropriate for studying drift (where the link weights don't change) than for studying selection.

Guess I need to read the paper.

Whoops, "Once the cold node acquired the mutation then it would in turn become hot." should read, "Once a downstream hot node acquires the mutation its link-weight-sum changes and it becomes cold."

empirically i'm thinking africa here...i think it might be a cold node and that outside alleles have a hard time penetrating. i'm also thinking that some island biogeography models might work as analogs.

i am going to reread the paper and chapter and answer you more directly later so i make sure i know what nowak is getting to.

Stop using difficult equations and formulas!!! But on the bright side I adore the information. Take Care...
