Pharyngula

Modeling metazoan cell lineages


A while back, I criticized this poorly implemented idea from Paul Nelson of the Discovery Institute, a thing that he claimed was a measure of organismal complexity called Ontogenetic Depth. I was not impressed. The short summary of my complaints:

  • Unworkable idea: There was no explanation about how we could implement and test the idea, and despite promises at the time, Nelson still hasn’t produced his methods.
  • False assertions and confusing examples: He claims that all changes in early lineages are destructive, for instance, which is false.
  • Bad metaphors: He uses a terribly flawed metaphor of a marching band to explain how development works; I’d say that it’s a better example of how development doesn’t occur.
  • No research: A research program in which no research is being done has a rather major shortcoming.

Recently, Nature published a paper by Azevedo et al. that superficially might resemble Nelson’s proposal, in that it attempts to quantify the complexity of developing organisms by looking at the pattern within their early lineages. The differences are instructive, though: this paper clearly explains their methodology, presents many of the limitations, and draws mostly reasonable conclusions from the work. It is an interesting paper and contains some good ideas, but has a few flaws of its own, I think. My main objections are that its limitations are even greater than the authors mention, and there are some conclusions that are driven by an adaptationist bias.

First, I’m going to give a bit of background. Some organisms, such as nematodes and ascidians, have a remarkably precise pattern of cell divisions in early development. One can map out the lineage of each cell, division by division, and see the same pattern and the same fate in every individual—the pattern is so stereotyped that each cell can be individually named, as in this pedigree of all of the cells in the first four divisions of the nematode embryo.

i-e345496aa8c2e8537b1b0a810c98b902-nematode_pedigree.jpg

There are quite a few invertebrate species that have this kind of determinate cell lineage, but it isn’t the only way to do things. We vertebrates, for instance, tend to have a series of early cell divisions that do not sort out into specific fates, but instead produce a large pool of essentially identical cells. They are then shunted into various tissues by the vagaries of chance, location, and global signals. This is one limitation of the work of Azevedo et al., that it doesn’t seem easily applicable to organisms with less rigid patterns of development. They are using the pattern of cell divisions as a measure of developmental rules, but in us, cell divisions aren’t so clearly linked to developmental decisions.

i-78d65ad255cd302c2fe731589b4a97b3-nemlin.gif

In nematodes, though, the pattern is sharply defined. Here is a lineage map of the fate of the progeny of one particular cell, named T.ap, which gives rise to a specific population of epidermal cells, neurons, and support cells in the wild type animal. The precise details don’t matter here, but what you can see in the diagram at left is the stereotyped sequence of divisions. T.ap divides once to generate one daughter cell that will eventually make the population of green boxes at the bottom; the other daughter cell also divides in its own pattern that eventually leads to one particular cell that undergoes programmed cell death, the “X”.

For now, all you need to know is that bit: one daughter always goes on to produce green boxes, the other goes on to make one great-great-grandchild that kicks the bucket.


i-547df3ea52ff701c1d3ffc8d3a2f6584-lin14(gf).gif

The pattern of divisions is hardwired into these cells. Mutations have been identified that change the cleavages in interestingly stereotyped ways. To the left, for example, is the pedigree of T.ap cells in a mutant called lin14(gf) (lin14 is the gene, the (gf) means it is a gain-of-function mutation: the animal has a particularly potent form of lin14). Look at what happens: T.ap divides into two daughter cells, and the one that usually goes on to make the dead great-great grandchild carries out its program normally. The other daughter cell, though, divides and produces one daughter that is supposed to go on to make the green boxes, but instead, it seems to think it is T.ap—it divides to generate a lineage that is a carbon-copy of the T.ap progeny, right down to producing that dead great-great-grandchild.

i-98e883fd95acb62aaa66bdb6d1bd7c59-lin_module.gif

It’s as if this one specific sequence of cell divisions and cell fates is an independent module, a programmed series, that is regulated by a relatively simple switch, the lin14 gene. It produces a specific and reiterated set of cell divisions. Produce lots of lin14 in a cell, and it will then go on to automatically produce a subset of progeny like the pedigree to the right.

This is powerful stuff. At least in those animals with strictly defined embryonic cell lineages, reiterated patterns of cell division represent a modular program of development that can be switched on and off in evolution. The lin14(gf) mutant is a dead end, of course, since the animal gets stuck in a rut and never develops past that first larval stage, but we can see other examples of reiterated lineages in nematodes and other animals that are functional… for instance, look at the A5.1 and A5.2 cells in this ascidian lineage.

i-5e4916e3f0ee4b946723fc2b6491403d-ascidianlin.gif

One way to look at the development of these animals is that they form nested, reiterated sublineages; an animal with many cells and cell types is built by following a relatively small number of rules repeatedly. Azevedo et al. have looked at those rules, and see the repetitions as a mathematical pattern that can be reduced to a shorter algorithmic description. They start with the known pattern of cell divisions and encode it as a series of simple rules, such as “cell X → {neuron, epidermis}”, if a cell divides to form a neuron and an epidermal cell. After encoding this verbose, literal description of the pattern, they then compress it by collapsing equivalent rules until they have a program that produces the same set of cells with a minimal algorithm. Lineages with many reiterated sublineages will compress more readily and yield a smaller number of necessary instructions per final cell, while cases where every division is unique in its outcome will be incompressible, requiring a separate rule for every division. The ratio of the minimal number of reduced rules to the total number of cell divisions is therefore a measure of the complexity of the lineage.
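To make the compression step concrete, here is a minimal Python sketch. It is my own toy, not the authors' code. It assumes a lineage can be written as nested pairs with cell-type strings at the tips, and that two divisions count as equivalent when their daughters are the same cell types or equivalent sublineages, ignoring which daughter is which. The names `canonical` and `lineage_complexity` and the example tree are made up for illustration.

```python
def canonical(node):
    """Canonical, hashable form of a sublineage; daughter order is ignored."""
    if isinstance(node, str):          # terminal cell type, e.g. "Neu"
        return node
    left, right = node
    # Sort by repr so that (A, B) and (B, A) collapse to the same rule.
    return tuple(sorted((canonical(left), canonical(right)), key=repr))


def lineage_complexity(tree):
    """Return (reduced_rules, divisions, ratio) for a lineage tree."""
    reduced = set()    # distinct rules left after collapsing equivalents
    divisions = 0      # the literal description has one rule per division

    def walk(node):
        nonlocal divisions
        if isinstance(node, str):
            return
        divisions += 1
        reduced.add(canonical(node))
        walk(node[0])
        walk(node[1])

    walk(tree)
    return len(reduced), divisions, len(reduced) / divisions


# Hypothetical toy lineage: three of its six divisions do the same thing (Neu + X).
toy = ((("Neu", "X"), ("X", "Neu")), ("Epi", ("Neu", "X")))
reduced, divisions, ratio = lineage_complexity(toy)
print(f"{reduced} reduced rules / {divisions} divisions = {ratio:.0%}")
```

For this toy tree the three Neu + X divisions collapse into a single rule, so 6 divisions need only 4 reduced rules, a complexity of 67%.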

The diagram below illustrates a described lineage from the nematode on the left. You can see, for instance, that there are 3 pink cells that do the same thing; they divide to produce a cell of type “neu” and a cell of type “X” by Rules R7, R15, and R16. Since Rules R7, R15, and R16 do the same thing, however, they can be compressed to Reduced Rule RR7 in the diagram to the right. Figure b is a shorter algorithmic description of the pattern in part a, and says that we only need 11 rules, RR0 through RR10, to build the actual distribution.

i-f4d2c779d2aaf4583caa0ae66fda9c73-abarapp_sublineage.gif
Example of the calculation of cell lineage complexity. a, The C. elegans ABarapp sublineage gives rise to 18 terminal cells of four different types (open circles): epidermal (Epi), neuron (Neu), structural (Str), and death (X). We begin by describing the cell lineage as a series of 17 rules, one for each cell division (solid circles): R0→{R1,R2}, R1→{R3,R4}, …, R16→{Neu,X}. Solid circles of the same colour indicate equivalent rules, ignoring planes of cell division (for example, R7, R15 and R16). b, The minimum algorithmic description of the ABarapp sublineage consists of 11 reduced rules. Each reduced rule is represented by a solid circle labelled RR0–RR10, with a unique colour matching that of equivalent cell divisions (for example, RR7→{Neu,X} corresponds to the initial rules R7, R15 and R16). The lineage complexity of ABarapp is calculated as the number of reduced rules divided by the total number of cell divisions: C = 11/17 = 65%.

In the animal, there are 17 cell divisions. The number of reduced rules is 11, so the relative complexity of this system is 11/17, or 65%. If the system were incompressible and each division were unlike all the others, it would require 17 rules to describe 17 divisions, so the complexity would be 100%. All clear? The lower the complexity number, the more repetitious the sequence of cell divisions is. In the example of lin14 above, the lin14(gf) mutant would cause an extreme reduction in the complexity of the lineage.
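Here is a rough Python illustration of why a reiterated sublineage pulls the number down. Everything in it is an assumption on my part: the tree shapes are invented stand-ins rather than the real T.ap pedigree, the "mutant" is modeled crudely as one extra copy of the whole wild-type pattern in place of one branch, and `complexity` is a hypothetical helper that treats divisions as equivalent when their daughters match regardless of order.

```python
def complexity(tree):
    """Return (reduced_rules, divisions) for a lineage of nested 2-tuples."""
    reduced, divisions = set(), 0

    def canon(node):
        nonlocal divisions
        if isinstance(node, str):      # terminal cell type
            return node
        divisions += 1
        # Order-independent form, so mirror-image divisions are one rule.
        form = tuple(sorted((canon(node[0]), canon(node[1])), key=repr))
        reduced.add(form)
        return form

    canon(tree)
    return len(reduced), divisions


# Invented stand-ins: one branch makes "green box" cells, the other ends in a death.
green_branch = (("Epi", "Epi"), ("Epi", "Neu"))
death_branch = ("Neu", ("Str", "X"))
wild_type = (green_branch, death_branch)

# Crude gain-of-function model: the green branch replays the whole parent pattern.
mutant = (wild_type, death_branch)

for name, tree in (("wild type", wild_type), ("mutant", mutant)):
    r, d = complexity(tree)
    print(f"{name}: {r}/{d} = {r / d:.0%}")
```

In this toy the wild type needs 6 rules for 6 divisions (100%), while the mutant has 9 divisions but only 7 distinct rules (78%), because the replayed sublineage adds divisions without adding new rules.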

The authors applied their method to the lineages of three nematodes (C. elegans, Pellioditis marina, and Halicephalobus gingivalis) and one ascidian (Halocynthia roretzi), and got complexity values of 35%, 38%, 33%, and 32%, or roughly a third of the complexity of equivalent systems with no reiterated lineages. They also compared the complexity values to random networks. That is, if you generate a lineage on the computer with random bifurcations at each division, you also expect a complexity value less than 100% because sometimes, just by chance, two divisions will produce the same outcome. The real networks were still simpler than the random networks by 26-45%.
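The random comparison can be sketched in a few lines of Python. I am not reproducing the authors' procedure here; this version simply assumes that a random lineage can be built by repeatedly pairing two randomly chosen subtrees until a single root remains, which keeps the terminal cells fixed. The helper names, the cell counts, and the seed are all hypothetical.

```python
import random


def complexity(tree):
    """Return (reduced_rules, divisions) for a lineage of nested 2-tuples."""
    reduced, divisions = set(), 0

    def canon(node):
        nonlocal divisions
        if isinstance(node, str):
            return node
        divisions += 1
        form = tuple(sorted((canon(node[0]), canon(node[1])), key=repr))
        reduced.add(form)
        return form

    canon(tree)
    return len(reduced), divisions


def random_lineage(cells, rng):
    """Join two randomly chosen subtrees at a time until one root remains."""
    pool = list(cells)
    while len(pool) > 1:
        a = pool.pop(rng.randrange(len(pool)))
        b = pool.pop(rng.randrange(len(pool)))
        pool.append((a, b))
    return pool[0]


def balanced(cell, depth):
    """A maximally repetitive lineage: every division at a given depth is identical."""
    if depth == 0:
        return cell
    return (balanced(cell, depth - 1), balanced(cell, depth - 1))


# Made-up terminal cell distribution (16 cells, so always 15 divisions).
cells = ["Epi"] * 8 + ["Neu"] * 4 + ["Mus"] * 4
rng = random.Random(0)
for trial in range(5):
    r, d = complexity(random_lineage(cells, rng))
    print(f"random lineage {trial}: {r}/{d} = {r / d:.0%}")

r, d = complexity(balanced("Epi", 4))
print(f"fully reiterated lineage: {r}/{d} = {r / d:.0%}")
```

Even the random trees usually come in under 100%, since chance alone produces some identical divisions, while the fully reiterated tree needs only 4 rules for its 15 divisions.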

This diagram illustrates this idea. The top figure is the actual reduced description for the lineage of Halocynthia roretzi, with a complexity of 32%. The second figure is a random network, generated by a computer with the only constraint being that it produce the same distribution of cell types in the same number of cell divisions; it’s obviously much more complicated.

i-9f280352feb34dbd553497564558163f-h_roretzi_lineage.gif
The simplicity of the ascidian cell lineage. Shortest algorithmic descriptions of three lineages capable of generating the cells in the H. roretzi tissue-restricted stage embryo. a, The real lineage has a complexity of C = 32%. b, A random bifurcation lineage with over twice the complexity of the real one (C = 76%; Fig. 2d). c, The simplest lineage evolved from the H. roretzi lineage by selection for low complexity is approximately half as complex as the real one (C = 17%; Fig. 4d). Solid circles represent the reduced rules required to generate the different terminal cell states (open circles): endoderm (End), epidermis (Epi), mesenchyme (Mes), muscle (Mus), nervous system (Ner), notochord (Not) and undifferentiated (Und).

The third figure, c, is an example of the outcome of a simulation. The simulation is constrained to again produce the same distribution of cells as a final result, but is free to modify the rules within the lineage until the simplest possible rule set is identified. The computer was able to find an alternate set of rules that was half as complex as the observed set. The message is that real pedigrees are much simpler than either the worst possible or a random rule set, but still somewhat more complex than an optimum.

It’s an interesting paper and has the virtue of applying quantitative techniques to the problem of complexity in evolution, but I’m not entirely satisfied. I think there are still some problems here, and I don’t entirely trust the numbers that the authors have generated.

One concern is that the descriptions of the terminal fates of these cells are only approximations. When one cell is described as “neuron” and another is also “neuron”, the fact that they are labeled as identical in the rulesets may be an artifact of incomplete knowledge. Maybe the first cell is “serotonergic neuron”, while the second is “GABAergic neuron”, and future detailed analysis will make the complexity values go up. The values seem to be a lower bound, at best.
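A tiny Python example shows the effect of finer labels. It uses an invented four-division lineage and a hypothetical `complexity` helper that counts distinct divisions while ignoring daughter order; the neuron subtypes are just the ones I mentioned above, not data from the paper.

```python
def complexity(tree):
    """Return (reduced_rules, divisions) for a lineage of nested 2-tuples."""
    reduced, divisions = set(), 0

    def canon(node):
        nonlocal divisions
        if isinstance(node, str):
            return node
        divisions += 1
        form = tuple(sorted((canon(node[0]), canon(node[1])), key=repr))
        reduced.add(form)
        return form

    canon(tree)
    return len(reduced), divisions


# Coarse labels: both neurons are just "Neu", so two divisions look identical.
coarse = ((("Neu", "X"), ("Neu", "X")), "Epi")

# Finer labels for the same tree: the two neurons turn out to be different types.
fine = ((("Neu-5HT", "X"), ("Neu-GABA", "X")), "Epi")

for name, tree in (("coarse", coarse), ("fine", fine)):
    r, d = complexity(tree)
    print(f"{name} labels: {r}/{d} = {r / d:.0%}")
```

With coarse labels the toy lineage scores 3/4 (75%); once the two neurons are distinguished, nothing collapses and it scores 4/4 (100%). Splitting a label can only keep the rule count the same or raise it, which is why the published values look like a lower bound.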

The authors do mention that interactions between cells are not incorporated into their models, and that position is only approximated as a place on the rule tree. This is a serious shortcoming; orientation, position, and interactivity with other cells and the environment are vital parts of the developmental story, and there is a genetic bias to their analysis. For instance, look at this lovely structure:

i-7fd9705a381a087e99f565e60bf21ecf-nemvulva.jpg

That’s the nematode vulva, and it is assembled from the products of a determinate cell lineage, but it is a signal from one cell (ac, the anchor cell) that localizes the vulva, and it is the orientation and location of the component cells that defines the vulval opening and the associated tissues. These spatial and morphological factors are neglected in the modeling. (By the way, the vulva development story is a wonderful piece of work…I’m going to have to put that on my list of things to write up.)

My strongest objection to the paper, though, is that it is cast as a purely adaptive story. The simplicity of the rule networks is presented as the product of selection for minimal rules in the history of these animals. I’d actually argue the other way, that what has happened is that evolution occurred by amplification of simpler, core modules in development; the building blocks were simple. Their techniques are actually showing the underlying modularity of development, not a process of paring away complexity.


Azevedo RBR, Lohaus R, Braun V, Gumbel M, Umamaheshwar M, Agapow P-M, Houthoofd W, Platzer U, Borgonie G, Meinzer H-P, Leroi AM (2005) The simplicity of metazoan cell lineages. Nature 433:152-156.
