There’s a post up at Pharyngula describing the concept of synteny in comparative genomics (Basics: Synteny). The definition given by PZ Myers will sound pretty familiar to those of you who have read some of the genomics literature. The problem: it’s not quite correct. It’s actually the definition that I think most comparative genomics folks would give if they were asked to define synteny. But they keep using that word, and I don’t think it means what they think it means. What’s the definition? Here it is in PZ’s own words:
Synteny is the conservation of blocks of order within two sets of chromosomes that are being compared.
I disagree. While this is what many genomicists mean when they write or talk about synteny, they are wrong. Instead, I would argue that synteny merely means that genes are found on the same chromosome. Synteny says nothing about the order of genes. What gives me the right to say this?
Let’s take a quick journey through the literature. In a paper comparing the genomes of various mammals, Joseph Nadeau and colleagues wrote the following:
Synteny refers to the occurrence of two or more genes on the same chromosome, whereas conserved synteny refers to two or more homologous genes that are syntenic in two or more species, regardless of gene order on each chromosome, i.e., synteny but not necessarily gene order is conserved (Figure 2; see also NADEAU 1989). Conserved linkage pertains to the conservation of both synteny and order of homologous genes between species (Figure 2; see also NADEAU 1989). A disrupted synteny refers to circumstances where a pair of genes are located on the same chromosome in one species but their homologues are located on different chromosomes in another species, i.e., the genes are syntenic in only one of the two species.
In the Nadeau framework, if genes are found in the same order in two species, we say there is conserved linkage. Is there any precedent for this terminology? Well, here’s the relevant passage from Nadeau’s 1989 paper (doi:10.1016/0168-9525(89)90031-0):
Conserved syntenies are homology segments composed of two or more pairs of homologous genes located on the same chromosome, regardless of gene order. These represent the first formal evidence for conservation.
Conserved linkages are the most rigorously defined segments because both synteny and gene order must be conserved. Distinctions between the three characterizations, which are illustrated in Fig. 1, are important for understanding the extent and nature of conservation and for assessing progress towards saturated maps of linkage and synteny homologies.
In the figure to the right, genes A, B, and C are used to illustrate conserved synteny and conserved linkage. In panel A, there is no conserved synteny. Panel B shows conserved synteny, but not conserved linkage. And Panel C shows conserved linkage (which implies conserved synteny).
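If it helps to see the distinction spelled out, here is a minimal Python sketch of the Nadeau definitions. Everything in it is an illustrative assumption rather than anything from the papers: the toy genomes, the function names, the shortcut of giving homologous genes the same name in both species, and the choice to count a fully reversed order as the same order (on the grounds that a chromosome can be read from either end). The three toy comparisons loosely mirror panels A, B, and C.

```python
from typing import Dict, List, Optional

# Hypothetical toy genome: chromosome name -> genes in map order.
# Assumption: homologous genes carry the same name in both species.
Genome = Dict[str, List[str]]


def chromosome_of(genome: Genome, gene: str) -> Optional[str]:
    """Return the chromosome carrying `gene`, or None if it is absent."""
    for chrom, genes in genome.items():
        if gene in genes:
            return chrom
    return None


def syntenic(genome: Genome, genes: List[str]) -> bool:
    """Synteny: all of the genes sit on one chromosome. Order is ignored."""
    chroms = {chromosome_of(genome, g) for g in genes}
    return len(chroms) == 1 and None not in chroms


def conserved_synteny(a: Genome, b: Genome, genes: List[str]) -> bool:
    """The genes are syntenic in both species, regardless of gene order."""
    return syntenic(a, genes) and syntenic(b, genes)


def disrupted_synteny(a: Genome, b: Genome, genes: List[str]) -> bool:
    """The genes are syntenic in only one of the two species."""
    return syntenic(a, genes) != syntenic(b, genes)


def order_on_chromosome(genome: Genome, genes: List[str]) -> List[str]:
    """Relative order of the genes along their shared chromosome."""
    chrom = chromosome_of(genome, genes[0])
    return [g for g in genome[chrom] if g in genes]


def conserved_linkage(a: Genome, b: Genome, genes: List[str]) -> bool:
    """Conserved synteny plus conserved gene order.

    Assumption: a fully reversed order counts as conserved, since the
    direction in which a chromosome is read is arbitrary.
    """
    if not conserved_synteny(a, b, genes):
        return False
    order_a = order_on_chromosome(a, genes)
    order_b = order_on_chromosome(b, genes)
    return order_a == order_b or order_a == order_b[::-1]


genes = ["A", "B", "C"]
species1 = {"chr1": ["A", "B", "C"]}
panel_a = {"chr1": ["A", "B"], "chr2": ["C"]}  # C has moved to another chromosome
panel_b = {"chr1": ["B", "A", "C"]}            # same chromosome, shuffled order
panel_c = {"chr1": ["A", "B", "C"]}            # same chromosome, same order

for name, species2 in [("A", panel_a), ("B", panel_b), ("C", panel_c)]:
    print(
        name,
        "conserved synteny:", conserved_synteny(species1, species2, genes),
        "conserved linkage:", conserved_linkage(species1, species2, genes),
    )
```

Note that conserved_linkage calls conserved_synteny first, which is the code version of "conserved linkage implies conserved synteny". The toy panel A also satisfies the quoted definition of a disrupted synteny, because the three genes are syntenic in only one of the two species.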
Why does any of this matter? Well, rather than talking about things like micro- and macro-synteny, as Myers and various others do, we would have only synteny. This clarifies the terminology a bit. We have a separate term for the conservation of gene order of syntenic genes: conserved linkage. A uniform and clear vocabulary would make the literature and discussion within the comparative genomics community more precise. Precision is good, right?
Is anyone in the comparative genomics community using this terminology? Yes, there are a few people, but it’s a small group that doesn’t carry much weight: Drosophila geneticists. But, hey, Drosophilists invented modern genetics, so they are a bit of an authority on the topic. The second sequenced Drosophila genome, that of D. pseudoobscura, provided an opportunity to compare gene order between that species and D. melanogaster (doi:10.1101/gr.3059305). In this paper, the conserved linkage terminology was used.
Okay, but what the heck are syntenic blocks? Yeah, that’s a bit of an oddity. It appears that “syntenic block” has become the term for syntenic genes with conserved linkage. I’d prefer “conserved linkage blocks”, but those of us who favor clear terminology may have lost this war. So, it now seems like there are syntenic genes (those found on the same chromosome) and syntenic blocks (groups of syntenic genes found in the same order in the species being compared).
Ehrlich et al. 1997. Synteny Conservation and Chromosome Rearrangements During Mammalian Evolution. Genetics 147: 289-296
Nadeau 1989. Maps of linkage and synteny homologies between mouse and man. Trends Genet. 5: 82-86 doi:10.1016/0168-9525(89)90031-0
Richards et al. 2005. Comparative genome sequencing of Drosophila pseudoobscura: Chromosomal, gene, and cis-element evolution. Genome Res. 15:1-18 doi:10.1101/gr.3059305