Phylogeny Friday - 28 April 2006

By evolgen on April 28, 2006.

Over at my old site, I lamented the apparent death of distance-based tree-building algorithms. Just as all of life on earth can be divided into three domains, phylogenetic methods can be split into three groups: distance-based, maximum parsimony, and maximum likelihood. Distance- and parsimony-based approaches have been around for a while (and were used prior to the availability of molecular data). The combination of molecular data and more powerful computers allowed large molecular datasets to be analyzed using parsimony methods. That computing power has also made it practical to apply maximum likelihood methods to phylogenies. Bayesian likelihood algorithms are the en vogue tree-building methods, and they can be tuned to the specific parameters observed in your data. But, as I asked in the post, what about distance-based methods?


The discussion above is far from comprehensive, and I don't spend a lot of time building trees, so I'm not qualified to judge which method is best. That said, the appropriate method definitely depends on your data, and it's always good to confirm your phylogeny using multiple methods. Despite being published nearly twenty years ago, the neighbor-joining method remains one of the most popular tree-building algorithms. The article has been cited an amazing 9,820 times (according to Google Scholar). That may be an underestimate, as ISI lists it as having 13,353 citations.

The token phylogeny is shown to the left. This is the first ever neighbor-joining phylogeny constructed using real data. The evolutionary distances between these frog species (from the genus Rana) were measured using allozyme loci and biochemical interactions. Those are not exactly DNA sequences, but the original data were published in 1978. The numbers represent the evolutionary distance along each branch. DNA sequencing was still quite difficult in the 1980s, but technological advances made in the 1990s led to a rapid increase of DNA sequences in public databases. The neighbor-joining algorithm was used to construct many of the early phylogenies using molecular data (some of these may appear in Phylogeny Friday in the coming weeks).
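
For readers who want to see how a distance matrix turns into a tree with branch lengths like the ones in the figure, here is a minimal sketch of neighbor joining in Python. It follows the commonly taught Q-matrix formulation of the algorithm and is not the code used for the Rana tree. The function name `neighbor_joining`, the internal labels `node0`, `node1`, ..., and the five-taxon distance matrix are all made up for illustration; the matrix is a toy example, not the frog data.

```python
from itertools import combinations


def neighbor_joining(names, dist):
    """Build an unrooted tree from a symmetric distance matrix.

    names: list of taxon labels.
    dist:  square matrix (list of lists), dist[i][j] = distance between
           names[i] and names[j].
    Returns a list of (node, neighbor, branch_length) edges.
    """
    # Store pairwise distances keyed by an unordered pair of labels.
    d = {
        frozenset((a, b)): dist[i][j]
        for (i, a), (j, b) in combinations(enumerate(names), 2)
    }
    nodes = list(names)
    edges = []
    next_id = 0

    while len(nodes) > 2:
        n = len(nodes)
        # r[a] is the summed distance from a to every other active node.
        r = {a: sum(d[frozenset((a, b))] for b in nodes if b != a) for a in nodes}

        # Join the pair that minimizes Q(i, j) = (n - 2) d(i, j) - r_i - r_j.
        def q(pair):
            return (n - 2) * d[frozenset(pair)] - r[pair[0]] - r[pair[1]]

        i, j = min(combinations(nodes, 2), key=q)
        dij = d[frozenset((i, j))]

        # Branch lengths from i and j to the new internal node u.
        li = dij / 2 + (r[i] - r[j]) / (2 * (n - 2))
        lj = dij - li
        u = f"node{next_id}"
        next_id += 1
        edges.append((i, u, li))
        edges.append((j, u, lj))

        # Distances from the new node to every remaining node.
        for k in nodes:
            if k not in (i, j):
                d[frozenset((u, k))] = (
                    d[frozenset((i, k))] + d[frozenset((j, k))] - dij
                ) / 2
        nodes = [k for k in nodes if k not in (i, j)] + [u]

    # Connect the last two nodes with the remaining distance.
    a, b = nodes
    edges.append((a, b, d[frozenset((a, b))]))
    return edges


# Toy distances between five hypothetical taxa (not the Rana data).
names = ["a", "b", "c", "d", "e"]
dist = [
    [0, 5, 9, 9, 8],
    [5, 0, 10, 10, 9],
    [9, 10, 0, 8, 7],
    [9, 10, 8, 0, 3],
    [8, 9, 7, 3, 0],
]

edges = neighbor_joining(names, dist)
for node, neighbor, length in edges:
    print(f"{node} -- {neighbor}: {length}")
```

Each printed edge corresponds to one branch of the tree, and the number attached to it plays the same role as the branch lengths in the figure. Ties between candidate pairs are broken here by simply taking the first pair found, which is an arbitrary choice made for this sketch.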


Everybody has their preferences, but since neighbor joining, parsimony, and UPGMA have different assumptions, it's worth running all three on a dataset. It's technically not an obstacle, so why not explore the dataset? I agree that keeping the simplest analysis is the best, but the different views afforded by parsimony and UPGMA might turn up some interesting tidbits in the dataset.
Oh, and like flossing after every meal, don't forget to bootstrap.

