Sequencing a Genome, part VI: Chimeras are not just funny-looking animals

By sporte on February 1, 2007.

To the ancient Greeks, a chimera was a kind of monster, with the body of a goat, the tail of a dragon, and a lion's head. To geneticists, a chimera can be an animal that's derived from two embryos, such as a transgenic mouse. Or if the organism is a plant, it can be a plant with a graft. We have a chimeric cherry tree in our back yard with branches from Rainier cherries, Bing cherries, and Van cherries. And you should see the chimeras that hang out at evolgen.

Naturally, the DNA cloning and sequencing world has it's chimeras, too. There are two main kinds that I know. Sometimes chimeras are created during a cloning step when two fragments of DNA that are normally separated, become joined by accident in the cloning step.

Can you spot the chimera?

Sometimes chimeras are created during electrophoresis. In the early days of DNA sequencing instruments, when fluorescently-labeled DNA fragments were separated by size in polyacrylamide gels, chimeras could appear during the gel runs.

This image shows what the DNA in a gel would look like. Labeled DNA fragments from 8 different samples, are lined up in 8 lanes, straight up and down. Occasionally, while the gel was running, the tracking software would shift lanes, and start reading information from the next lane. These events produced chimeric reads, that is, reads with DNA sequences originating from two different samples. (What is a read?)

A chimeric read.

Believe it or not, one of my former students had a night job at a genome center that involved staring at computer screens and making corrections if the gel tracking software started to misbehave.

I'm sure you're wondering, about now, how this all relates to genome sequencing.

Yesterday, I wrote about some of our experiences with analyzing some genomic libraries from phage (1). Some of the libraries had been created from DNA that was broken into random pieces by sonication. Other libraries were created from genomic DNA that was digested with restriction enzymes.

Remember, there are 3 main steps in sequencing genomes (and I suspect if you read a few of these posts you will never forget this):

Break the genome into lots of small pieces at random positions.

Determine the sequence of each small piece of DNA.

Use an assembly program to figure out which pieces fit together.

We got such strange results from trying to reconstruct genome sequences from the RE (restriction enzyme) libraries that we knew there must be something strange going on. Yesterday, we established that the libraries weren't random, but we thought there must be other problems, too.

This is what happens when you assemble DNA that's been broken at random positions.

Hunting for chimeras.

You can imagine that if the DNA sequences in our reads are from different locations, themselves, things can get pretty confusing, pretty fast. We thought that if some of the reads were chimeras, that might explain the bizarre results from our assemblies.

Fortunately, Phrap (2), the assembly program that we use in the Finch Suite, has an option to detect chimeras and problem reads. We put it to the test. We assembled different sets of sequences that had been sampled over time, and to each assembly, we added different numbers of new reads.

What happens when we add RE reads to our assemblies?

The first half of the graph shows the results from assembling 3148 reads obtained from the sonicated DNA library. We started reads from the RE libraries at the position of the vertical bar. In total, we added 1187 reads from the AseI and DraI libraries. Phrap detected 67 chimeras in the last assembly (4.7% of the added reads) and comparatively few in the reads from the sonicated DNA library.

I'm sure glad we weren't sequencing manatees.

References:
1. E. Green. 2001. Strategies for the systematic sequencing of complex genomes. Nature Reviews Genetics 2:573-583.

2. Porter, S., Slagel, J., and T. Smith. 2004. Analysis of Genomic DNA Library Quality with the Finch®-Server. Geospiza, Inc. You can download the paper as a pdf document from here: http://www.geospiza.com/research/white-papers.htm
Look in the middle of the page.

The earlier installments are here:
Part I: Introduction
Part II: Sequencing strategies
Part III: Reads and chromats
Part IV: How many reads does it take?
Part V: Checking out the library

More like this

Advertisment

Donate

ScienceBlogs is where scientists communicate directly with the public. We are part of Science 2.0, a science education nonprofit operating under Section 501(c)(3) of the Internal Revenue Code. Please make a tax-deductible donation if you value independent science communication, collaboration, participation, and open access.

You can also shop using Amazon Smile and though you pay nothing more we get a tiny something.

Science 2.0

Science Codex

More by this author

New home for Discovering Biology in a Digital World

October 30, 2017

Sometime in the next day or two, Scienceblogs will shut down. We've enjoyed the opportunity to blog here for the past 10+ years. Not to worry, @digitalbio and @finchtalk will continue blogging, but more so from their own site at Digital World Biology. The Scienceblogs posts have been…

Synbiobeta: The Future is Now

October 12, 2017

@synbiobeta concluded it’s #sbbsf17 annual meeting on synthetic biology Oct 5, 2017. The progress companies are making in harnessing biology as a platform for manufacturing and problem solving is world changing. Locations of Synbio Companies What is Synthetic Biology? Synthetic biology is a term…

Understanding the CRISPR Cas9 system

September 18, 2016

On Sept. 30th, I'm going to be co-presenting a Bio-Link webinar on Genome Engineering with CRISPR-Cas9 with Dr. Thomas Tubon from Madison College. If you're interested, Register here. Since my part will be to help our audience understand the basics of this system, I prepared a…

Zika virus, drug discovery, and student projects

March 8, 2016

It's well understood in science education that students are more engaged when they work on problems that matter. Right now, Zika virus matters. Zika is a very scary problem that matters a great deal to anyone who might want to start a family and greatly concerns my students. I…

DNA: it's in your blood

February 28, 2016

Did you know small fragments of DNA are circulating in your blood stream? These short pieces of DNA are left behind after cells self-destruct. This self-destruction, or apoptosis, is a normal process. In the case of fetal development, certain cells in our hands die, leaving behind individual…