Match the trace with the sample

DNA sequence traces are often used in cases where:

  1. We want to identify the source of the nucleic acid.
  2. We want to detect drug-resistant variants of human immune deficiency virus.
  3. We want to know which base is located at which position, especially where we might be able to diagnose a human disease or determine the best dose of a therapeutic drug.

In the future, these assays will likely rely more on automation. Currently, (at least outside of genome centers) many of these results are assessed by human technicians in clinical research labs, or DNA testing companies, who review these data by eye. (Yes, really!)

Today, you too, can be a data analyst and figure out what's happening with these traces.

i-9c37a1f9810f3bf66d275b4683fd7e0c-traces_lined_up.gif

These traces came from PCR products that were obtained from bacterial cultures by Sanger dideoxy sequencing. The bacterial cultures were isolated by JHU students in 2004. Some questions might have multiple answers.

  • Which of these traces looks like it might have been generated from a mixed sample of DNA? (in this case, a mixed culture of bacteria.)
  • Which of these traces probably came from a pure culture of bacteria?
  • Which of these traces appears to contain positions with polymorphic bases? (Polymorphic means more than one form)
  • The Extra Credit Question: All of these traces came from independently-isolated bacteria and are not likely to be the same species. Even so, two of the traces appear to have very similar sequences. Why do you think this might be the case? You can use blastn to answer this, but be sure to adjust the parameters for short sequences.


And, if you want to download some of our data, you can see how to get it here.

More like this

How did the human genome ever get finished if every one of the three billion bases had to be reviewed by human eyes? In the early days of the human genome project, laboratory personnel routinely scanned printed copies of chromatograms, editing and reviewing all DNA sequences by eye. For more…
What do genetic testing and genealogy have in common? The easy answer is that they're both used by people who are trying to find out who they are, in more ways than one. Another answer is that both tests can involve DNA sequence data. And that leads us to another question. If the sequence of my…
Welcome Bio256 students! This quarter, we're going to do some very cool things. We are going to use bioinformatics resources and tools to investigate some biological questions. My goal, is for you to remember that these resources exist and hopefully, be able to use them when you're out working…
Shotgun sequencing. Sounds like fun. Speculations on the origin of the phrase I think that this term came from shotgun cloning. In the early days of gene cloning before cDNA, PCR, or electroporation; molecular biologists would break genomic DNA up into lots of smaller pieces, package DNA in…

Hi Sandra,

1) The answer to this question is C. Here we can clearly see that there secondary peaks at many base locations which can interfere with accurate base calling.

2) Traces A and B come from pure culture.

3) Traces A and B. There are polymorphic locations in A(169,171 etc) as compared to B. Also the base call quality is high so these can be considered as good or true changes.

4) I havent done any BLAST analysis. but i guess from the trace picture its looks like the sequences A & B are related or share same function or role but have diverged over a period of time. the entire pairwise alignment is good except there a couple of bases in trace A at location 181 and 185 which have poor quality values (which may be sequencing errors)

Amit: you are right in that all three sequences are related. They were generated from different samples, but using same set of primers.

And you're right about which cultures are mixed and which are not and about the bases that differ between samples. But, you're not right about the polymorphisms. Only one of these samples contains positions with polymorphic bases.

Hi Sandra,

Trace A appears to contain positions with polymorphic bases. If we clone this PCR product and pick several colonies to be extracted and sequenced (as plasmid DNA), the mixed base position at 181 and 184-base of Trace A could show as 2 different bases in separate clones.

Hi,
please,in my samples i can see the bases in reigon with back ground gry and also on back ground pink with program AB1 for sequencing what this mean? and can i depend on the bases in gry region or not
thanks

Are you viewing traces in FinchTV from iFinch or GSAE?
If so, the area with the pink shading is the area that matches the vector and the area in gray is the area with poor quality sequence that would be trimmed by other programs in the Geospiza system. So, if this is the case, the answer is no.