Seed Media Group

Discovering Biology in a Digital World

My thoughts on biology, teaching, life, and exploring the living world via the digital one. Only my opinions are represented by these postings, they do not represent the viewpoints of any funding agency or Geospiza, Inc.

Profile

Sandra Porter I am a microbiologist and molecular biologist turned tenured biotech faculty turned bioinformatics scientist turned entrepreneur. My passion is developing instructional materials for 21st century biology (Geospiza Education).

Search this blog

Learn about DNA with molecular models

Exploring DNA Structure


Subscribe to Geospiza Education News


e-mail digitalbio at gmail.com


DigitalBio Favorites

Molecular Momentos


Recent Posts

Recent Comments

Archives

Categories

Blogroll

Science Education Groups

Science Blogs School Fundraiser



Keep up to date

Awards

Red Orbit

Digital Bio at Blogged


Add Digital Bio to your Technorati Favorites!

Interesting places

  • xkcd
  • The Tangled Bank
    MicrobeWorld Radio

    « Bacterial metagenomics on the JHU campus: analyzing the data, part IV | Main | The day PZ visited Seattle and became a pirate »

    Digital Biology Friday: A microbiology blast puzzler

    Category: BioinformaticsDigital Biology FridaysGenetics & Molecular BiologyGenomicsMicrobiologyclassroom activitiesenvironmental educationsequence analysis
    Posted on: March 7, 2008 12:28 PM, by Sandra Porter

    Here's a fun puzzler for you to figure out.

    The blast graph is here:

    graph.png

    The table with scores is here, click the table to see a bigger image:

    smaller_table.png

    And here is the puzzling part: Why is the total score so high?

    If you want to repeat this for yourself, go here.

    You can use this sequence as a query (it's the same one that I used).

    >301.ab1
    CTAGCTCTTGGGTGACGAGTGGCGGACGGGTGAGTAATGTCTGGGAAACTGCCCGATGGAG
    GGGGATAACTACTGGAAACGGTAGCTAATACCGCATAACGTCGCAAGACCAAAGTGGGGGA
    CCTTCGGGCCTCACACCATCGGATGTGCCCAGATGGGATTAGCTAGTAGGTGGGGTAACGG
    CTCACCTAGGCGACGATCCCTAGCTGGTCTGAGAGGATGACCAGCCACACTGGAACTGAGA
    CACGGTCCAGACTCCTACGGGAGGCAGCAGTGGGGAATATTGCACAATGGGCGCAAGCCTG
    ATGCAGCCATGCCGCGTGTATGAAGAAGGCCTTCGGGTTGTAAAGTACTTTCAGCGAGGAGG
    AAGGCATTGTGGTTAATAACCGCAGTGATTGACGTTACTCGCAGAAGAAGCACCGGCTAAC
    TCCGTGCCAGCAGCCGCGGTAATACGGAGGGTGCAAGCGTTAATCGGAATTACTGGGCGTAA
    AGCGCACGCAGGCGGTCTGTCAAGTCGGATGTGAAATCCCCGGGCTCAACCTGGGAACTGCA
    TTCGAAACTGGCAGGCTAGAGTCTCGTAGAGGGGGGTAGAATTCCAGGTGTAGCGGTGAAAT
    GCGTAGAGATCTGGAGGAATACCGGTGGCGAAGGCGGCCCCCTGGACGAAGACTGACGCTC
    AGGTGCGAAAGCGTGGGGAGCAAACAGGATTAGATACCCTGGTAGTCCACGCCGTAAACGAT
    GTCTATTTGAAGGTTGTTCCCTTGAGGAGTGGCTTTCGGAGCTAACGCGTTAAATAGACCGCC
    TGGGGAGTACGGCCGCAAGGTTAAAACTCAAATGAATTGACGGGGGCCCGCACAAGCGGTG
    GAGCATGTGGTTTAATTCGATGCAACGCGAAGAACCTTACCTACTCTTGACATCCAC

    And change the database to Reference genomic sequences.

    I'll post the answer next Friday.

    Comments

    #1

    The total score is the sum of the scores of all the HSPs (alignments). In this case there are lots of HSPs, because this sequence appears many times in the target genomes.

    Posted by: Gustavo | March 7, 2008 10:56 PM

    #2

    You got it.

    Posted by: Sandra Porter | March 9, 2008 2:07 PM

    Post a Comment

    (Email is required for authentication purposes only. Comments are moderated for spam, your comment may not appear immediately. Thanks for waiting.)





    Having problems commenting? (UPDATED)

    Blogs in the Network

    Advertisement

    Top Five: Readers' Picks

    Search All Blogs

    Top Science Stories

    powered by SEED - seedmagazine.com