Collaborating on a data analysis project: students do the math with the Google Docs spreadsheet program

For many years, I had my biotech students do projects where each group of students would analyze their own data, in addition to all of the data gathered by the class. I would draw a table on the white board and each group would enter their data. At the end of the class, all the groups would copy all the results into their notebooks, then analyze them in Microsoft Excel.

This worked pretty well, but it wasn't perfect. There were always cases where one group would be really slow, or someone had to leave early, or I needed to use the board and couldn't.

And, this method certainly wouldn't work for an on-line course. You don't want students to have to post lots of data to forums or e-mail spread sheets back and forth.

Google Docs spreadsheet program has solved this problem!

Students can gather data and collaborate on-line on the same document at the same time.

And I can control whether students are able to edit a document, or simply view the document and download the results.

I can even share the document with the world, if I so choose.

And when I tried this out in my class a couple of weeks ago, it worked really well!

Here's what we did:

  1. Before class, I set up the table for data entry.
  2. All my students went to and signed up for a gmail account (if they didn't already have one).
  3. I clicked the Share tab and entered a list of my students' gmail addresses.
  4. Then, I clicked Invite Collaborators to send an e-mail to the students.
  1. The students clicked the link in their e-mail to access the spread-sheet.
  2. They entered their results in the spread sheet, simultaneously, as they worked on identifying their bacteria via blastn. This went on during and between class periods.
  3. Then in the next class, I used the Sort function to sort data, show them what happened, and discuss some of the issues related to data analysis and bioinformatics, for example:
  • being a good citizen and entering your data in community databases (I had never realized before this fall that so many of the information-poor entries in GenBank are that way because of the people who entered the information so poorly!)
  • consistent data entry (it's hard to analyze your data when all the entries describe the same thing differently)
  • the work that you have to do to clean up your data before you can analyze it, like fixing names, making all the entries consistent.

Then, all the students downloaded data to their own computers for analysis and graphing.  The students could have made some graphs in the Google document, but I wanted all the students to get practice. Second, I wanted to them to use PivotTables to analyze the results for the two different biomes (creek vs. forest) and that feature isn't available in Google Docs.

If you'd like to take a look at our results and see the table, I also published our data. You can find it at:  JHU_bacteria_2004

More like this

In part I, I wrote about my first semester of teaching on-line and talked about our challenges with technology. Blackboard had a database corruption event during finals week and I had all kinds of struggles with the Windows version of Microsoft Excel. Mike wrote and asked if I thought students…
Three (or more) operating systems times three (or more) versions of software with bugs unique to one or systems (that I don't have) means too many systems for me to manage teaching. Thank the FSM they're not using Linux, too. (Let me see that would be Ubuntu Linux, RedHat Linux, Debian Linux,…
This quarter, I'm using a wiki with my bioinformatics class and posting sometimes about the things that I learn. Two things I've been experimenting with are: Setting up pages for individual students so they can take notes while they're working. Embedding a Google form into one of my wiki pages…
Okay OpenOffice fans, show me what you can do. Earlier this week, I wrote about my challenges with a bug in Microsoft Excel that only appears on Windows computers. Since I use a Mac, I didn't know about the bug when I wrote the assignment and I only found out about it after all but one of my…

Thank you very much for the details about this. I heard about Google Docs from another geology blogger, and I'm trying to set it up for collaboration on a student lab project. You've just saved me a ton of fiddling around!

Interesting suggestion. Certainly score in the easy-GUI section. However, how well does it integrate with data analysis tools? Do you have experience with that? Do students have to export it to a XSL file, and then read into R/SPSS/MatLab/...? Or do you have some software package to make the distribution of the data easier?

Egon: I'm not sure what kind of formats you need for R, SPSS, or MatLab, but data can also be exported in several formats for other data analysis programs - you can use csv, txt, xls, html, pdf, and another format called ods that I'm not familar with.

Some data analysis functions are part of the spreadsheet, e.g. you can sort, sum, add, subtract, get the average, count, get the minimum and maximum values, multiply, and divide.

You can also insert several different functions and you can make a few different kinds of graphs.

I think, since this is based on Open Office, you probably get most (if not all) of the Open Office features, plus the ability to share work and chat.