The NASA Earth Observing System and dealing with all that data

The NASA Earth Observing System is an incredible resource for both science and education. One of the amazing things about it is all the different kinds and quantities of data are assembled together into pictures that even grade school kids can immediately comprehend.

How do they do it?

Each of the EOS satellites delivers a terabyte or more of data per day from many different instruments.

How do they take satellite imagery, rainfall statistics, temperature information, and other kinds of data and assemble these data into meaningful pictures?

The answer is HDF (hierarchical data format).

HDF is designed to handle large amounts of data. It was developed at the National Center for Supercomputing Applications (NCSA) (NCSA brought us Mosaic, the first web browser.)

To quote EOS:

NCSA developed HDF to assist users in the transfer and manipulation of scientific data across diverse operating systems and computer platforms, using FORTRAN and C calling interfaces and utilities. HDF supports a variety of data types: n-Dimensional scientific data arrays, tables, text annotations, several types of raster images and their associated color palettes, and metadata. The HDF library contains interfaces for storing and retrieving these data types in either compressed or uncompressed formats.

For each data object in an HDF file, predefined tags identify the type, amount, and dimensions of the data; and the file location of various objects. The self-describing capability of HDF files helps users to fully understand the file's structure and contents from the information stored in the file itself. A program interprets and identifies tag types in an HDF file and processes the corresponding data. A single HDF file can also accommodate different data types, such as symbolic, numerical, and graphical data; however, raster images and multidimensional arrays are often not geolocated. Because many earth science data structures need to be geolocated, NASA developed the HDF-EOS format with additional conventions and data types for HDF files.

What does HDF have to do with bioinformatics?

That's in the next post.

Categories

More like this

Linux has powerful graphics tools For the average user or the professional image manipulator, there is a range of OpenSource software that will run on Linux as well as (in some cases) other platforms such as Windows. As discussed earlier, there are two basic kinds of image: Bit mapped and…
UPDATE (January 2, 2016): The makers of Scrivener have decided to abandon their Linux project. Kudos for them for giving it a try. The Scrivener on Linux users were not many, and almost nobody donated to the project, and as far as I can tell, the project was not OpenSource and thus could not have…
What do genetic testing and genealogy have in common? The easy answer is that they're both used by people who are trying to find out who they are, in more ways than one. Another answer is that both tests can involve DNA sequence data. And that leads us to another question. If the sequence of my…
This is a guest post by professional photographer Scott Rowed, describing his experience in switching from Windows to Linux. Does Linux have the tools for a professional photographer? A few months ago I would have answered "no". After switching primarily to Linux I gradually migrated my…