This is a pushmi-pullyu post. I need some help with an environmental scan, so I'll get us started and the rest of you smart folks can amplify my knowledge.
I want to understand what's going on where with data curation specifically at the institutional level (no NOAA, no ICPSR, none of that) Stateside. Grant-funded is fine, though I'm doubly curious about programs that have been weaned (or are weaning themselves) off the grant money. Here are the programs I know about offhand:
- Institutional data curation: San Diego Supercomputer Center (right? I'm not entirely sure what they offer vis-a-vis long-term data stewardship), Purdue's D2C2, Cornell's DataStaR.
- Subject-specific but still (mostly?) institution-focused: Cornell's CUGIR (there must be a lot more GIS out there, mustn't there?), North Carolina's DRYAD
- Data-curation training: Illinois, North Carolina.
Tell me what I'm missing, please and thank you.
I'm not sure if this is what you are looking for but we built an open software product called ir+ which allows researchers, faculty and staff to author, share, collaborate and optionally publish into the repository - all rolled into one application. We are running it in production here:
That's close... but it's mostly for collaboration on publications, yes? I'm hoping for things a little closer to raw data. (Which isn't to say IR+ couldn't go in that direction! Very likely it could.)
Well, actually it's a private workspace for sharing of any type of file or data with no restriction on usage - so it can be used for either - it's up to the researcher.
Size does become an issue on upload - on campus; we have had examples of uploaded files larger than 200 MB. However, I'm not sure what raw data set size you are looking into. Gigabyte size data sets may be another story.
Right! That's right up my alley, then. Thanks! (And for small science, I think you've come pretty close to the 80/20 point. I suspect they'll let you know if/when they need more!)
The University of Idaho has INSIDE Idaho, the statewide clearinghouse for GIS data.
You'd surely have to include UC's CDL... their digital preservation initiative has even been re-branded University of California Curation Center (UC3).
And surely Johns Hopkins would be worth a mention; their Datanet-funded activity stems from local roots.
Several of the NDIIPP projects have strong local rots as well. Georgia Tech would be a good example, I think.
Excellent. Many thanks! (Now, I will wander off and try to find out why my emailed comment notifications aren't working properly...)