DIY LOCKSS?

When I was but grasshopper-knee tall, my father the anthropologist took me to his university's library to help him locate and photocopy articles in his area of study for his files. He had two or three file cabinets full of such copies. (He may still.)

I have similar file cabinets, two of them: my del.icio.us account and my Zotero library. The del.icio.us account consists merely of links. The Zotero library, on the other hand, includes the actual digital object(s) as often as I can manage it (even at a major research university like MPOW, I cannot always lay eyes on everything I want to read). Zotero is capable of holding onto those items for me, and even backing them up in "the cloud" (actually my university-provided, passworded WebDAV space) without setting them free on the open Web in ways that would clearly and obviously violate copyright.

Now let us consider LOCKSS, and particularly the variant known as "Controlled LOCKSS" or CLOCKSS. Without wandering into the techie weeds, these programs do some elementary digital preservation on the e-journal literature by reproducing it widely in a geographically-distributed fashion, and coming up with policies for the opening of parts of the dark archive thus created in case of a crisis that removes the normal methods of access.

The thought occurs that Zotero, Mendeley, and similar bibliographic managers are a sort of do-it-yourself LOCKSS system. Metadata? Check. Digital items? Check. Reproduced widely in a series of dark archives? Check. All we're missing is the crisis-policy piece.

I don't wish a huge e-journal database loss on anyone, believe me; we would all be the poorer. I do feel just a tiny bit relieved, though, about this particular emergent effect of the widespread use of popular bibliographic managers.

This adds another soupçon of urgency to the movement for open data, to my mind. Open data will often (storage space willing) be reproduced by other researchers wishing to work with them. All by itself, organically, that phenomenon helps insure those data against loss, destruction, falsification, and other evils.

Tags

More like this

Duncan Hull and colleagues just published an excellent, must-read article - Defrosting the Digital Library: Bibliographic Tools for the Next Generation Web: Many scientists now manage the bulk of their bibliographic information electronically, thereby organizing their publications and citation…
I got a very nice email the other day thanking me for being a clearinghouse for e-research information. I'm not quite sure I am that, but just in case I've become it without noticing… What I read in the area and think is worthwhile enough to keep around ends up in a few places, all of which have…
Tuesday seems a good day for tidbits. (I am head-down in my UKSG presentation and class stuff at the moment, so kindly forgive posting slowness.) One argument I rarely see made for open access that should perhaps be made more often is that it reduces friction in both accessing and providing…
Sometimes it's worthwhile to let my "toblog" folder on del.icio.us marinate a bit. Posts I recently ran across on two different blogs illuminate the same point so well that they deserve their own post here! Off the Map offers Huffman's Three Principles for Data Sharing, which are really principles…