Not turning up our noses

By dsalo on August 12, 2009.

I gave a talk for PALINET some little while ago about institutional repositories. The audience had been primed by the fantastic Peter Murray to think about looking after digital content as the "fourth great wave" of library work. (I wish that talk was online. It was absolutely brilliant.)

But not everyone was entirely onboard with that. I recall distinctly one distinguished-looking white-haired gentleman raising his hand. "We in libraries," he said (paraphrase mine), "have historically been purveyors of quality information. Authoritative information. On what basis should we jeopardize that raison d'etre for institutional repositories?"

Brave man, and he expressed well a resistance I've felt in my librarian colleagues near and far as long as I've been running IRs. Why do you collect that, they ask without asking. IRs established alongside established digital-library programs suffer worse, the parvenu being simply too declassé to mention in the same breath as library-blessed digital collections. The funny thing is, in a lot of these situations I suspect the digital library was resisted for a long time too; I suppose I can only shrug and be mildly pleased that IRs legitimize digital libraries by being the next target of scorn.

The thing is, if libraries are going to involve themselves in digital curation, we'll have to get over our yen for authority and finality. Even, dare I say it, quality.

Part of the reason for this is that in many fields, data-quality standards haven't been worked out yet. Cowboy data curators have to do their best and hope. Over time, this problem is likely to become less salient, which I expect will also lessen librarians' resistance to data curation—but I doubt the issue will ever go entirely away.

A related part of the reason is that data authority is a vexed question, and in most cases (it seems to me) the data will have to be collected and cared for well before the question of authority can be resolved. We just won't know what data are usefully authoritative until the researcher community has chewed them over a bit.

Part of the reason is that if we want decent-quality, well-described data, we just can't sit around until it's final. I've any number of war stories about stupid data that didn't have to be stupid; its collectors just didn't think through what they were doing until it was much, much too late. A librarian—any librarian!—could have asked the right questions and pointed to some of the right answers, but only early enough in the process to ensure that librarianly insights made it into the data-gathering process.

Sometimes, for all our best efforts, we'll find a dataset that needed an intervention that it didn't get. Sometimes, we'll have to sigh and take it anyway. Irreplaceability is one cogent reason to do so.

I expect that many librarians will find this an unpalatable set of outlook changes. The only counter I have is that they are necessary outlook changes if we are to participate in this service cluster.

More like this

Scholarly legitimacy

I had the honor to participate in a futurist exercise by ALA's Association for Library Collections and Technical Services. The short essays they solicited have been placed online; they are well worth perusal. I wish the discussants at ALA's Midwinter gathering a pleasant and stimulating exchange.…

Set your house in order

Roy Tennant sent me an email about my Access presentation in which he asked what libraries should do about the laundry-list of data-curation challenges I presented. (If you're curious, you can go view the presentation yourself, courtesy of the wonderful A/V folk at Access. The less-than-an-hour-…

Graft or hybridize?

I've lived all my short career in academic libraries thus far on the new-service frontier. In so doing, I've looked around and learned a bit about how academic libraries, research libraries in particular, tend to manage new services. With apologies to all the botanists I am about to offend by…

IRs, "data," and incentive

Many of my readers will already have seen the Nature special issue on data, data curation, and data sharing. If you haven't, go now and read; it's impossible to overestimate the importance of this issue turning up in such a widely-read venue. I read the opening of "Data sharing: Empty archives"…

Why should librarians give up their QC role for anything-goes IRs? Call it the digital flip. For print the filters are on the input side. For digital the filters are on the output side. Ask Google.

Well, the funny thing is, librarians' QC role, considered in the context of the entire process of knowledge production, is actually pretty minimal. We decide what to buy -- AFTER the whole process of selection, editing, review, and publication.

It's not authoritative because we selected it. We selected it because we thought it was authoritative.

So another way of saying what I said in this post is that we're getting involved with data well before the QC processes that we take for granted. The thing I find nifty about that is that our involvement may actually help the data be higher-quality and more authoritative!

Advertisment

Donate

ScienceBlogs is where scientists communicate directly with the public. We are part of Science 2.0, a science education nonprofit operating under Section 501(c)(3) of the Internal Revenue Code. Please make a tax-deductible donation if you value independent science communication, collaboration, participation, and open access.

You can also shop using Amazon Smile and though you pay nothing more we get a tiny something.

Science 2.0

Science Codex

More by this author

We're moving!

August 3, 2010

Looking for us? We're happy to say that we're part of the new Scientopia blogging collective. Come see us there!

Belated Zombie Day post

July 13, 2010

Oh, if I'd only had this picture for Zombie Day... Credit for the photo to UK Serials Group. Credit for the alteration of the speech bubble (you can see the original slide here if you care to) to Steve Lawson. Incidentally, I should have a postprint of an article based on this presentation up…

Promoting a comment: "Open and shared format"

July 8, 2010

Richard Wallis has taken my ribbing in good part, which I appreciate; his response is here and will reward your perusal. He also left a comment here, part of which I will make bold to reproduce: As to RDF underpinning the Linked Data Web - it is only as necessary as HTML was to the growth of the…

Small fry, blogging networks, and reputation

July 8, 2010

So, the PepsiCo blog thing. Right. Advance disclaimer: this is me talking, not either of my illustrious co-bloggers. We have not yet made a decision about what to do; one co-blogger is across the pond at a conference and the other is vacationing, so that discussion will have to wait a bit. This is…

I'd love to dance with you, but...

July 6, 2010

Richard Wallis of Talis (a library-systems vendor) posted The Data Publishing Three-Step to the Talis blog recently. My reaction to this particular brand of reductionism is… shall we say, impolitic. I just want to pat Richard on the head and croon "Who's the clever boy, then? You are! Yes, you are…