DOI! D!O!I! D-O-I! D.O.I.!

By cpikas on August 28, 2009.

I love the DOI. It's the best thing since sliced bread. Actually, it's better than sliced bread - I can slice my own bread - but I can't do what DOIs do so easily.

If you've been living under a rock for a while, you might not know that a DOI is a document object identifier - it's a unique identifier at the article or chapter level (or really at any level - like each image, each paragraph, or the whole book). Like you have ISBNs for books and ISSNs for journal (titles). What's really cool is that you can just put http://dx.doi.org/ in front of one, and get directed to the publisher's page for the article. What if you don't have access to the article at the journal's site? Well, you can enter the doi into your institution's handy open url resolver thingy (like SFX) and it will find the best place for that document (somehow this works less well, not sure why). It's persistent, it's interoperable, actionable (see the site: http://doi.org). The publisher can move the document around and keep the same url using these fabulous things (it's a handle, too).

Publishers have to pay for them - really just to keep the apparatus up - but it's worth every penny. A lot of publishers have gone back to assign a doi to their whole digital backfile (to, oh, 1680 or something).

I'm not the only one who thinks they're handy, APA has required the inclusion of the DOI in the citation for a while now. Oh, and if you use Connotea or ResearchBlogging.org you can just enter the DOI to get it to fill in the rest of the information using the system.

A couple of niggling points:

there are still a couple of publishers out there who don't participate (I know! Isn't that crazy?)
sometimes the research databases don't export them in the citation or the direct export to a citation manager (WHY????)
sometimes you get an article before it has a DOI or maybe after there's a DOI listed on it, but before it's registered in the system. I think this is becoming rarer, but it's a PITA. I get an RSS feed of early view articles from my society's publication. Used to be if you read the feed immediately, it would be 50-50 that the doi would work (it would always work a week later). Now it seems to be pretty immediate.
some ebook vendors don't provide these at the chapter level. That's really nice when they do. It would be really nice if the various CRC netbases did.

If you enjoy reading about specifications and standards and all that jazz, check out their site. For the rest of us, use DOIs and be happy.

More like this

Honestly, there is one thing I've never quite gotten about DOIs. Why not add the four characters "doi:" in front to make it fit as a URN scheme? Is there already such a mapping specified? While I understand that DOIs are about more than just the Web, in a sense, so are URNs, so implementing DOI as a particular and very nice URN schema makes sense to me, but maybe I'm missing something that makes this unreasonable...

Do DOIs actually do anything for us that URLs would not do if only people did not insist on messing with them? So far as I can see, a DOI is only as good as the assurance we have from the relevant organizations that it will never be messed with in a similar way, never changed, deleted, lost, or otherwise rendered useless. How far can we really trust that will be so? Doesn't introducing the IDF (or whatever) into the system just introduce one more possible point of failure, one more organization that might become careless or moribund? If they were actually archiving documents, that would be a different matter, but DOIs (unless I completely misunderstand) are just a second, redundant set of pointers.

Why would it not be just as good (and easier) to extract a undertaking from the relevant organizations (publishers, presumably) that they will, in future, refrain from changing the relevant URLs? Presumably, to make the DOI system work, publishers (or whoever else archives the actual files) have already had to undertake to make sure either that the files are never moved, or that if they are moved, the DOI will always be promptly updated to point to the right place. How is this easier (and more reliable, because it relies on fewer fallible organizations) than simply refraining from changing URLs?

What am I missing here?

@Chris - I don't know about the URN thing, but it seems to be discussed a lot on the doi page. See: http://www.doi.org/handbook_2000/enumeration.html#2.9.3
@Nigel - well, I think we've proven that publishers can't and won't promise to keep the URL stable! The point for handles or purls, afaik, is that you have one stable url and then the longer one can change at will and you just update the registry. So if a journal or even the publisher gets sold or transferred or if it's archived by another service, there are some legitimate times to update the url.

@Nigel - In reality, ownership and systems change. Ideally, every site admin would keep up the integrity of their URLs, past and present. However the 404 error is far more common. When a journal changes hands or the hosting platform changes, the URL changes but the DOI does not. I think DOI's give the publishers a business case to keep up access to their content when the changes occur but do not come with the perceived admin overhead. It is a single point of updated information versus a nest of httpd.conf edits or .forwards.

I am wondering: how essential ARE DOIs when it comes to citing (multi)media files in scholarly communication? http://network.nature.com/groups/citation-science/forum/topics/5402

I guess you, it's not "Aussie, Oi! Oi! Oi!", but "Documents, DOI! DOI!DOI!" :-)

Edit: I guess for you, ...

(Sorry, rushed typing.)

Advertisment

Donate

ScienceBlogs is where scientists communicate directly with the public. We are part of Science 2.0, a science education nonprofit operating under Section 501(c)(3) of the Internal Revenue Code. Please make a tax-deductible donation if you value independent science communication, collaboration, participation, and open access.

You can also shop using Amazon Smile and though you pay nothing more we get a tiny something.

Science 2.0

Science Codex

More by this author

Yeah, me too.

August 2, 2010

I'm also leaving ScienceBlogs, but it's not for the reasons some others have given. I don't think Pepsi's blog will hurt my real life reputation and besides, it's been pulled, there have been apologies - it's time to forgive. July was the first month I've gotten enough hits to get a paycheck - and…

Very cool - American Physical Society offers free access to public libraries

July 29, 2010

This APS rocks! Here's the press release from PAMnet: FOR IMMEDIATE RELEASE APS ONLINE JOURNALS AVAILABLE FREE IN U.S. PUBLIC LIBRARIES Ridge, NY, 28 July 2010: The American Physical Society (APS) announces a new public access initiative that will give readers and researchers in public libraries…

Michael Pater, Connecticut artist, died today

July 25, 2010

He was also my husband's uncle. I only found two of his images online, the remainder are photographs of prints we have on our walls - intentionally poor quality for those. He was a member of the Lyme Art Association, so there may be more information on their site. The Courant (Hartford, CT)…

Hey maybe scientists should do more than just wait for their journal to issue a press release on their new fabu article

July 25, 2010

The authors thesis is that the only mandatory communication of results is in peer reviewed journal articles. Scientists aren't required to do other communicating and often leave communication to the public to the media. They ask if is this is adequate given the very low percentage of scientific…

Well, sometimes you just have to Google it

July 21, 2010

So there I was, try all kinds of librarian ninja tricks on the fanciest, most expensive research databases money can buy (SciFinder, Reaxys, Inspec...) and no joy. Couldn't find what I needed. I'm perfectly willing to admit that I don't know all that much chemistry, but usually I do ok since I work…