The Matthew effect in science

By clock on March 2, 2009.

Douglas Kell: The Matthew effect in Science - citing the most cited:

The Matthew effect applies to journals and papers too - a highly cited journal or paper is likely to attract more citations (and mis-citations), probably for the simple psychological reasoning that 'if so many people cite it, it must be a reasonable paper to cite' (and such a paper is, by definition, more likely to appear in the reference list of another paper). Clearly that reasoning can be applied whether the paper has been read or otherwise. Simkin and Roychowdhury (2005 and 2007) note that a clear pointer to the citation of a paper one has not read is if it copies a mis-citation, and an analysis of the frequency of such serial mis-citations allows one to estimate, statistically, what fraction of cited articles have actually been read - at least at or near the time of writing a paper - by the citing author. Their analyses show (at least for certain physics papers) that "about 70-90% of scientific citations are copied from the lists of references used in other papers", and that a typical device is to start with a few recent ones plus their citations. Some aspects of this tendency in bibliometrics, especially with highly cited papers, can be detected from the power law form of the distribution of citation numbers, as in the Laws of Bradford and Lotka that I discussed before. Of course the mindless propagation of errors without checking sources properly is hardly confined to Science - a famous recent example with spoof data showed how some journalists simply copied Obituary material from Wikipedia!

I know people do this. Drives me crazy! Every paper I ever cited I read and re-read and re-read. Heck, I even tried to slog through papers in German (which I don't speak) if I thought they were relevant. But copy+paste just because others did? Nope.

More like this

Is it possible that some mis-citations arise more innocently? You would see the same effect if the researcher found the reference in another paper, copied it down (digitally or manually), read the paper properly, understood it, but then used the note of the reference they already have in their own text. Re-checking spelling or exact page numbers is not exactly exciting work!

And anyway, should the researcher necessarily read the entire paper? The reference indicates "this bit originally came from here, it's not my own", so it's an attribution, not an affidavit that the researcher has read each and every word of the paper.

Sam: If the original citation was wrong, how would the researcher have found the paper in the first place? I could see an erroneous paper title propagating in this fashion, but it is very hard to find a paper with an incorrect journal title, volume, or page number. (Not as hard as it was in the days before Google, but still...) That is one reason why getting the citation right is important.

If I want to look up an astronomy or particle physics paper, I'm just gonna plug the first 1 or 2 authors, and maybe the year, into ADS or SPIRES and download a copy. And the machine-generated BibTeX entry that they provide. Unless there's already an entry in the .bib file shared by my collaboration, potentially hundreds of people. That file was probably formed mostly by combining files from previous collaborations, and will be passed on to others in the future. Any errors will propagate, maybe for decades.
In fact, I just found a paper in SPIRES with my name mangled.

Advertisment

Donate

ScienceBlogs is where scientists communicate directly with the public. We are part of Science 2.0, a science education nonprofit operating under Section 501(c)(3) of the Internal Revenue Code. Please make a tax-deductible donation if you value independent science communication, collaboration, participation, and open access.

You can also shop using Amazon Smile and though you pay nothing more we get a tiny something.

Science 2.0

Science Codex

More by this author

New URL for this blog

July 5, 2011

Earlier this morning, I have moved my blog over to the Scientific American site - http://blogs.scientificamerican.com/a-blog-around-the-clock/. Follow me there (as well as the rest of the people on the new Scientific American blog network

New URL/feed for A Blog Around The Clock

July 26, 2010

This blog can now be found at http://blog.coturnix.org and the feed is http://blog.coturnix.org/feed/. Please adjust your bookmarks/subscriptions if you are interested in following me off-network.

A Farewell to Scienceblogs: the Changing Science Blogging Ecosystem

July 19, 2010

It is with great regret that I am writing this. Scienceblogs.com has been a big part of my life for four years now and it is hard to say good bye. Everything that follows is my own personal thinking and may not apply to other people, including other bloggers on this platform. The new contact…

Open Laboratory 2010 - submissions so far

July 19, 2010

The list is growing fast - check the submissions to date and get inspired to submit something of your own - an essay, a poem, a cartoon or original art. The Submission form is here so you can get started. Under the fold are entries so far, as well as buttons and the bookmarklet. The instructions…

Clock Quotes

July 18, 2010

At bottom every man know well enough that he is a unique being, only once on this earth; and by no extraordinary chance will such a marvelously picturesque piece of diversity in unity as he is, ever be put together a second time. - Friedrich Wilhelm Nietzsche