Finding scientific papers for free, one more experiment


I meant for this to be a three-part series, but in part II, I learned that one more experiment had to be done. I had to know whether the articles I found in PubMed Central were the same articles that I found in PubMed.

Part I and part III cover the background and my favorite method. Now, we're going to find out if my favorite method is really enough.

In other words, I had this kind of problem (shown in the diagram) and I just had to know which case was correct:

[Image: pubmed-and-pmc.gif]

The method:
To test this, I did a PubMed search with the term "cancer," as before, and limited the search to free full text.

Then, I clicked the Preview/Index tab, opened the Filter field, and selected either the pubmed pmc free filter or the pubmed pmc filter. (Both filters are shown in the image below.)

[Image: filter.gif]

Then, I clicked the AND button to add that term to my query. (Using the AND, OR, or NOT buttons works wonderfully, because everything is properly formatted with quotes and brackets.)
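
For anyone who would rather script the same query than click through the tabs, here is a minimal sketch of how the combined search could be assembled for NCBI's E-utilities ESearch service. I did not do it this way in the experiment, so treat the details as assumptions: the helper names `and_terms` and `count_url` are made up for illustration, and the filter tags `free full text[sb]` and `"pubmed pmc"[sb]` are my best guess at how the limits described above are written in query syntax. The sketch only builds the URL; it does not contact NCBI.

```python
from urllib.parse import urlencode

# Public ESearch endpoint from NCBI's E-utilities.
ESEARCH = "https://eutils.ncbi.nlm.nih.gov/entrez/eutils/esearch.fcgi"


def and_terms(*terms):
    """Join search terms with AND, wrapping each one in parentheses
    (roughly what the AND button does for you in the web interface)."""
    return " AND ".join(f"({t})" for t in terms)


def count_url(db, query):
    """Build an ESearch URL that asks only for the number of hits."""
    params = {"db": db, "term": query, "rettype": "count"}
    return ESEARCH + "?" + urlencode(params)


# Assumed filter tags; check them against the current PubMed help pages.
query = and_terms("cancer", "free full text[sb]", '"pubmed pmc"[sb]')
url = count_url("pubmed", query)

print(query)
print(url)
```

Opening the printed URL in a browser should return a small XML document containing the hit count, provided the filter tags are still valid.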

My results:
In part II, I found 220,219 articles on cancer in PubMed and 171,702 articles in PubMed Central. In today's experiment, I found that only 52,160 articles were shared between the two databases. That is a little under a third of the PubMed Central results and a little under a quarter of the PubMed results.

[Image: history.gif]

In other words, this diagram shows the correct situation.

[Image: shared_articles.gif]
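
The arithmetic behind that picture can be checked with a few lines of Python. This is only a sketch: it uses the three counts reported above and treats the two result lists as simple overlapping sets, which is an assumption on my part.

```python
# Counts reported in this post for the search term "cancer".
pubmed_free = 220_219   # PubMed, limited to free full text
pmc = 171_702           # PubMed Central
shared = 52_160         # found by both searches

# Articles that turn up in only one of the two searches.
only_pubmed = pubmed_free - shared
only_pmc = pmc - shared

# Everything you would collect by searching both databases.
either = only_pubmed + only_pmc + shared

print(f"Only in PubMed:         {only_pubmed:,}")
print(f"Only in PubMed Central: {only_pmc:,}")
print(f"In either database:     {either:,}")
print(f"Shared, as a share of PubMed results: {shared / pubmed_free:.1%}")
print(f"Shared, as a share of PMC results:    {shared / pmc:.1%}")
```

Under that assumption, searching PubMed alone would miss the 119,542 articles that showed up only in PubMed Central.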



What's the take home message?
PubMed Central contains articles that do not show up in a PubMed search with the free full text limit applied. So, to get as many articles as you can, you do need to search both databases. And, if that doesn't work, my commenters (here and here) have left a number of excellent suggestions!

Read the whole series:

  • part I A day in the life of an English physician,
  • part II Comparing different methods,
  • part III My new favorite method,
  • part IV One last experiment

Copyright Geospiza, Inc.


Actually, my understanding is that PubMed Central *is* a strict subset of PubMed.

But, when you search PubMed Central, you are searching the full text, and so you find some articles you don't find with a search of PubMed abstracts.

When you search PubMed, on the other hand, you will find additional articles which are not present in PubMed Central, even though they may be free on publishers' sites (unfortunately some publishers, while making content free on their own site, do not allow PubMed Central to host a copy of it).

You're quite right that the ultimate end result is that you do need to separately search both databases in order to get as complete as possible a list of free articles - which is a bit of a pain.

Hi Matt,

Thanks for explaining why the searches give different results.

I thought that PubMed Central was a subset of PubMed as well, at least until I did the experiment. PubMed Central has lots of editorials - like from the British Medical Journal - that I just didn't see when I searched PubMed.