Now on ScienceBlogs: Dr. Rolando Arafiles: Antivaccine rhetoric, colloidal silver for the flu, and Morgellons disease

Enter to Win

Mixing Memory

An entrée of Cognitive Science with an occasional side of whatever the hell else I want to talk about.

Search

Profile

No3.jpg Cognitive stuff from a cognitive person. If you've got any requests, drop me an email. If it takes me a while to get to it, drop me another one.

The lovely banners were created by Anton Oetll and Todd Hartman.

April is the cruelest month, breeding lilacs out of the dead land, mixing memory and desire, stirring dull roots with spring rain.

iloveyoupzmyers.jpg

Reading Group

The Mixing Memory Reading Group is a place for experts and non-experts alike to discuss books and papers in cognitive science.

Recent Posts

Categories

Archives

Blogs For All and For None

Cognitive Science and the Like

The Lesser Sciences

Philosophy

Feminists

Politics Or Close to It

Seriously Good But I Don't Know What to Call It

Other Links

Journals

« Jackson Pollock Is Scary! Why People Hate Modern Art | Main | Mixing Memory is 2 Years Old »

Publishing and Statistical Significance

Category: Research & Theory
Posted on: September 20, 2006 3:12 PM, by Chris

There's been some hubbub recently over a study by Gerber and Malhotra (you can get a copy in pdf here), which shows a couple things. First, political science journals don't publish many articles that report negative (null) results, but instead tend to publish those that report statistically significant results. Second, a large portion of those statistically significant results involve probabilities that are pretty damn close to .05 (the generally accepted cutoff for statistical significance). My first reactions were duh, and who cares?

Of course, I'm not a political scientist, so I can't speak for them, but in psychology, everyone has always known that it's damn hard to get null results published. And there are good reasons why that's so. For one, null results are less informative. It's difficult to tell whether they're the result of a lack of the hypothesized relationship between your variables, or instead, the result of chance or methodological problems (especially a lack of statistical power). So, if you want to publish a null result, you've got a bunch of extra work to do. In addition to calculating power (which people in some discinplines do automatically, anyway), you're almost certainly going to have to run extra variations of the study (even more than you would with a statistically significant result) to show that your null result wasn't the result of methodological problems.

There's another good reason why they aren't published: they're not expected! I know, on the surface, it looks like they should be expected, because typically there's at least a 95% chance that you won't get statistically significant results. But in reality, getting statistically significant results is actually pretty likely, because researchers generally don't conduct studies unless they're pretty confident, for theoretical reasons or whatever, that the hypothesized relationship between their variables exists. So if you get null results, it's actually pretty surprising.

When null results do get published, it's usually because the researchers had good reason to expect them, so they undertook the extra steps required to make null results publishable. In general, that's only interesting when there's a heated debate about the relationship between two variables, and one theory predicts that there is none. Even then, there are often better ways of demonstrating that than producing null results. You run other studies that test (non-null) hypotheses that distinguish between competing theories, for example.

As for why so many of the results cluster around the .05 level, well, that's probably a cultural thing. Researchers tend to be overly obsessed with statistical significance (as this study ironically shows), and that means that when you've got results that are approaching significance, you're going to employ a few tricks to get closer to it. In psychology, for example, you might run a few more participants than you'd planned, thereby increasing your statistical power, or you might tweak your methodology and rerun the experiment. In most cases, I think these solutions are harmless, particularly since there's no real a priori reason to be obsessed with the .05 level in the first place. If you're close, but not below it, chances are you're onto something, but you need the extra little push to get people to pay attention. If you're close, but actually committing a Type I error, chances are subsequent research will discover that. Sure, it might cause people to use time and resources driving down theoretical dead ends, but that's just the way science works, and making a big deal out of it is kind of silly. So again, I say, duh, and who cares?

TrackBacks

TrackBack URL for this entry: http://scienceblogs.com/mt/pings/21715

Comments

1

Good points, all. Another reason to worry about "null" results is that they are frequently misinterpreted to mean the null hypothesis is true, not that it cannot be rejected as unlikely. Failure to reject still leaves open the possibility that any differences are real but there was insufficient statistical power or that some bias was involved.

Conversely, some results that are statistically significant are of no interest at all. They are just a reflection of a large number of measurements. If I were to measure the heights of all 6 year olds on the east coast and compare them to the heights of all six year olds on the west coast I doubt the average would be identical and I also guarantee you that a, say 1/64" difference, would be statistically significant. But who cares?

Posted by: revere | September 22, 2006 10:10 PM

2

Consider also this report Plus titled Why Most Published Research Findings Are False: http://medicine.plosjournals.org/perlserv?request=get-document&doi=10.1371/journal.pmed.0020124

The magazine Nature a few years ago ran a piece on this subject in which they proposed a special depository for null results. Don't know what became of it.

But clearly there is a selection bias in favor of positive results, which explains partly why so many of them are false.

Posted by: Luke Lea | September 29, 2006 5:45 PM

Post a Comment

(Email is required for authentication purposes only. On some blogs, comments are moderated for spam, so your comment may not appear immediately.)





ScienceBlogs

Search ScienceBlogs:

Go to:

Advertisement
Collective Imagination
Enter to win the daily giveaway
Advertisement
Collective Imagination

© 2006-2009 ScienceBlogs LLC. ScienceBlogs is a registered trademark of ScienceBlogs LLC. All rights reserved.