Jeffreys-Lindley Paradox

By mixingmemory on October 31, 2006.

Via Amy Perfors at the Harvard statistics blog, Social Science Statistics Blog, I learned of the Jeffreys-Lindley Paradox in statistics. The paradox is that if you have a large enough sample, you can get p-values that are very close to zero even though a Bayesian analysis of the same data favors the null hypothesis. You can read a very in-depth explanation of the paradox here.

I don't find this either surprising or worrisome, as Perfors does. While I'd never heard of the paradox before (it's really pretty cool, if you're into statistics or Bayesian reasoning), everyone who's taken a statistics course understands the perils of large sample sizes. The fact is, if you have two different groups, even from the same population, they are, by definition, two different groups, as they are composed of two different sets of individuals. As a result, where measures that are influenced by random variables are concerned, the means of the groups will be different, and if you get a large enough sample size, that difference will be statistically significant. Since everyone is aware of this, I can't imagine it's a problem. If it looks like someone's using a sample that's too large, so that any significant differences he or she might find are likely to be theoretically and practically uninteresting, people will pick up on it, either through effect size calculations or through subsequent research.
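
To make the sample-size point concrete, here is a minimal sketch in Python. It is an illustration, not anything from Perfors' post: the function name `two_sample_z_p_value`, the fixed mean difference of 0.01, and the standard deviation of 1.0 are all assumptions chosen for the example. It treats the observed difference between two equal-sized groups as fixed and tiny, and shows how the two-sided p-value from a simple z-test (known, common standard deviation) changes as the groups grow while the effect size stays put.

```python
import math


def two_sample_z_p_value(mean_diff, sd, n_per_group):
    """Two-sided p-value for a difference in means between two
    equal-sized groups, assuming a known common standard deviation."""
    # Standard error of the difference between two independent means.
    standard_error = sd * math.sqrt(2.0 / n_per_group)
    z = mean_diff / standard_error
    # erfc(|z| / sqrt(2)) equals 2 * (1 - Phi(|z|)), the two-sided tail area.
    return math.erfc(abs(z) / math.sqrt(2.0))


# Hypothetical numbers: a tiny observed difference, held fixed.
MEAN_DIFF = 0.01
SD = 1.0

for n in (100, 10_000, 1_000_000):
    p = two_sample_z_p_value(MEAN_DIFF, SD, n)
    effect_size = MEAN_DIFF / SD  # standardized difference, independent of n
    print(f"n per group = {n:>9,}  effect size = {effect_size:.2f}  p = {p:.3g}")
```

With these assumed numbers, the difference is nowhere near significant at 100 per group and far below the conventional thresholds at a million per group, even though the effect size never changes. That is the sense in which an effect size calculation flags a significant but uninteresting result.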

I just noticed this post of yours, so feel compelled to comment. :)

I definitely agree with you that to the extent people keep effect sizes in mind, this isn't a worrisome result; what I'm more worried about -- and failed to say in my post -- is related to, for lack of a better word, "meta" cognitive-science, or sociology of science. Because (as we know from much cognitive science research) people tend to think categorically, and because a significance level gives a nice "category" to fit results in, even if effect size is reported and it's small, it's easy to just notice and remember the significance level. This tendency is made worse by the fact that sometimes there is no accepted notion of how big an effect is "interesting", in the same way that there are accepted p-value thresholds. Thus, if we often run subjects until getting a significant p-value -- even if we report effect size -- what ends up staying in memory is just the result and the knowledge that it was significant. It might be better to stop collecting data earlier, thus possibly overlooking findings with small effect sizes, in order to just focus on and pinpoint the interesting and robust results.

Honestly, I don't think this is a big worry in practice, at least for a lot of reported work. I made the post mainly because I think the paradox is cool and wanted to talk about it. :) But effect size does matter, for many diverse reasons, and the more salient we make this point and the more often we emphasize it, the less I worry about being led astray by cognitive factors like those I detailed in the last paragraph.

Hi Amy, thanks for commenting. I agree with your point about cognitive factors, though I think that's where further research sorts things out. However, like you, I mostly wrote this post because I thought the paradox was cool.
