The curtain has been drawn and the secrets to data analysis revealed. Do you have a data set sitting around in need of analysis? Read this and learn how to find significant results somewhere — anywhere — in your data. Because negative results won’t get you published; and you won’t get hired/tenure if you don’t publish; and your career will be a failure. Here’s a taste:
There are always anomalies. The World Series has been swept 17 times, five more than the model would predict. Plug this into the BINOMDIST function in Excel. (Understanding how this function works is optional and may in some cases be a disadvantage.) You find that, if the probabilities in the model were correct, there would be 17 or more sweeps in 95 occurrences only 8% of the time. A rotten break: you’re three lousy percent under statistical significance.
(Via Statistical Modeling, Causal Inference, and Social Science.)