December 2005
Sun Mon Tue Wed Thu Fri Sat
        1 2 3
4 5 6 7 8 9 10
11 12 13 14 15 16 17
18 19 20 21 22 23 24
25 26 27 28 29 30 31

Authors' Committee

Chair:

Matt Blackwell (Gov)

Members:

Martin Andersen (HealthPol)
Kevin Bartz (Stats)
Deirdre Bloome (Social Policy)
Andy Eggers (Gov)
John Graves (HealthPol)
Rich Nielsen (Gov)
Maya Sen (Gov)
Gary King (Gov)

Weekly Research Workshop Sponsors

Alberto Abadie, Lee Fleming, Adam Glynn, Guido Imbens, Gary King, Arthur Spirling, Jamie Robins, Don Rubin, Chris Winship

Recent Comments

Recent Entries

Categories

Blogroll

Brad DeLong
Cognitive Daily
Complexity & Social Networks
Developing Intelligence
EconLog
The Education Wonks
Empirical Legal Studies
Free Exchange
Freakonomics
Health Care Economist
Junk Charts
Language Log
Law & Econ Prof Blog
Machine Learning (Theory)
Marginal Revolution
Mixing Memory
Mystery Pollster
New Economist
Political Arithmetik
Political Science Methods
Pure Pedantry
Science & Law Blog
Simon Jackman
Social Science++
Statistical modeling, causal inference, and social science

Archives

Notification

Powered by
Movable Type 4.24-en


« The BLOG inference engine | Main | Applied Statistics - Michael Hiscox and Nicholas Smyth »

7 December 2005

Fun with R2

Mike Kellermann

This semester, I have been one of the TFs for Gov 2000 (the introductory statistics course for Ph.D. students in the Government Department). It the first time that I've been on the teaching staff for a course, and it has been quite an experience so far. We've spent the past month or so introducing the basic linear model. Along the way, Ryan Moore (the other TF) and I have had some fun sharing the best quotes that we've come across about everyone's favorite regression output, R2:

Nothing in the CR model requires that R2 be high. Hence a high R2 is not evidence in favor of the model, and a low R2 is not evidence against it. Nevertheless, in empirical research reports, one often reads statements to the effect that "I have a high R2, so my theory is good," or "My R2 is higher than yours, so my theory is better than yours." (Arthur Goldberger, A Course in Econometrics, 1991)
Thus R2 measures directly neither causal strength nor goodness of fit. It is instead a Mulligan Stew composed of each of them plus the variance of the independent variable. Its use is best restricted to description of the shape of the point cloud with causal strength measured by the slopes and goodness of fit captured by the standard error of the regression. (Chris Achen, Interpreting and Using Regression, 1982)
Q: But do you really want me to stop using R2? After all, my R2 is higher than all of my friends and higher than those in all the articles in the last issue of APSR!
A: If your goal is to get a big R2, then your goal is not the same as that for which regression analysis was designed. The purpose of regression analysis and all of parametric statistical analyses is to estimate interesting population parameters....
If the goal is just to get a big R2, then even though that is unlikely to be relevant to any political science research question, here is some "advice": Include independent variables that are very similar to the dependent variable. The "best" choice is the dependent variable; your R2 will be 1.0. (Gary King, "How not to lie with statistics,"AJPS, 1986).

So this is old news, right? Maybe not. Quite possibly the thing that has surprised me the most so far is just how much students want R2 to tell them how good their model is. You could almost see the anguish in their faces as we read these quotes to them, particularly among those who have taken some statistics in the past. The question I want to throw out is, why is R2 such an attractive number? Why do we want to believe it? Maybe our cognitive science colleagues have some insight....

Posted by Mike Kellermann at December 7, 2005 5:00 AM