August 2008
Sun Mon Tue Wed Thu Fri Sat
          1 2
3 4 5 6 7 8 9
10 11 12 13 14 15 16
17 18 19 20 21 22 23
24 25 26 27 28 29 30
31            

Authors' Committee

Chair:

Andy Eggers (Gov)

Members:

Weihua An (Soc)
Kevin Bartz (Stats)
Sebastian Bauhoff (HealthPol)
John Graves (HealthPol)
Justin Grimmer (Gov)
Jens Hainmueller (Gov)
Mike Kellermann (Gov)
Ellie Powell (Gov)
Gary King (Gov)

Weekly Research Workshop Sponsors

Alberto Abadie, Lee Fleming, Adam Glynn, Guido Imbens, Gary King, Kevin Quinn, Jamie Robins, Don Rubin, Chris Winship

Recent Comments

Recent Entries

Categories

Blogroll

Brad DeLong
Cognitive Daily
Complexity & Social Networks
Developing Intelligence
EconLog
The Education Wonks
Empirical Legal Studies
Free Exchange
Freakonomics
Health Care Economist
Junk Charts
Language Log
Law & Econ Prof Blog
Machine Learning (Theory)
Marginal Revolution
Mixing Memory
Mystery Pollster
New Economist
Political Arithmetik
Political Science Methods
Pure Pedantry
Science & Law Blog
Simon Jackman
Social Science++
Statistical modeling, causal inference, and social science

Archives

Notification

Powered by
Movable Type 4.12


« The 80% Rule, Part II | Main | Detecting Attempted Election Theft »

3 May 2006

Sensitivity Analysis

Felix Elwert

Observational studies, however well done, remain exposed to the problem of unobserved confounding. In response, methods of formal sensitivity analysis are growing in popularity these days (see Jens's post on a related issue here.)

Rosenbaum and Rubin's basic idea is to hypothesize the existence of an unobserved covariate, U, and then to recompute point-estimates and p-values for a range of associations between this unobserved covariate and, in turn, the treatment T and the outcome Y. If moderate associations (= moderate confounding) change the inference about the effect of the treatment on the outcome we question the robustness of our conclusions.

But how to assess whether the critical association between U, T, and Y that would invalidate the standard results is large in substantive terms?

One popular strategy compares this critical association to the strength of the association between T, Y, and an important known (and observed) confounder. For example, one might say that the amount of unobserved confounding it would take to invalidate the conclusions of a study on the effect of sibship size on educational achievement would have to be at least as large as the amount of confounding generated by omitting parental education from the model.

This is indeed the strategy used in a few studies. But what if U should be taken to stand not for a single but for a whole collection of unobserved confounders? Clearly, it then is no longer credible to compare the critical association of U with the amount of confounding created by a single known covariate. Better to compare it to a larger set of observed confounders. But with larger sets of included variables, we have the problem of interactions between them, and of surpressing and amplifying relationships. In short, gauging the critical association of U with T and Y in substantive terms will become a whole lot less intuitive.

(FYI, Robins and his colleagues in epi have proposed an alternative method of sensitivity analysis, which hasn’t found followers in the social sciences yet, to my knowledge. I’m currently working on implementing their method in one of my projects.)

Posted by Felix Elwert at May 3, 2006 6:03 AM

Comments

check the paper by Altonji et al in the JPE that takes an interesting approach to this problem

Posted by: s at May 3, 2006 12:23 PM

Yes, very insightful to recognize that U really needs to be thought of as the linear composite of multiple potential confounders rather than a single hypothetical confounder. The substantive implication for this if a model does not explain a large portion of variance to begin with is that the subjective assessment of the possible effect sum of multiple potential confounders can be *much* larger than a single confounder, thereby invalidating judgments from sensitivity analyses that are based on speculation about a single hypothetical confounder.

Posted by: Jeremy at May 3, 2006 4:41 PM

> But what if U should be taken to stand not for
> a single but for a whole collection of
> unobserved confounders?

You can actually think of U as capturing a whole array of confounders and not only a single confounder. In fact, on can show that within the Rosenbaum sensitivity framework, the non-parametric Manksi bounds are an extreme case of hidden bias arising from U.

Jens

Posted by: Jens at May 4, 2006 9:33 AM

You are right, Jens, but that's not what I'm driving at. Rather, the problem I mentioned concerns gauging the critical associations of U in _substantive_ terms. In practice, most every inference falls apart if we are willing to allow for the existence of a sufficiently large unobserved U (hardly any no-assumption bound doesn't include zero). The question is: how much confounding is a lot of confounding? In other words, when does the size of a critical U put us at ease and when should it ring alarms? Rosenbaum and Rubin have suggested an interpretation that''s clearly very helpful, but it's also clearly not going to solve the issue for all applications.

Posted by: Felix Elwert at May 4, 2006 12:27 PM

Hi Felix,

yes this remains a problem I agree. But I think it will necessarily boil down to a judgment call; the magnitude of the hidden bias that turns over the findings has to be evaluated for each particular application. In particular it will matter on how many observed confounders one has conditioned already.

For example, consider the seminal Cornfield et al (1959) analysis that first assessed sensitivity to hidden bias necessary to invalidate the relationship between smoking and lung cancer. They found that hidden bias alone would have to be a nearly perfectly correlated with lung cancer and about 10 times more prevalent in smokers than non-smokers. That such a variable exists is highly unlikely (in particular from a genetic perspective). The burden of proof is then on the critics to find this magic omitted variable.

In other cases that may not be so clear of course. But I would not trust any findings that break down at low gamma values already (like 1.5 or 2) unless one has conditioned on many many confounders.

Best,
Jens

Posted by: Jens at May 4, 2006 2:38 PM