What does lack of evidence of a causal relationship tell you?

post by James_Miller · 2011-06-08T19:03:45.283Z · LW · GW · Legacy · 10 comments

Imagine that you know there is a strong correlation between X and Y. Statistically competent scholars have extensively examined the causal relationship between X and Y. They have failed to find a significant causal relationship, but they have also failed to rule out the possibility that one exists.

Would it be reasonable for you to claim that the causal relationship between X and Y probably isn't too strong, since otherwise it would have shown up clearly in the statistical analysis? At the very least, should learning of the scholars' negative results cause you to lower your estimate of the strength of the causal relationship between X and Y?

10 comments


comment by falenas108 · 2011-06-08T20:02:28.445Z · LW(p) · GW(p)

Yes, see this for details on why this is so.

But in probability theory, absence of evidence is always evidence of absence. If E is a binary event and P(H|E) > P(H), "seeing E increases the probability of H"; then P(H|~E) < P(H), "failure to observe E decreases the probability of H". P(H) is a weighted mix of P(H|E) and P(H|~E), and necessarily lies between the two.
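The quoted claim follows from the law of total probability, P(H) = P(H|E)P(E) + P(H|~E)P(~E). A minimal numeric sketch of that identity is below; the function name and the three input probabilities are hypothetical values chosen only for illustration, not figures from the thread.

```python
def posterior_given_not_e(p_h, p_e, p_h_given_e):
    """Solve P(H) = P(H|E) P(E) + P(H|~E) (1 - P(E)) for P(H|~E)."""
    if not 0.0 < p_e < 1.0:
        raise ValueError("P(E) must be strictly between 0 and 1")
    return (p_h - p_h_given_e * p_e) / (1.0 - p_e)


# Hypothetical numbers, for illustration only.
p_h = 0.30          # prior probability of H
p_e = 0.40          # probability of observing E
p_h_given_e = 0.60  # observing E raises the probability of H

p_h_given_not_e = posterior_given_not_e(p_h, p_e, p_h_given_e)

# The prior lies between the two posteriors, so failing to observe E
# must lower the probability of H whenever observing E would raise it.
print(p_h_given_not_e < p_h < p_h_given_e)
```

The inputs have to be mutually consistent (here P(H|E)P(E) = 0.24 does not exceed P(H) = 0.30); otherwise the solved value would fall outside the range 0 to 1.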

Replies from: James_Miller
comment by James_Miller · 2011-06-08T20:13:29.265Z · LW(p) · GW(p)

Thanks! The second-to-last paragraph from your EY citation was exactly what I was looking for.

comment by GuySrinivasan · 2011-06-08T19:33:57.834Z · LW(p) · GW(p)

The causal structure has to look like something. An obscure common cause is the most obvious (to me) source of the correlation if no one's put forth a plausible causal relationship yet. I'm not sure what you mean, though, by "causal relationship between X and Y"... do you mean specifically a relationship of the form "X -> ..Z.. -> Y" / "X <- ..Z.. <- Y", or do you mean "any causal structure connecting X and Y in any way"?

(Are you interested in some specific X,Y but phrasing it generally so we don't get distracted? I feel like seeing some examples of the failed tests run by statistically competent scholars would help me know what they haven't ruled out.)

Replies from: James_Miller
comment by James_Miller · 2011-06-08T19:45:00.302Z · LW(p) · GW(p)

I'm mostly interested in whether X causes Y vs. whether some Z causes both X and Y.

Replies from: Richard_Kennaway, Cyan
comment by Richard_Kennaway · 2011-06-09T10:02:55.333Z · LW(p) · GW(p)

I didn't find that clear from your article. A correlation between X and Y tells you no more than that causality is present somewhere. It tells you absolutely nothing about whether X causes Y, Y causes X, Z causes X and Y, how long the causal chains are, or whether it's a sampling artefact due to common effects of X and Y.

comment by Cyan · 2011-06-08T20:46:01.644Z · LW(p) · GW(p)

Those options aren't mutually exclusive...

Replies from: Richard_Kennaway
comment by Richard_Kennaway · 2011-06-09T09:57:41.402Z · LW(p) · GW(p)

Or exhaustive. Imperfect sampling can produce sample correlations among variables with no causal connection. (Toy example: X and Y are independent, Z is jointly caused by X and Y and is equal to X+Y, and everyone is unwittingly sampling from a subpopulation with a narrow range of values of Z. Sample X and Y will have a high negative correlation.)
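The toy example can be checked with a short simulation. This is only a sketch under assumptions not stated in the comment: X and Y are drawn as independent standard normals, and the "narrow range" of Z is taken to be |X+Y| < 0.1. The helper `pearson` and the seed are likewise illustrative choices.

```python
import random


def pearson(xs, ys):
    """Sample Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / (var_x * var_y) ** 0.5


rng = random.Random(0)  # fixed seed so the run is repeatable

# X and Y are independent by construction; Z = X + Y is their common effect.
population = [(rng.gauss(0, 1), rng.gauss(0, 1)) for _ in range(100_000)]

# The unwitting sampler only ever sees cases where Z falls in a narrow range.
selected = [(x, y) for x, y in population if abs(x + y) < 0.1]

r_all = pearson(*zip(*population))
r_selected = pearson(*zip(*selected))

print(f"whole population: r = {r_all:+.3f}")
print(f"narrow-Z sample:  r = {r_selected:+.3f}")
```

In the whole population the correlation should come out near zero, while in the selected subpopulation it should be strongly negative, because fixing X+Y forces Y to fall as X rises.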

Replies from: Douglas_Knight
comment by Douglas_Knight · 2011-06-10T04:07:50.918Z · LW(p) · GW(p)

Could you give a concrete example of such sampling bias?

Replies from: Richard_Kennaway
comment by Richard_Kennaway · 2011-06-10T07:01:30.408Z · LW(p) · GW(p)

A real one? Not offhand, not being a statistician, but sampling bias is a standard problem that has to be guarded against in statistical investigations. It can affect not just the sample means of variables, but correlations and indeed every statistic whatsoever.

To flesh out the toy example with an imaginary narrative, suppose X = intelligence, Y = effort, and Z = exam grade. Suppose Z is highly correlated with X+Y. If we divide the population up by exam grade, we may find that in every subpopulation, X and Y are negatively correlated, even while in the whole population, X and Y are uncorrelated.
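The imaginary narrative can also be sketched as a simulation. Everything numeric here is an assumption made for illustration: intelligence and effort are independent standard normals, the grade is their sum plus noise with standard deviation 0.5, and the population is split into five equal-sized grade bands.

```python
import random


def pearson(xs, ys):
    """Sample Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / (var_x * var_y) ** 0.5


rng = random.Random(1)  # fixed seed so the run is repeatable
n = 50_000
n_bands = 5

students = []
for _ in range(n):
    intelligence = rng.gauss(0, 1)
    effort = rng.gauss(0, 1)
    # The grade tracks intelligence + effort closely, but not perfectly.
    grade = intelligence + effort + rng.gauss(0, 0.5)
    students.append((grade, intelligence, effort))

r_whole = pearson([s[1] for s in students], [s[2] for s in students])

# Divide the population into equal-sized bands by exam grade.
students.sort()  # tuples sort by their first element, the grade
band_size = n // n_bands
r_by_band = []
for i in range(n_bands):
    band = students[i * band_size:(i + 1) * band_size]
    r_by_band.append(pearson([s[1] for s in band], [s[2] for s in band]))

print(f"whole population: r = {r_whole:+.3f}")
for i, r in enumerate(r_by_band, start=1):
    print(f"grade band {i}:     r = {r:+.3f}")
```

Under these assumptions every grade band should show a clearly negative correlation between intelligence and effort, even though the two are uncorrelated across the whole population.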

comment by Armok_GoB · 2011-06-09T11:45:42.822Z · LW(p) · GW(p)

If there is no correlation between how many apples I drop from a height and how many hit the bottom, then the most likely hypothesis is either that there is some kind of barrier and the fallen apples I'm noticing come from an unrelated source, or that some process is reacting to my dropping X apples by dropping k-X apples.