Why do Minimal Bayes Nets often correspond to Causal Models of Reality?

post by Dalcy (Darcy) · 2024-08-03T12:39:44.085Z · LW · GW · 1 comment

This is a question post.

Chapter 2 of Pearl's Causality book claims you can recover causal models given only the observational data, under very natural assumptions of minimality and stability[1].

In graphical models lingo, Pearl identifies a causal model of the observational distribution with the distribution's perfect map (if one exists).
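
As a toy illustration of what a perfect map does and does not pin down, here is a minimal sketch. It is not from Pearl's text: the linear-Gaussian setup with unit-variance noise, the three example graphs, and the helper names `covariance`, `corr`, and `partial_corr` are all assumptions made for this example. For Gaussian variables, zero partial correlation is the same as conditional independence, so the independencies can be read off the covariance matrix.

```python
from math import sqrt

def covariance(order, weights):
    """Covariance of a linear SEM with independent unit-variance noise.

    order   -- variable names in a topological order of the DAG
    weights -- {(parent, child): coefficient}
    """
    # Write each variable as a linear combination of the noise terms.
    coef = {}
    for v in order:
        row = {v: 1.0}  # the variable's own noise term
        for (parent, child), w in weights.items():
            if child == v:
                for noise, a in coef[parent].items():
                    row[noise] = row.get(noise, 0.0) + w * a
        coef[v] = row
    # Independent unit-variance noise: Cov(a, b) is a dot product of coefficients.
    return {
        (a, b): sum(coef[a].get(n, 0.0) * coef[b].get(n, 0.0) for n in order)
        for a in order
        for b in order
    }

def corr(cov, a, b):
    return cov[a, b] / sqrt(cov[a, a] * cov[b, b])

def partial_corr(cov, a, b, given):
    """Correlation of a and b after conditioning on a single variable."""
    r_ab, r_ag, r_bg = corr(cov, a, b), corr(cov, a, given), corr(cov, b, given)
    return (r_ab - r_ag * r_bg) / sqrt((1 - r_ag**2) * (1 - r_bg**2))

# Three hypothetical generating graphs over the same variables.
collider = covariance(["X", "Y", "Z"], {("X", "Z"): 1.0, ("Y", "Z"): 1.0})
chain = covariance(["X", "Z", "Y"], {("X", "Z"): 1.0, ("Z", "Y"): 1.0})
reversed_chain = covariance(["Y", "Z", "X"], {("Y", "Z"): 1.0, ("Z", "X"): 1.0})

for name, cov in [("X -> Z <- Y", collider),
                  ("X -> Z -> Y", chain),
                  ("Y -> Z -> X", reversed_chain)]:
    print(f"{name}: corr(X,Y) = {corr(cov, 'X', 'Y'):.3f}, "
          f"corr(X,Y | Z) = {partial_corr(cov, 'X', 'Y', 'Z'):.3f}")
```

In this sketch the chain and its reversal show the same pattern (X and Y correlated, but uncorrelated given Z), so the independencies alone cannot orient their edges. The collider shows the opposite pattern (uncorrelated marginally, correlated given Z), which is why it can be told apart from the other two.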

But I'm confused about a pretty fundamental point: "What does this have to do at all with causality??" More precisely:

To be clear, Pearl acknowledges this in his Temporal Bias Conjecture (2.8.2):

"In most natural phenomena, the physical time coincides with at least one statistical time."

And Pearl conjectures that this is possibly because human language is optimized such that our [choice of variables / factorization of reality] makes the Temporal Bias Conjecture true.

I ... guess that could be an explanation? But honestly I don't think I understand his point very well and I find it pretty unsatisfying. I would appreciate any explanation as to why it makes sense to identify perfect maps with Causal Models.

  1. ^

    Minimality: Choose the network structure that is minimally expressive among those that can express the observational distribution.

    • This is pretty reasonable imo, Occam's razor blah blah

    Stability: Assume that there exists a network structure that perfectly captures all and only the independencies implied by the observational distribution, i.e. independencies are structural.

    • Stability is a reasonable assumption, since it would be pretty unlikely for the conditional probability distributions to be fine-tuned so as to cancel each other out and induce an independency not present in the network.
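
The kind of fine-tuning that stability rules out can be sketched with a small simulation. This is a hypothetical example, not taken from the book: the graph X -> Y -> Z plus X -> Z, the linear-Gaussian mechanisms, the coefficient values, and the names `sample_corr` and `corr_x_z` are all assumptions chosen for illustration. When the direct effect `c` exactly equals `-a * b`, the two paths from X to Z cancel and X looks independent of Z even though the graph has an edge between them.

```python
import random
from math import sqrt

def sample_corr(xs, ys):
    """Plain Pearson correlation of two equal-length samples."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy / sqrt(sxx * syy)

def corr_x_z(a, b, c, n=20_000, seed=0):
    """Sample corr(X, Z) in the DAG X -> Y -> Z plus X -> Z (linear-Gaussian)."""
    rng = random.Random(seed)
    xs, zs = [], []
    for _ in range(n):
        x = rng.gauss(0, 1)
        y = a * x + rng.gauss(0, 1)          # X -> Y with coefficient a
        z = b * y + c * x + rng.gauss(0, 1)  # Y -> Z with b, X -> Z with c
        xs.append(x)
        zs.append(z)
    return sample_corr(xs, zs)

# Fine-tuned: the direct effect c cancels the indirect effect a * b exactly.
tuned = corr_x_z(a=1.0, b=1.0, c=-1.0)
# Nudging any one parameter breaks the cancellation.
perturbed = corr_x_z(a=1.0, b=1.0, c=-0.5)

print(f"tuned: {tuned:+.3f}, perturbed: {perturbed:+.3f}")
```

Under these assumptions the tuned correlation should come out close to zero while the perturbed one is clearly nonzero. The cancellation holds only on the measure-zero set of parameters where `c == -a * b`, which is the usual argument for why such independencies are considered unlikely.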

comment by cubefox · 2024-08-04T00:21:59.288Z · LW(p) · GW(p)

Your question may be aimed only at people who have studied the relevant part of the book, but to me it is very unclear what you mean here by "recover" and "express" in "recover causal models given only the training data" or "minimally expressive among those that can express the observational distribution".