Using Finite Factored Sets for Causal Representation Learning?

post by David Reber (derber) · 2023-01-11T22:06:43.831Z · LW · GW · No comments

This is a question post.

Contents

  Answers
    3 Matthias G. Mayer
None
No comments

One of the primary motivations of Finite Factored Sets [LW · GW](shorthand: FFS) that initially caught my eye was, to quote Scott:

"Given a collection of variables and a joint probability distribution over those variables, Pearl can infer causal/temporal relationships between the variables.", the words "Given a collection of variables" are actually hiding a lot of the work.

And this becomes apparent with the toy model Magdalena analyzes at the end of her distillation post [LW · GW]: taking variables as primitives (as in the Pearl framework) means you have to make arguments about 'whether deterministic collapse occurs', rather than  variables arising naturally as in finite factored sets.

So:

EDIT: It may help to know that my motivation is "Can we apply a FFS algorithm for causal representation learning to learn objects (and physics) from a video? Or (more directly for alignment), to identify latent concepts embedded in a LLM?"

Answers

answer by Matthias G. Mayer · 2023-01-11T23:54:38.848Z · LW(p) · GW(p)

I'm working on the FFS framework in general. I'm currently writing up decidability of finite temporal inference. After this I will probably start working on efficient finite temporal inference which is what you're referencing if I understood correctly.

I'm also working on extending the framework to the infinite setting and am almost finished except for conditional orthogonality for uncountable sets.

I quite like the name Logical Time Theory, under which I will probably publish those results in a month or so.

comment by David Reber (derber) · 2023-01-12T20:03:02.818Z · LW(p) · GW(p)

I'm also working on extending the framework to the infinite setting and am almost finished except for conditional orthogonality for uncountable sets.

 

Hmm, what would be the intuition/application behind the uncountable setting? Like, when would one want that (I don't mind if it's niche, I'm just struggling to come up with anything)?

Replies from: matthias-georg-mayer
comment by Matthias G. Mayer (matthias-georg-mayer) · 2023-01-17T18:14:14.391Z · LW(p) · GW(p)

A direct application would need that you have an uncountable variable. You might want to do this if you have enough evidence to say this confidently. As a simple example imagine a real-valued graph where all your data points lie almost on the identity diagonal. You might then want to infer a variable which is the identity.

As a more general application, we want to model infinities because the world is probably infinite in some aspects. We then want a theorem that tells us, that even if the underlying model is infinite, if you have enough data points then you are close enough, like with the Strong law of Large numbers, for example.

No comments

Comments sorted by top scores.