$500 Bounty Problem: Are (Approximately) Deterministic Natural Latents All You Need?

johnswentworth

$500 Bounty Problem: Are (Approximately) Deterministic Natural Latents All You Need?

post by johnswentworth, David Lorell · 2025-04-21T20:19:30.808Z · LW · GW · 0 comments

  Some Intuition From The Exact Case
    Approximation Adds Qualitatively New Behavior
  The Problem
    "Stochastic" Natural Latents
    "Deterministic" Natural Latents
      See the linked post for explanation of a variable appearing multiple times in a diagram, and how the approximation conditions for those diagrams simplify to entropy bounds.
    What We Want For The Bounty
    Why We Want This
None
No comments

Our posts on natural latents have involved two distinct definitions, which we call "stochastic" and "deterministic" natural latents. We conjecture that, whenever there exists a stochastic natural latent (to within some approximation), there also exists a deterministic natural latent (to within a comparable approximation). We are offering a $500 bounty to prove this conjecture.

Some Intuition From The Exact Case

In the exact case, in order for a natural latent to exist over random variables , the distribution has to look roughly like this:

Each value of $X_{1}$ and each value of $X_{2}$ occurs in only one "block", and within the "blocks", $X_{1}$ and $X_{2}$ are independent. In that case, we can take the (exact) natural latent to be a block label.

Notably, that block label is a deterministic function of X.

However, we can also construct other natural latents for this system: we simply append some independent random noise to the block label. That natural latent is not a deterministic function of X; it's a "stochastic" natural latent.

In the exact case, if a stochastic natural latent exists, then the distribution must have the form pictured above, and therefore the block label is a deterministic natural latent. In other words: in the exact case, if a stochastic natural latent exists, then a deterministic natural latent also exists. The goal of the $500 bounty is to prove that this still holds in the approximate case.

Approximation Adds Qualitatively New Behavior

If you want to tackle the bounty problem, you should probably consider a distribution like this:

Distributions like this can have approximate natural latents, while being qualitatively different from the exact picture. A concrete example is the biased die: $X_{1}$ and $X_{2}$ are each 500 rolls of a biased die of unknown bias (with some reasonable prior on the bias). The bias itself will typically be an approximate stochastic natural latent, but the lower-order bits of the bias are not approximately deterministic given X (i.e. they have high entropy given X).

The Problem

"Stochastic" Natural Latents

Stochastic natural latents were introduced in the original Natural Latents post [LW · GW]. Any latent $Λ$ over random variables $X_{1}, X_{2}$ is defined to be a stochastic natural latent when it satisfies these diagrams:

... and $Λ$ is an approximate stochastic natural latent (with error $ϵ$ ) when it satisfies the approximate versions of those diagrams to within $ϵ$ , i.e.

$ϵ \geq D_{K L} (P [X, Λ] | | P [Λ] P [X_{1} | Λ] P [X_{2} | Λ])$

$ϵ \geq D_{K L} (P [X, Λ] | | P [X_{2}] P [X_{1} | X_{2}] P [Λ | X_{1}])$

$ϵ \geq D_{K L} (P [X, Λ] | | P [X_{1}] P [X_{2} | X_{1}] P [Λ | X_{2}])$

Key thing to note: if $Λ$ satisfies these conditions, then we can create another stochastic natural latent $Λ^{'}$ by simply appending some random noise to $Λ$ , independent of $X$ . This shows that $Λ$ can, in general, contain arbitrary amounts of irrelevant noise while still satisfying the stochastic natural latent conditions.

"Deterministic" Natural Latents

Deterministic natural latents were introduced in a post by the same name [LW · GW]. Any latent $Λ$ over random variables $X_{1}, X_{2}$ is defined to be a deterministic natural latent when it satisfies these diagrams:

... and $Λ$ is an approximate deterministic natural latent (with error $ϵ$ ) when it satisfies the approximate versions of those diagrams to within $ϵ$ , i.e.

$ϵ \geq D_{K L} (P [X, Λ] | | P [Λ] P [X_{1} | Λ] P [X_{2} | Λ])$

$ϵ \geq H (Λ | X_{1})$

$ϵ \geq H (Λ | X_{2})$

See the linked post for explanation of a variable appearing multiple times in a diagram, and how the approximation conditions for those diagrams simplify to entropy bounds.

Note that the deterministic natural latent conditions, either with or without approximation, imply the stochastic natural latent conditions; a deterministic natural latent is also a stochastic natural latent.

Also note that one can instead define an approximate deterministic natural latent via just one diagram, and this is also a fine starting point for purposes of this bounty:

What We Want For The Bounty

We'd like a proof that, if a stochastic natural latent exists over two variables $X_{1}, X_{2}$ to within approximation $ϵ$ , then a deterministic natural latent exists over those two variables to within approximation roughly $ϵ$ . When we say "roughly", e.g. $2 ϵ$ or $3 ϵ$ would be fine; it may be a judgement call on our part if the bound is much larger than that.

We're probably not interested in bounds which don't scale to zero as $ϵ$ goes to zero, though we could maybe make an exception if e.g. there's some way of amortizing costs across many systems such that costs go to zero-per-system in aggregate (though we don't expect the problem to require those sorts of tricks).

Bounds should be global, i.e. apply even when $ϵ$ is large. We're not interested in e.g. first or second order approximations for small $ϵ$ unless they provably apply globally.

We might also award some fraction of the bounty for a counterexample. That would be much more of a judgement call, depending on how thoroughly the counterexample kills hope of any conjecture vaguely along these lines.

In terms of rigor and allowable assumptions, roughly the level of rigor and assumptions in the posts linked above is fine.

Why We Want This

Deterministic natural latents are a lot cleaner both conceptually and mathematically than stochastic natural latents. Alas, they're less general... unless this conjecture turns out to be true, in which case they're not less general. That sure would be nice.

0 comments

Comments sorted by top scores.

$500 Bounty Problem: Are (Approximately) Deterministic Natural Latents All You Need?

Contents

Some Intuition From The Exact Case

Approximation Adds Qualitatively New Behavior

The Problem

"Stochastic" Natural Latents

"Deterministic" Natural Latents

What We Want For The Bounty

Why We Want This

0 comments