Non-Monotonic Infra-Bayesian Physicalism

post by Marcus Ogren · 2025-04-02T12:14:19.783Z · LW · GW · 0 comments

Contents

  Why Physicalism?
  Γ and 2^Γ: Formalizing Instantiation
  Definitions
  Bridge Transforms and Monotonicity
  Properties of computationalist hypotheses
  Further reading and future research

Infra-Bayesian physicalism (IBP) is a mathematical formalization of computationalist metaphysics: the view that whether something exists is a matter of whether a particular computation is instantiated anywhere in the universe in any way. In this post, we cover the basics of IBP, both rigorously and informally; discuss its relevance to agent foundations and the alignment problem; and present new definitions that remove the biggest limitation of the previous formulation of IBP: the monotonicity requirement. You don't need to have read the original post on IBP [LW · GW] to understand this post, though prior familiarity with infra-Bayesianism is useful for understanding the technical parts.

Why Physicalism?

A physicalist agent - that is, an agent that uses IBP - does not regard itself as inherently special. This is in contrast to a Cartesian agent. A precise definition of an IBP agent is highly technical and can be found in the original IBP post, but here are some ways in which physicalist agents and Cartesian agents differ:

IBP naturally solves many problems in agent foundations:

While it's important to solve these problems for their own sake, perhaps a bigger point is this: Based on priors, it seems unlikely that an arbitrary solution to the problem of privilege would also address all these other problems incidentally. The fact that IBP does address these problems[4] is evidence that IBP is getting at some "deeper truth" in agent foundations, and (more speculatively) that agentic behaviors arising in AI systems (LLMs or otherwise) may eventually converge to behaving similarly to physicalist agents. Moreover, IBP provides the necessary framework for Physicalist Superimitation [LW · GW], a proposed solution to the alignment problem.

Γ and 2^Γ: Formalizing Instantiation

A physicalist agent cares about which computations are instantiated in the universe. This raises the question: What, exactly, does it mean for a computation to be instantiated? In this section, we provide a rigorous formalism that also allows for possibilities beyond the binary of "instantiated" and "not instantiated".

We denote the space of "computational universes" by Γ. We think of an element of Γ as specifying the result of every computation.[5] For example:

However, we don't need to worry about all computational facts, so we take Γ to be a finite set.[6] Mathematically, that's all there is to it: Γ is a finite set whose elements encode conceivable results for all the computations we care about. (For most elements of Γ, some of these encoded results will be incorrect.) In toy examples (such as the ones in the post on IBP and quantum mechanics [LW · GW]), Γ may consist solely of how the source code of the physicalist agent in question responds to various inputs.
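To make this concrete, here's a toy sketch in Python. The encoding is mine and purely illustrative: Γ is just the set of all ways of assigning a purported result to each computation we care about.

```python
# Toy model (names and encoding are illustrative, not standard IBP notation):
# we track two computations and the candidate results each could have.
from itertools import product

# Candidate results for each computation we care about.
candidate_results = {
    "7*9": [63, 64],          # purported answers to "What's 7*9?"
    "halts(P)": [True, False],  # purported answer to "Does program P halt?"
}

# A computational universe y assigns one purported result to every
# computation; Gamma is the (finite) set of all such assignments.
names = sorted(candidate_results)
Gamma = [dict(zip(names, results))
         for results in product(*(candidate_results[n] for n in names))]

print(len(Gamma))  # 2 * 2 = 4 computational universes
```

Most of these four elements encode at least one incorrect result (e.g. the ones claiming 7*9 = 64), exactly as described above.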

Now we consider the set 2^Γ: the set of all possible subsets of Γ. Basically, we view an element α ∈ 2^Γ as the set of all computational universes that are consistent with a particular history of the physical universe; we'll explain this connection more rigorously when we define bridge transforms. Here we discuss the interpretation of a given element of 2^Γ.

Suppose that for all y ∈ α, y says that 7 × 9 = 63. This means that only a single result of the computation "What's 7 × 9?" is consistent with α, so we say that this computation is instantiated according to α. If some y ∈ α said that 7 × 9 = 63 and another y′ ∈ α said that 7 × 9 = 64, it would mean that nothing in the physical universe could depend on which of these (if either) was correct. It would require that nobody put "7 × 9" into a functional calculator, that nobody calculated 7 × 9 in their head, that 63 items were never arranged in a 7 by 9 grid, etc. Similarly, if every y in α gives exactly the same precise mathematical answer to "How would <gigantic mess of code that precisely describes all of your neurons and their interactions> respond to <precise mathematical description of the sensory inputs from being passionately kissed by a tall, strong man in the middle of a thunderstorm>?", it means that the computation of your brain experiencing a passionate kiss during a thunderstorm has been instantiated according to α. Or, in layman's terms, α says you've been kissed by a tall, strong man in the middle of a thunderstorm.[7]
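This notion of instantiation is easy to sketch in code (the helper below is mine, not standard IBP notation): a computation is instantiated according to α iff every computational universe in α agrees on its result.

```python
# Sketch of "instantiated according to alpha": alpha is a set of
# computational universes (dicts mapping computation names to results);
# a computation is instantiated iff only one result survives in alpha.
def instantiated(alpha, computation):
    results = {y[computation] for y in alpha}
    return len(results) == 1

# Two computational universes that agree on 7*9 but disagree on halts(P):
alpha = [
    {"7*9": 63, "halts(P)": True},
    {"7*9": 63, "halts(P)": False},
]
print(instantiated(alpha, "7*9"))       # True: only one result survives
print(instantiated(alpha, "halts(P)"))  # False: physics doesn't pin it down
```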

You can still have imperfect information about the result of a computation without it being instantiated. If you're a straight man who dislikes getting wet, you can conclude that being kissed by a man during a thunderstorm would be unpleasant without the full computation being run. Only knowing the mathematically precise manner in which it's unpleasant requires the computation being fully instantiated. This is how physicalist agents naturally avoid mindcrime; if a physicalist agent wants to know some facts about a computation but having that computation be instantiated could incur a significant loss, the agent may use some heuristics to narrow down the possible results of the computation, but would avoid conducting a detailed simulation that would tell it everything.

For later definitions, we'll need the set elΓ := {(y, α) ∈ Γ × 2^Γ : y ∈ α}. This is the set of (y, α) pairs such that y is consistent with α, where α can be anything in 2^Γ containing y. elΓ is the domain of a physicalist agent's loss function.

Definitions

We start with some definitions from infra-Bayesianism.

Definition 1: A contribution θ on a finite[8] set X is a function from X to the interval [0, 1] with Σ_{x ∈ X} θ(x) ≤ 1. The space of all contributions on X is denoted ΔcX.

Basically, a contribution is a probability distribution, except the "probabilities" may sum to something less than one.
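A minimal sketch of this definition (helper names are mine): on a finite set, a contribution is just a nonnegative assignment of mass with total at most one.

```python
# A contribution on a finite set X, represented as a dict from elements
# of X to [0, 1] with total mass at most 1.
def is_contribution(theta, tol=1e-9):
    return (all(-tol <= p <= 1 + tol for p in theta.values())
            and sum(theta.values()) <= 1 + tol)

# The natural order on contributions: pointwise domination.
def leq(theta1, theta2):
    keys = set(theta1) | set(theta2)
    return all(theta1.get(x, 0.0) <= theta2.get(x, 0.0) for x in keys)

theta = {"x1": 0.3, "x2": 0.4}   # total mass 0.7 < 1: a valid contribution
print(is_contribution(theta))     # True
print(leq({"x1": 0.1}, theta))    # True: dominated pointwise
```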

There is a natural order on ΔcX: Given θ, θ′ ∈ ΔcX, θ ≤ θ′ iff θ(x) ≤ θ′(x) for all x ∈ X.

Definition 2: An ultracontribution[9] Θ on X is a set of contributions on X that is:

  1. Closed (when viewed as a subset of [0, 1]^X)
  2. Convex (If θ, θ′ ∈ Θ and p ∈ [0, 1], then pθ + (1 − p)θ′ ∈ Θ)
  3. Downward closed (If θ ∈ Θ and θ′ ≤ θ, then θ′ ∈ Θ)
  4. Nonempty (Due to downward closure, this equivalently means that Θ must contain the contribution 0)

The space of all ultracontributions on X is denoted □X.

None of the conditions for a set of contributions to be an ultracontribution is of immense conceptual importance; thinking of an ultracontribution as an arbitrary set of contributions is sufficient for developing an intuition.

For thinking about ultracontributions, it's helpful to compare Bayesian agents to infra-Bayesian agents. A Bayesian agent's worldview consists of a single probability distribution, and the Bayesian agent will take whatever actions will minimize the expected value of its loss function for this probability distribution.[10] An infra-Bayesian agent's worldview consists of a single ultracontribution - a set of contributions (which is like a set of probability distributions). An infra-Bayesian agent will take whatever actions will minimize the worst expected loss across all of these contributions. The infra-Bayesian agent regards every contribution in its ultracontribution as a possibility. There is no notion of one contribution being "more likely" than another; every contribution in the ultracontribution is on even footing.
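The decision rule described above can be sketched directly. In this illustration (all names are mine), an ultracontribution is represented by finitely many of its contributions, and the agent minimaxes over them:

```python
# Sketch of the infra-Bayesian decision rule: choose the action that
# minimizes the worst-case expected loss over all contributions in the
# agent's ultracontribution.
def expected_loss(theta, loss):
    return sum(p * loss[x] for x, p in theta.items())

def infra_bayesian_choice(actions, ultracontribution, loss_of):
    """loss_of(action) -> dict mapping outcomes to losses."""
    return min(actions,
               key=lambda a: max(expected_loss(theta, loss_of(a))
                                 for theta in ultracontribution))

# Two outcomes, two actions; the ultracontribution is represented here
# by finitely many contributions, treated on even footing.
ultra = [{"rain": 0.9, "sun": 0.1}, {"rain": 0.1, "sun": 0.9}]
losses = {"umbrella": {"rain": 1, "sun": 2}, "none": {"rain": 10, "sun": 0}}
best = infra_bayesian_choice(["umbrella", "none"], ultra, losses.get)
print(best)  # "umbrella": its worst-case loss (1.9) beats "none"'s (9)
```

Note that neither contribution is "more likely" than the other; the agent simply guards against the worst one.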

Other concepts we will need:

Recalling our previous discussion of Γ and 2^Γ, we now have the tools for defining a computationalist hypothesis.

Definition 3: A computationalist hypothesis is a Θ ∈ □(Γ × 2^Γ) such that

  1. supp Θ ⊆ elΓ
  2. For all mappings s: Γ → Γ, pushing any θ ∈ Θ forward along (y, α) ↦ (s(y), α), while discarding the mass on pairs with s(y) ∉ α, yields a contribution that is also in Θ

The first condition means that computationalist hypotheses are internally consistent. If Γ is the 3-element set {y1, y2, y3}, Θ cannot contain a contribution that gives nonzero probability to an element like (y3, {y1, y2}) that says, "Only computational universes 1 and 2 are consistent with the physical universe; also, computational universe 3 is the one that answers mathematical questions correctly."

The second condition means a computationalist hypothesis cannot assert with high confidence that a computation will have a given result without that computation being instantiated. More concretely, if  contains , condition (2) means that  must also contain the contributions  and . (These contributions can be obtained by letting  or .) It is possible for  to contain the contribution  without containing a contribution that gives nonzero probability to other possibilities. The bottom line: If  claims that a computational universe is consistent with the physical universe, it must incorporate the possibility that this computational universe is correct.  cannot know that  without  getting computed somewhere.
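Here's one way to make the closure condition under mappings s: Γ → Γ concrete. This is a sketch under the assumption that s acts on pairs by (y, α) ↦ (s(y), α), with mass discarded whenever s(y) ∉ α (the helper and this concrete form are mine):

```python
# Pushforward of a contribution on el(Gamma) along s: Gamma -> Gamma,
# discarding mass on pairs where s(y) lands outside alpha.
def push(theta, s):
    """theta: dict from (y, alpha) pairs to mass, alpha a frozenset."""
    out = {}
    for (y, alpha), mass in theta.items():
        sy = s(y)
        if sy in alpha:
            out[(sy, alpha)] = out.get((sy, alpha), 0.0) + mass
    return out

alpha = frozenset(["y1", "y2"])
theta = {("y1", alpha): 1.0}
print(push(theta, lambda y: "y2"))  # all mass moved to ("y2", alpha)
print(push(theta, lambda y: "y3"))  # {}: mass discarded, y3 not in alpha
```

Note how mass can only be lost, never created; this is why contributions (mass ≤ 1) rather than probability distributions are the right objects here.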

Bridge Transforms and Monotonicity

Given a description of the physical universe, how can we determine which programs are instantiated, i.e., select elements of 2^Γ? To answer this question we use bridge transforms: functions that take joint beliefs over computational and physical universes as input and produce beliefs about which computations are instantiated.

We will use Φ to denote the set of all physical universes we're considering. As with Γ, Φ has no mathematical structure beyond being a finite[12] set. Here are some examples of sets that could be used for Φ:

However, the details of Φ are of minimal importance to understanding IBP. We don't care about Φ in its own right; it's just something we'll use to produce computationalist hypotheses. We formalize joint beliefs over computational and physical universes as elements of □(Γ × Φ). Note that entanglements of beliefs about computations and physics are often quite natural. With Φ as a set of possible histories of a cellular automaton, different elements of Φ naturally correspond to computational universes that give different answers to "How will this cellular automaton evolve according to <rules>?". Straightforward entanglements also arise from considering agents; physical universes in which an agent chooses "cooperate" in the Prisoner's Dilemma will correspond to computational universes in which the agent's source code yields "cooperate" when given the appropriate input.

We now define the liberal bridge transform. The liberal bridge transform is a mapping □(Γ × Φ) → □(elΓ × Φ). The resulting ultracontribution contains all contributions that are "consistent" with the ultracontribution over Γ × Φ in a sense that is closely related to the conditions of a computationalist hypothesis.

Definition 4: Let Γ and Φ be finite sets and let Θ ∈ □(Γ × Φ). The liberal bridge transform of Θ is Br(Θ) ∈ □(elΓ × Φ), where θ ∈ Δc(elΓ × Φ) is an element of Br(Θ) if and only if:

  1. For all mappings s: Γ → Γ, pushing θ forward along (y, α, x) ↦ (s(y), x), while discarding the mass on triples with s(y) ∉ α, yields a contribution in Θ

Here, pr is the projection operator. These conditions closely mirror those for a computationalist hypothesis. In fact, the liberal bridge transform provides an alternative characterization of computationalist hypotheses:

Proposition 1: Θ is a computationalist hypothesis if and only if:

For using the bridge transform on  we take  is the diagonal operator that sends  to . In condition (2),  simply takes  to .

For the proof of Proposition 1 we'll be working with two copies of  in the bridge transform; for clarity, we'll denote the one that's treated as the physical universe as , the one produced by the bridge transform as , and write   and  for the subsets  and . First, however, we need a basic lemma:

Lemma 1: For ,

Proof: Consider a . The LHS sends this element

The RHS sends this element

proving Lemma 1 in the case of δ-contributions. Any contribution can be written as a linear combination of δ-contributions and all of the mappings in Lemma 1 are linear, so Lemma 1 holds for all contributions.

Proof of Proposition 1: Condition (1) is the same in Proposition 1 and Definition 3, and both of them also ensure that  is an ultracontribution. Therefore, we only need to worry about condition (2).

First, let  be a computationalist hypothesis. Showing that  is equivalent to showing that, for all  and all , we have . Writing  for some , we have

The first equality is from Lemma 1. The inclusion on the final line follows from the definition of a computationalist hypothesis. To show that the conditions of Proposition 1 imply that  is a computationalist hypothesis, we know that

from condition (2) of Proposition 1 and the definition of the bridge transform. The above equalities show that this equals , so condition (2) of Definition 3 is fulfilled. 

For what follows, we will need to introduce the information order on . Given , we write  iff , and .[13] Then, given , we have  iff for all functions  such that  (that is, for all nondecreasing ), we have

This means that  if for every smidgeon of probability assigned by  there's a corresponding smidgeon of probability assigned by , and the smidgeon of probability assigned by  is assigned to an element of  that is the same as for , except the  component of  may be a superset of the  component of .[14]

We can get a computationalist hypothesis from Br(Θ) by projecting to elΓ. However, every computationalist hypothesis produced this way contains contributions that correspond to the possibility that every computation is instantiated. For example, if , it is easy to see that  from the definition of the liberal bridge transform. Such contributions claim that any given configuration of the physical universe will only be consistent with a single configuration of the computational universe, which implies that all computations are instantiated. More broadly, if  and , we must also have  (Proposition 2.4 [LW · GW] in the original IBP post). This was called the monotonicity principle. The original post on IBP discussed the consequences of only considering computationalist hypotheses that are derived from the liberal bridge transform, and therefore obey the monotonicity principle, in great detail. Here we define a conservative bridge transform that can yield any computationalist hypothesis, and thus is not bound by the monotonicity principle or the limitations discussed in the original IBP post.

Definition 5: Let  and  be finite sets and let . Denote the set of maximal elements of  in the information order by . Define , and let  denote the closed convex hull of .[15] The conservative bridge transform of , denoted , is the downward closure of  in the natural order on contributions, i.e. the set 

The big difference between the liberal and conservative bridge transforms is the step of only considering maximal elements of  in the information order; going from  to  is necessary to make the resulting set closed under , and the rest is just bookkeeping to ensure that we actually end up with an ultracontribution.
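The maximal-elements step is ordinary finite-poset bookkeeping, and can be sketched generically (the helper is mine): keep exactly the elements that are not strictly below any other element.

```python
# Maximal elements of a finite set under a partial order leq:
# keep a iff no b is strictly above it (leq(a, b) but not leq(b, a)).
def maximal_elements(items, leq):
    return [a for a in items
            if not any(leq(a, b) and not leq(b, a) for b in items)]

# Example partial order: subset inclusion on small sets.
items = [frozenset(), frozenset({1}), frozenset({2}), frozenset({1, 2})]
print(maximal_elements(items, lambda a, b: a <= b))  # [frozenset({1, 2})]
```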

Proposition 2 is a computationalist hypothesis.

Proof: For condition (1) of Definition 3 (), we observe that  has its support on .  is closed, convex, and downward closed (in the natural order on contributions), so  satisfies this condition as well.

For condition (2) of Definition 3 (), we again note that it is satisfied by . What remains is showing that it is preserved by taking the downward closure of the closed convex hull. Here, noting that the projection commutes with the operations we're interested in, we let   be an arbitrary subset (not necessarily an ultracontribution) that satisfies  and prove that each of the following operations preserves this condition:

Convex hull: Let , so  for .

Closure: Let  be a convergent sequence and let . We need to show, for any , that there is a sequence in  that converges to . Such a sequence is given by , which proves closure.

Downward closure: If  and , then 

Lemma 2: If  is a computationalist hypothesis, the set of maximal elements of  in the information order is supported on the diagonal of , i.e. the set of elements of the form 

Proof: Let . First we show that for all  we have . Suppose that  with  containing some element . By downward closure, this means that  for some . Let  be the constant function. By the definition of the liberal bridge transform we must have . We compute

This violates the support condition of a computationalist hypothesis since , establishing a contradiction. This shows that for all  we have .

Now we show that for every maximal  we have . Consider the function  defined by

Since  we have  for all  implies that  by Proposition 2.1 [LW · GW] of the original IBP post. Furthermore, Proposition 1 says that , so  is in  iff  is supported on the diagonal, so all maximal elements of  are supported on the diagonal. 

Proposition 3: If  is a computationalist hypothesis, (idempotence).

Proof: First, we show that . Since  is convex, closed, and downward closed, we only need to show that, for  as defined in Definition 5, . Let . Then  for some  in the set of maximal elements in . By Lemma 2,  is supported on the diagonal of , i.e. the set of elements of the form . From this it follows that  since  by Proposition 2.1 [LW · GW] of the original IBP post, and thus 

The inclusion follows from the definition of a computationalist hypothesis and . The final equality can be proved similarly to Lemma 1; all the mappings are linear,   sends

and  sends

Next we show that . By Proposition 1, . Letting  be an element of  and  will be a maximal element of  in the information order  if it is a maximal element of  in the natural order on contributions . This follows from Lemma 2: the maximal elements (in the information order) of  are supported on the diagonal, so the only way for  to not be maximal (in the information order) is to have an  such that  and , which is equivalent to  (in the natural order on contributions).

By Proposition 2.1 [LW · GW] of the original IBP post, we have , so a maximal  in the natural order on contributions must correspond to a maximal  in the natural order on contributions.[16] Looking at the definition of the conservative bridge transform, we see that if  is a maximal element of  then . (To see this, just let  and remember that  is supported on , as is clear from the definition of the conservative bridge transform.) So we must have , and therefore  if  is maximal. Finally,  is downward closed, so if  is a non-maximal element of , i.e.,  for some maximal , the fact that  also ensures that 

Corollary 1: Every computationalist hypothesis can be derived as the conservative bridge transform of some  for some .

Unlike the liberal bridge transform, the conservative bridge transform can yield any computationalist hypothesis - including ones that say that certain programs aren't running. Using the conservative bridge transform instead of the liberal bridge transform eliminates the monotonicity principle.

Properties of computationalist hypotheses

We now state and prove some basic facts about computationalist hypotheses.

Proposition 4: Let  be a boolean function and  be a subset of . If for all  in the support of a computationalist hypothesis  we have  if and only if  is true, then for all  we have either  or .

Proof: Let  and suppose there is a  such that , but  and . Let  be supported on this . Define  by  and . Then  is in the support of  and  is in the support of . Both these contributions are in  by the definition of a computationalist hypothesis. Both of these elements have the same  and exactly one of them has its -component in , contradicting the assumption of the proposition. 

We can let  be, "Does  say Computation #1 is instantiated?". If  is the set of all  which say Computation #2 yields a certain result (let's call it ) and  claims that Computation #1 will only be instantiated if Computation #2 yields the result , then Proposition 4 means that Computation #2 must be instantiated according to .[17] 

Proposition 5: The intersection of any family of computationalist hypotheses is a computationalist hypothesis.

Proof: The intersection of downward-closed closed convex sets is downward-closed, closed, and convex, so the intersection of (potentially infinitely many) ultracontributions is an ultracontribution. Next, let . Then , so we have , and therefore . We also have  from 

Proposition 6: The convex hull of two computationalist hypotheses is a computationalist hypothesis. 

Proof: The convex hull of two closed convex sets is closed and convex. For downward closure, let  and let  for some , and  . Write . The first term is , so it must equal  times some element of  by the downward closure of . The second term must be  because  and  since  is non-negative. By the downward closure of  and , this means that  for some  and , so .

Now we need to show that . Let  be an element of  is a linear operator, so we have

 since the  are computationalist hypotheses, so we have

Proposition 7: A finite mixture of computationalist hypotheses is a computationalist hypothesis. That is, if  are computationalist hypotheses and for all  we have  and , then   is a computationalist hypothesis.

Proof: We need to show that . As in the proof of Proposition 6, this follows from the linearity of  and the fact that the  are computationalist hypotheses. Now we only need to show convexity, closure, and downward closure:

Convexity: Let . Then

Closure: Consider a convergent sequence . We need to show that . We know that  is compact since   is finite, so our sequence must contain a convergent subsequence  for which the sequence  converges to some  for all  since  is closed, so .

Downward closure: From the proof of Proposition 6 we see that the mixture of two computationalist hypotheses is downward closed. That this holds for any  follows by induction since we can consider the mixture of  computationalist hypotheses as a mixture of  and 
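The mixture operation in Proposition 7 can be sketched concretely, under the simplifying assumption (mine, for illustration only) that each ultracontribution is represented by a finite generating set of contributions:

```python
# Finite mixture of ultracontributions, each represented by a finite
# generating set of contributions (dicts from outcomes to mass).
from itertools import product

def mix(generator_sets, probs):
    """All contributions sum_i probs[i] * theta_i, with theta_i drawn
    from the i-th generating set."""
    mixed = []
    for combo in product(*generator_sets):
        out = {}
        for p, theta in zip(probs, combo):
            for x, mass in theta.items():
                out[x] = out.get(x, 0.0) + p * mass
        mixed.append(out)
    return mixed

gens1 = [{"a": 1.0}, {"b": 1.0}]
gens2 = [{"a": 0.5}]
print(mix([gens1, gens2], [0.5, 0.5]))
# [{'a': 0.75}, {'b': 0.5, 'a': 0.25}]
```

Mixing generating sets pointwise like this mirrors how the proof reduces an n-way mixture to repeated two-way mixtures.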

If computationalist hypotheses did not obey any rules such as Propositions 4-7, that would suggest that they were not a natural mathematical concept, and therefore not a promising tool for understanding agent foundations. While none of these properties is surprising, the fact that computationalist hypotheses possess them is (weak) evidence of computationalist hypotheses being natural and useful.

This is far from a complete list of the properties of computationalist hypotheses. The properties of the liberal bridge transform, which is related to computationalist hypotheses via Proposition 1, are investigated more fully in the original IBP post.[18]

Further reading and future research

This post's novel[19] content has been the definitions of computationalist hypotheses and the conservative bridge transform, and proofs concerning some of their properties. The surrounding discussion of infra-Bayesian physicalism has been the minimum necessary to contextualize and understand these novel contributions. The original IBP post [LW · GW] provides many technical results for the liberal bridge transform and a more complete description of physicalist agents.[20] Readers who want to see IBP in action may be interested in the post on IBP and quantum mechanics [LW · GW], which provides several examples.[21] Readers who are interested in how IBP could offer a solution to the alignment problem are encouraged to read about Physicalist Superimitation [LW · GW].

With monotonicity resolved, the biggest hole in the theory of IBP is the current lack of a tie-in to learning theory, and creating learning theory for physicalist agents is the most important IBP-centric research direction. This will include defining a notion of physicalist regret that admits learnability and proving regret bounds for physicalist agents. This is essential since regret bounds are a critical ingredient for proving that an agent is safe and aligned. Regret bounds are also natural candidates for giving a rigorous definition of what it means for an algorithm to be an agent.

Another research direction is to create an axiomatic characterization of a bridge transform - in other words, coming up with a list of properties and proving that there is only one potential bridge transform that satisfies all of them. This could lead to a new and improved definition of a bridge transform (like how the conservative bridge transform is an improvement over the liberal bridge transform in important ways) or give us more confidence that a bridge transform we have already formalized is "correct."

Finally, other directions in the learning-theoretic research agenda [LW · GW] - most notably compositionality and metacognition - will likely be relevant to IBP.

Thanks to Cole Wyeth for reading a draft and offering suggestions, and to Vanessa for coming up with all of the math, providing many rounds of feedback, and being a wonderful wife.

  1. ^

    The problem of privilege is discussed in more depth in the original IBP post [LW · GW].

  2. ^

    While physicalist agents find it easy to reason about embedded agency, certain assumptions (e.g., an uncorrupted memory) are required for formal guarantees and learning.

  3. ^

    A fear of mindcrime may interfere with learnability, however, so it may be necessary to add a small amount of willingness to risk mindcrime back into the physicalist framework, such as by having a physicalist agent be unconcerned about committing mindcrime so long as it's done on a single trusted computer. Even if this is a serious issue, however, IBP still provides a useful framework for formalizing the concept of mindcrime.

  4. ^

    Perhaps excluding mindcrime, which is more of a concern of AI alignment than for agent foundations in the abstract.

  5. ^

    For computations that don't halt, each element of Γ will still specify a result (e.g., the final state of a tape in a Turing machine). All such purported "results" for non-halting computations will be bullshit, but this is a nonissue since it will be impossible for anything that occurs in the physical universe to contradict such a purported result. In essence, whatever an element of Γ says about a particular computation boils down to, "If this computation halts, then its result will be such-and-such."

  6. ^

    We expect everything to generalize smoothly to infinite , but having  be finite simplifies the presentation. Additionally, with infra-Bayesianism it is very natural to focus on a finite quotient set of an infinite  while ignoring everything else. (Mathematically, we're just coarsening over all the results of all but finitely many computations; you don't need to understand what coarsening means to understand the rest of this post.)

  7. ^

    Caveat:  does not necessitate that a tall, strong man has experienced kissing you.

  8. ^

    This can be generalized to infinite  by letting  be a measure.

  9. ^

    These were called homogenous ultracontributions (HUCs) in the original IBP post.

  10. ^

    The "grain of truth" problem says that, in general, an embedded agent with a single probability distribution cannot assign nonzero probability to the true state of the universe. Hence the need for infra-Bayesianism.

  11. ^

    This abuse of notation is like how, given  and , we write  to denote the image of .

  12. ^

    As with , restricting our attention to finite  is purely for simplicity.

  13. ^

    We use the  symbol to distinguish the information order from the natural order on contributions, which we denote with .

  14. ^

    This claim is formalized in Proposition 2.2 [LW · GW] of the original IBP post.

  15. ^

    The convex hull of a set  is the set of all objects that can be written as ; it's the smallest convex superset of . The closed convex hull is the closure of this set, i.e., the smallest closed set containing it.

  16. ^

    Note that projections preserve probability mass; .

  17. ^

    Computation #2 must be fully instantiated if it can yield exactly two results, but it may be only partly instantiated if it has more than two possible results.

  18. ^

    There is at least one property of the liberal bridge transform that the conservative bridge transform does not possess: the refinement of an ultracontribution (that is to say, replacing an ultracontribution  with some subset of ) used as an input to a bridge transform shouldn't add additional possibilities to the result of the bridge transform. Here's a counterexample: Let  and . Let  be the largest possible ultracontribution on , i.e. the set of all possible contributions on it. Let  be the downward closure (in the natural order on contributions) of , i.e. the set of all contributions that are supported on  is a refinement of , but  is supported on  and  is supported on  and  is supported on  (and everything else) due to the monotonicity principle.  can be interpreted as, "all conceivable combinations of computational and physical universes are possible".  , however, confidently states that there is absolutely no connection between computations and physics.

  19. ^

    Novel excluding Vanessa's shortform [LW · GW], on which this post is based.

  20. ^

    This post renders a significant fraction of the original IBP post obsolete, most notably the discussions of the monotonicity requirement. Additionally, non-monotonic IBP allows for many kinds of loss functions for a selfish physicalist agent that has multiple copies running, while the monotonicity requirement had limited the original IBP post to considering agents that are solely concerned with minimizing the loss of their best-off copy.

  21. ^

    Several of the proofs use the monotonicity principle, so investigating quantum mechanics with non-monotonic IBP is a potential research direction.
