Eight Definitions of Observability

scott-garrabrant

Eight Definitions of Observability

post by Scott Garrabrant · 2020-11-10T23:37:07.827Z · LW · GW · 26 comments

  1. Definition from Subsets
    1.1. Example
  2. Conditional Policies Definition
    2.1. Example
  3. Additive Definitions
    3.1. Example
  4. Multiplicative Definitions
    4.1. Powerless Outside of a Subset
    4.2. Multiplicative Definitions of Observables
    4.3. Example
    4.4. Updatelessness
  5. Internalizing-Externalizing Definitions
    5.1. Example
None
26 comments

This is the eleventh post in the Cartesian frames [LW · GW] sequence. Here, we compare eight equivalent definitions of observables, which emphasize different philosophical interpretations.

Throughout this post, we let be a Cartesian frame over a nonempty set $W$ , we let $V = {S_{1}, \dots, S_{n}}$ be a finite partition of $W$ , and we let $v : W \to V$ send each element of $W$ to its part in $V$ .

The condition that $V$ is finite is an important one. Many of the definitions below can be extended to infinite partitions, and the theory of observability for infinite partitions is probably nice, but we are not discussing it here. The condition that $W$ is nonempty is just ruling out some degenerate cases

1. Definition from Subsets

The definitions in this post will talk about when a finite partition $V$ of $W$ is observable in $C$ . This will make some of the definitions more elegant, and it is easy to translate back and forth between the new definitions of the observability of a finite partition and the old definitions of the observability of a subset.

Definition: We say $C$ 's agent can observe a finite partition $V$ of $W$ if for all parts $S_{i} \in V$ , $S_{i} \in Obs (C)$ . We let ${Obs}^{'} (C)$ denote the set of all finite partitions of $W$ that are observable in $C$ .

Claim: For any nonempty strict subset $S \subset W$ , $C$ 's agent can observe $S$ if and only if $C's$ agent can observe ${S, (W ∖ S)}$ .

Proof: If $C$ 's agent can observe ${S, (W ∖ S)}$ , then clearly $C$ 's agent can observe $S$ . If $C's$ agent can observe $S$ , then since observability is closed under complements, $C$ 's agent can observe $W ∖ S$ , and so can observe ${S, (W ∖ S)}$ . $□$

1.1. Example

In "Introduction to Cartesian Frames [LW · GW]," we gave the example of an agent that can choose between unconditionally carrying an umbrella, unconditionally carrying no umbrella, carrying an umbrella iff it's raining, and carrying an umbrella iff it's sunny:

$C_{0} = \begin{matrix} \begin{matrix} r & s \end{matrix} \begin{matrix} u n u \leftrightarrow r u \leftrightarrow s \end{matrix} & ⎛ ⎜ ⎜ ⎜ ⎝ \begin{matrix} u r & u s n r & n s u r & n s n r & u s \end{matrix} ⎞ ⎟ ⎟ ⎟ ⎠ \end{matrix}$

Here, $Obs (C_{0}) = {{}, {u r, n r}, {u s, n s}, W}$ , so the partition $V = {R, S}$ is observable in $C_{0}$ , where $R = {u r, n r}$ and $S = {u s, n s}$ .

As we go through the definitions in this post, we will repeatedly return to $C_{0}$ and show how to understand $C_{0}$ 's observables in terms of our new definitions.

Before presenting fundamentally new definitions, we will modify our two old definitions to be about finite partitions instead of subsets.

2. Conditional Policies Definition

Definition: We say that $C$ 's agent can observe a finite partition $V$ of $W$ if for all functions $f : V \to A$ , there exists an element $a_{f} \in A$ such that for all $e \in E$ , $f (v (a_{f} \cdot e)) \cdot e = a_{f} \cdot e$ .

Claim: This definition is equivalent to the definition from subsets.

Proof: We work by induction on the number of parts in $V$ . Since $W$ is nonempty, $V$ has at least one part. If $V = {W}$ has one part, we clearly have that $C$ 's agent can observe $V$ under the definition from subsets. For the conditional policies definition, we also have that $C$ 's agent can observe $V$ , since we can take $a_{f} = f (W)$ , and thus, for all $e \in E$ ,

\begin{matrix} f (v (a_{f} \cdot e)) \cdot e & = f (W) \cdot e = a_{f} \cdot e . \end{matrix}

If $V = {S_{1}, \dots, S_{n}}$ has $n$ parts, consider the partition $V^{'} = {S_{1} \cup S_{2}, S_{3}, \dots, S_{n}}$ which unions together the first two parts $S_{1}$ and $S_{2}$ of $V$ . Let $v^{'} : W \to V^{'}$ send each element of $W$ to its part in $V^{'}$ .

First, assume that $C$ 's agent can observe $V$ according to the definition from subsets. Then, since observability of subsets is closed under unions, $C$ 's agent can also observe $V^{'}$ under the definition from subsets, and thus also under the conditional policies definition.

Given a function $f : V \to A$ , let $f^{'} : V^{'} \to A$ be given by $f^{'} (S_{1} \cup S_{2}) = f (S_{2})$ , and $f^{'} (S_{i}) = f (S_{i})$ on all other inputs. Since $C$ 's agent can observe $V^{'}$ under the conditional policies definition, we can let $a_{f^{'}}$ be such that for all $e \in E$ , $f^{'} (v^{'} (a_{f^{'}} \cdot e)) \cdot e = a_{f^{'}} \cdot e$ .

Choose an $a_{f} \in A$ such that $a_{f} \in if (S_{1}, f (S_{1}), a_{f}^{'})$ , which we can do because $S_{1}$ is observable in $C$ . Observe that for all $e \in E$ , we have that if $a_{f} \cdot e \in S_{1}$ , then

\begin{matrix} f (v (a_{f} \cdot e)) \cdot e & = f (S_{1}) \cdot e = a_{f} \cdot e, \end{matrix}

if $a_{f} \cdot e \in S_{2}$ , we have $a_{f} \cdot e = a_{f^{'}} \cdot e$ , and thus

\begin{matrix} f (v (a_{f} \cdot e)) \cdot e & = f (S_{2}) \cdot e = f^{'} (S_{1} \cup S_{2}) \cdot e = f^{'} (v^{'} (a_{f^{'}} \cdot e)) \cdot e = a_{f^{'}} \cdot e = a_{f} \cdot e, \end{matrix}

and finally if $a_{f} \cdot e \in S_{i}$ for some $i \neq 1, 2$ , we still have have $a_{f} \cdot e = a_{f^{'}} \cdot e$ , and thus

\begin{matrix} f (v (a_{f} \cdot e)) \cdot e & = f (S_{i}) \cdot e = f^{'} (S_{i}) \cdot e = f^{'} (v^{'} (a_{f^{'}} \cdot e)) \cdot e = a_{f^{'}} \cdot e = a_{f} \cdot e . \end{matrix}

Thus, $C$ 's agent can observe $V$ according to the conditional policies definition.

Conversely, if $C$ 's agent can observe $V$ according to the conditional policies definition, then to show that $C$ 's agent can observe $V$ according to the definition from subsets, it suffices to show that the agent can observe $S_{i}$ for all $S_{i} \in V$ . Thus, we need to show that for any $a_{0}, a_{1} \in A$ , there exists an $a_{2} \in A$ with $a_{2} \in if (S_{i}, a_{0}, a_{1})$ .

Indeed, if we let $f : V \to A$ send $S_{i}$ to $a_{0}$ , and send all other inputs to $a_{1}$ , then we can take an $a_{f}$ such that for all $e \in E$ , $f (v (a_{f} \cdot e)) \cdot e = a_{f} \cdot e$ . But then, if $a_{f} \cdot e \in S_{i}$ , then

\begin{matrix} a_{f} \cdot e & = f (v (a_{f} \cdot e)) \cdot e = f (S_{1}) \cdot e = a_{0} \cdot e, \end{matrix}

and otherwise,

\begin{matrix} a_{f} \cdot e & = f (v (a_{f} \cdot e)) \cdot e = a_{1} \cdot e . \end{matrix}

Thus, $C$ 's agent can observe $V$ according to the definition from subsets. $□$

2.1. Example

Let $C_{0} = (A, E, \cdot)$ be defined as in the §1.1 example, with $R = {u r, n r}$ , $S = {u s, n s}$ , and $V = {R, S}$ .

$A = {u, n, u \leftrightarrow r, u \leftrightarrow s}$ is a four-element set, and $V = {R, S}$ is a two-element set, so there are sixteen functions $f : V \to A$ . For each function, there is a possible agent $a_{f} \in A$ that satisfies $f (v (a_{f} \cdot e)) \cdot e = a_{f} \cdot e$ for all $e \in E$ . We can illustrate the sixteen functions and the corresponding $a_{f} \in A$ in a sixteen-row table:

$f (R)$	$f (S)$	$a_{f}$
$u$	$u$	$u$
$u$	$n$	$u \leftrightarrow r$
$u$	$u \leftrightarrow r$	$u \leftrightarrow r$
$u$	$u \leftrightarrow s$	$u$
$n$	$u$	$u \leftrightarrow s$
$n$	$n$	$n$
$n$	$u \leftrightarrow r$	$n$
$n$	$u \leftrightarrow s$	$u \leftrightarrow s$
$u \leftrightarrow r$	$u$	$u$
$u \leftrightarrow r$	$n$	$u \leftrightarrow r$
$u \leftrightarrow r$	$u \leftrightarrow r$	$u \leftrightarrow r$
$u \leftrightarrow r$	$u \leftrightarrow s$	$u$
$u \leftrightarrow s$	$u$	$u \leftrightarrow s$
$u \leftrightarrow s$	$n$	$n$
$u \leftrightarrow s$	$u \leftrightarrow r$	$n$
$u \leftrightarrow s$	$u \leftrightarrow s$	$u \leftrightarrow s$

Since there is an $a_{f} \in A$ for each function, $C_{0}$ 's agent can observe $V$ according to the conditional policies definition.

3. Additive Definitions

Next, we give an additive definition of observables. This is a version of our categorical definition of observables from "Controllables and Observables, Revisited [LW · GW]," modified to be about finite partitions.

Definition: We say $C$ 's agent can observe a finite partition $V = {S_{1}, \dots, S_{n}}$ of $W$ if there exist $C_{1}, \dots C_{n}$ , Cartesian frames over $W$ , with $C_{i} ◃ ⊥_{S_{i}}$ such that $C ≃ C_{1} & \dots & C_{n}$ .

This can also be strengthened to a constructive version of the additive definition, which we will call the assuming definition.

Definition: We say $C$ 's agent can observe a finite partition $V = {S_{1}, \dots, S_{n}}$ of $W$ if $C ≃ {Assume}_{S_{1}} (C) & \dots & {Assume}_{S_{n}} (C)$ .

Claim: These definitions are equivalent to each other and the definitions above.

Proof: We assume that $n \geq 2$ , and that $A$ is nonempty. The case where $n = 1$ and the case where $A = {}$ are trivial.

If $C$ 's agent can observe $V$ according to the assuming definition of observables, then it can also clearly observe $V$ according to the additive definition, since ${Assume}_{S_{1}} (C) ◃ ⊥_{S_{1}}$ .

Next, assume that $C$ 's agent can observe $V$ according to the additive definition. We will show that $C$ 's agent can observe $S_{1}$ . Consider the pair of Cartesian frames $C_{1}$ and $C_{2} & \dots & C_{n}$ . Observe that $C_{1} ◃ ⊥_{S_{1}}$ and that $C_{2} & \dots & C_{n} ◃ ⊥_{W ∖ S_{1}}$ , and that $C ≃ C_{1} & (C_{2} & \dots & C_{n})$ . Thus, $S_{1}$ is observable in $C$ . Symmetrically, $S_{i}$ is observable in $C$ for all $i = 1, \dots n$ , and thus $V$ is observable in $C$ according to the definition from subsets.

Finally, assume that $C$ 's agent can observe $V$ according to the conditional policies definition (and also the definition from subsets). We will show that $C ≃ C_{1} & \dots & C_{n}$ , where $C_{i} = {Assume}_{S_{i}} (C)$ .

We have $C_{1} & \dots & C_{n} = (A^{n}, E_{1} ⊔ \dots ⊔ E_{n}, ⋆)$ , where $C_{i} = (A, E_{i}, \cdot_{i})$ , and $⋆$ is given by $(a_{1}, \dots, a_{n}) ⋆ e = a_{i} \cdot e$ , where $e \in E_{i}$ .

First observe that for every $e \in E$ , there is a unique $i \in {1, \dots, n}$ such that $e \in E_{i}$ . This is because there exists an $a_{0} \in A$ , and from the definition from subsets, $C$ 's agent can observe each $S_{i}$ , and so given an $e \in E$ , if $a_{0} \cdot e \in S_{i}$ , it must be the case that for all $a \in A,$ $a \cdot e \in S_{i}$ . Thus, we have that that $E = E_{1} ⊔ \dots ⊔ E_{n}$ .

We construct $(g_{0}, h_{0}) : (A^{n}, E, ⋆) \to C$ and $(g_{1}, h_{1}) : C \to (A^{n}, E, ⋆)$ which compose to something homotopic to the identity in each order. Let $g_{1} : A \to A^{n}$ be the diagonal, given by $g_{1} (a) = (a, \dots, a)$ . Let $h_{0}$ and $h_{1}$ be the identity on $E$ . Let $g_{0}$ be given by $g_{0} (a_{1}, \dots, a_{n}) = a_{f}$ , where $f : V \to A$ is given by $f (S_{i}) = a_{i}$ , and $a_{f}$ satisfies $f (v (a_{f} \cdot e)) \cdot e = a_{f} \cdot e$ for all $e \in E$ , which is possible by the conditional policies definition.

To see that $(g_{1}, h_{1})$ is a morphism, observe that for all $a \in A$ and $e \in E$ ,

\begin{matrix} g_{1} (a) ⋆ e & = (a, \dots, a) ⋆ e = a \cdot e = a \cdot h_{1} (e) . \end{matrix}

To see that $(g_{0}, h_{0})$ is a morphism, observe that for all $(a_{1}, \dots, a_{n}) \in A$ , and $e \in E$ , if we let $f : V \to A$ be given by $f (S_{i}) = a_{i}$ , we have

\begin{matrix} g_{0} (a_{1}, \dots, a_{n}) \cdot e & = f (v (g_{0} (a_{1}, \dots, a_{n}) \cdot e)) \cdot e = f (S_{i}) \cdot e = a_{i} \cdot e = (a_{1}, \dots, a_{n}) ⋆ e = (a_{1}, \dots, a_{n}) ⋆ h_{0} (e), \end{matrix}

where $i$ is such that $e \in E_{i}$ . The fact that $(g_{0}, h_{0})$ and $(g_{1}, h_{1})$ compose to something homotopic to the identity in both orders follows from the fact that $h_{0} \circ h_{1}$ and $h_{1} \circ h_{0}$ are the identity on $E$ . Thus, $C ≃ {Assume}_{S_{1}} (C) & \dots & {Assume}_{S_{n}} (C)$ , and so $V$ is observable in $C$ according to the assuming definition. $□$

3.1. Example

Let $C_{0}$ be defined as in the previous examples, with $R = {u r, n r}$ and $S = {u s, n s}$ . By the assuming definition, there exist two frames

$C_{1} = {Assume}_{R} (C_{0}) = \begin{matrix} \begin{matrix} r \end{matrix} \begin{matrix} u n \end{matrix} & (\begin{matrix} u r n r \end{matrix}) \end{matrix}$

and

$C_{2} = {Assume}_{S} (C_{0}) = \begin{matrix} \begin{matrix} s \end{matrix} \begin{matrix} u n \end{matrix} & (\begin{matrix} u s n s \end{matrix}) \end{matrix}$

such that $C_{0} ≃ C_{1} & C_{2}$ .

This example both illustrates the idea behind the additive definitions, and shows the construction used in the assuming definition. This is also the same example we provided to illustrate products of Cartesian frames in "Additive Operations on Cartesian Frames [LW · GW]."

Another way of thinking about the additive definition of observables: Recall "Committing, Assuming, Externalizing, and Internalizing" §3.2 (Committing and Assuming Can Be Defined Using Lollipop and Tensor [LW · GW]), where we saw that ${Assume}_{S} (C) ≅ 1_{S} \otimes C$ . This means that (up to isomorphism) we can restate $C_{0} ≃ C_{1} & C_{2}$ as $C_{0} ≃ (1_{R} \otimes C_{0}) & (1_{S} \otimes C_{0})$ , i.e.,

$\begin{matrix} ⎛ ⎜ ⎜ ⎜ ⎝ \begin{matrix} u r & u s n r & n s u r & n s n r & u s \end{matrix} ⎞ ⎟ ⎟ ⎟ ⎠ \end{matrix} ≃ \begin{matrix} (\begin{matrix} u r & n r \end{matrix}) \end{matrix} \otimes \begin{matrix} ⎛ ⎜ ⎜ ⎜ ⎝ \begin{matrix} u r & u s n r & n s u r & n s n r & u s \end{matrix} ⎞ ⎟ ⎟ ⎟ ⎠ \end{matrix} & \begin{matrix} (\begin{matrix} u s & n s \end{matrix}) \end{matrix} \otimes \begin{matrix} ⎛ ⎜ ⎜ ⎜ ⎝ \begin{matrix} u r & u s n r & n s u r & n s n r & u s \end{matrix} ⎞ ⎟ ⎟ ⎟ ⎠ \end{matrix}$ .

This (equivalent) framing makes it easier to keep track of what "assuming" is doing categorically, so that we can see what interfaces between frames we are relying on when we say that something is "observable" using an additive definition.

4. Multiplicative Definitions

Our multiplicative definitions will depend on a notion of agents being powerless outside of a subset.

4.1. Powerless Outside of a Subset

Definition: Given a subset $S$ of $W$ , we say that $C$ 's agent is powerless outside $S$ if for all $e \in E$ , and all $a_{0}, a_{1} \in A$ , if $a_{0} \cdot e \notin S$ , then $a_{0} \cdot e = a_{1} \cdot e$ .

To say that $C$ 's agent is powerless outside $S$ is to say that the if the world is at all dependent on $C$ 's agent, then the world must be in $S$ .

Here are some lemmas about being powerless outside of a subset, which we will use later.

Lemma: If $C$ 's agent is powerless outside $S$ and $T \supseteq S$ , then $C$ 's agent is powerless outside $T$ .

Proof: Trivial. $□$

Lemma: If $C$ and $D$ 's agents are both powerless outside $S$ , then $C \otimes D$ 's agent is powerless outside $S$ .

Proof: Let $D = (B, F, ⋆)$ , and let $C \otimes D = (A \times B, hom (C, D^{*}), ⋄)$ . Consider some $(a_{0}, b_{0}), (a_{1}, b_{1}) \in A \times B$ and $(g, h) \in hom (C, D^{*})$ . We will use the fact that if $a_{0} \cdot h (b_{0}) \notin S$ then $a_{0} \cdot h (b_{0}) = a_{1} \cdot h (b_{0})$ , and the fact that if $b_{0} ⋆ g (a_{1}) \notin S$ then $b_{0} ⋆ g (a_{1}) = b_{1} ⋆ g (a_{1})$ . Observe that if $(a_{0}, b_{0}) ⋄ (g, h) \notin S$ , then

\begin{matrix} (a_{0}, b_{0}) ⋄ (g, h) & = a_{0} \cdot h (b_{0}) = a_{1} \cdot h (b_{0}) = b_{0} ⋆ g (a_{1}) = b_{1} ⋆ g (a_{1}) = (a_{1}, b_{1}) ⋆ (g, h) . \end{matrix}

$□$

Now, we are ready for our first truly new definition of the observability of a finite partition.

4.2. Multiplicative Definitions of Observables

Definition: We say that $C$ 's agent can observe a finite partition $V = {S_{1}, \dots, S_{n}}$ of $W$ if $C ≃ C_{1} \otimes \dots \otimes C_{n}$ , where each $C_{i}$ 's agent is powerless outside $S_{i}$ .

Again, we also have a constructive version of this definition:

Definition: We say that $C$ 's agent can observe a finite partition $V = {S_{1}, \dots, S_{n}}$ of $W$ if $C ≃ C_{1} \otimes \dots \otimes C_{n}$ , where $C_{i} = {Assume}_{S_{i}} (C) & 1_{T_{i}}$ , where $T_{i} = (W ∖ S_{i}) \cap Image (C)$ .

Claim: These definitions are equivalent to each other and equivalent to the definitions above.

Proof: First, observe that if $C$ 's agent can observe $V$ according to the constructive version of the multiplicative definition, it can also observe $V$ according to the nonconstructive version of the multiplicative definition, since the agent of ${Assume}_{S_{i}} (C) & 1_{T_{i}}$ is clearly powerless outside $S_{i}$ .

Next, we show that if $C$ 's agent can observe $V$ according to the nonconstructive multiplicative definition, it can also observe $V$ according to the definition from subsets. Let $C ≃ D = C_{1} \otimes \dots \otimes C_{n}$ , where each $C_{i}$ 's agent is powerless outside $S_{i}$ . It suffices to show that $D$ 's agent can observe $V$ , since the definition from subsets is equivalent to the additive definition, and thus closed under biextensional equivalence. Thus, it suffices to show that $D$ 's agent can observe $S_{i}$ for all $i = 1, \dots, n$ . We will show that $D$ 's agent can observe $S_{1}$ , and the rest will follows by symmetry.

Let $C_{1} = (A_{1}, E_{1}, \cdot_{1})$ , and let $D_{1} = (B_{1}, F_{1}, ⋆_{1}) = C_{2} \otimes \dots \otimes C_{n}$ . We start by showing that $D_{1}$ 's agent is powerless outside $W ∖ S_{1}$ . We have that the agents of $C_{2}, \dots, C_{n}$ are all powerless outside $W ∖ S_{1}$ , since being powerless outside something is closed under supersets. Thus we have that $D_{1}$ 's agent is powerless outside $W ∖ S_{1}$ , since being powerless outside $W ∖ S_{1}$ is closed under tensor.

Thus, we have $D = (A_{1} \times B_{1}, hom (C, D^{*}), ⋄) = C_{1} \otimes D_{1}$ , with $C_{1}$ 's agent powerless outside $S_{1}$ and $D_{1}$ 's agent powerless outside $W ∖ S_{1}$ . Given an arbitrary $(a_{1}, b_{1}), (a_{2}, b_{2}) \in A_{1} \times B_{1}$ , we will show that $(a_{1}, b_{2}) \in if (S_{1}, (a_{1}, b_{1}), (a_{2}, b_{2}))$ , and thus show that $D$ 's agent can observe $S_{1}$ .

It suffices to show that for all $(g, h) : C \to D^{*}$ , if $(a_{1}, b_{2}) ⋄ (g, h) \in S_{1}$ , then $(a_{1}, b_{2}) ⋄ (g, h) = (a_{1}, b_{1}) ⋄ (g, h)$ , and if $(a_{1}, b_{2}) ⋄ (g, h) \notin S_{1}$ , then $(a_{1}, b_{2}) ⋄ (g, h) = (a_{2}, b_{2}) ⋄ (g, h)$ . Indeed, if $(a_{1}, b_{2}) ⋄ (g, h) \in S_{1}$ , then, since $D_{1}$ 's agent is powerless outside $W ∖ S_{1}$ , we have

\begin{matrix} (a_{1}, b_{2}) ⋄ (g, h) & = b_{2} ⋆_{1} g (a_{1}) = b_{1} ⋆_{1} g (a_{1}) = (a_{1}, b_{1}) ⋄ (g, h) . \end{matrix}

Similarly, if $(a_{1}, b_{2}) ⋄ (g, h) \notin S_{1}$ , then, since $C_{1}$ 's agent is powerless outside $S$ , we have

\begin{matrix} (a_{1}, b_{2}) ⋄ (g, h) & = a_{1} \cdot_{1} h (b_{2}) = a_{2} \cdot_{1} h (b_{2}) = (a_{2}, b_{2}) ⋄ (g, h) . \end{matrix}

Thus, $D$ 's agent can observe $S_{1}$ , so $C$ 's agent can observe $V$ according to the definition from subsets.

Finally, we assume that $C$ 's agent can observe $V$ according to the assuming definition, and show that $C$ 's agent can observe $V$ according to the constructive version of the multiplicative definition.

We work by induction on $n$ , the number of parts. The case where $n = 1$ is trivial. Let $C ≃ {Assume}_{S_{1}} (C) & \dots & {Assume}_{S_{n}} (C)$ . Thus, we also have that $C ≃ {Assume}_{S_{1} \cup S_{2}} (C) & {Assume}_{S_{3}} (C) & \dots & {Assume}_{S_{n}} (C)$ , and so by induction, we have that $C ≃ ({Assume}_{S_{1} \cup S_{2}} (C) & 1_{T_{1} \cap T_{2}}) \otimes C_{3} \otimes \dots \otimes C_{n}$ , where $C_{i}$ and $T_{i}$ are as in the constructive multiplicative definition. Thus, it suffices to show that

\begin{matrix} {Assume}_{S_{1} \cup S_{2}} (C) & 1_{T_{1} \cap T_{2}} & ≃ C_{1} \otimes C_{2} = ({Assume}_{S_{1}} (C) & 1_{T_{1}}) \otimes ({Assume}_{S_{2}} (C) & 1_{T_{2}}) . \end{matrix}

First, observe that we have $C ≃ D_{1} & D_{2} & D_{3}$ , where $D_{1} = {Assume}_{S_{1}}$ (C), $D_{2} = {Assume}_{S_{2}} (C)$ , and $D_{3} = {Assume}_{S_{3}} (C) & \dots & {Assume}_{S_{n}} (C)$ . Let $D_{i} = (B_{i}, F_{i}, ⋆_{i})$ . Let $R_{i} = Image (D_{i})$ .

Observe that $T_{1} = R_{2} \cup R_{3}$ , $T_{2} = R_{1} \cup R_{3}$ , and $T_{1} \cup T_{2} = R_{3}$ , and observe that ${Assume}_{S_{1} \cup S_{2}} (C) ≃ D_{1} & D_{2}$ . Thus it suffices to show that $(D_{1} & 1_{R_{2} \cup R_{3}}) \otimes (D_{2} & 1_{R_{1} \cup R_{3}}) ≃ D_{1} & D_{2} & 1_{R_{3}}$ .

Let $D_{1} & 1_{R_{2} \cup R_{3}} = (B_{1}, F_{1} ⊔ R_{2} ⊔ R_{3}, ∙_{1})$ , let $D_{2} & 1_{R_{1} \cup R_{3}} = (B_{2}, F_{2} ⊔ R_{1} ⊔ R_{3}, ∙_{2})$ , and let $D_{1} & D_{2} & 1_{R_{3}} = (B_{1} \times B_{2}, F_{1} ⊔ F_{2} ⊔ R_{3}, ∙_{3})$ where $∙_{1}$ , $∙_{2}$ , and $∙_{3}$ are all given by $b ∙_{i} f = b ⋆_{1} f$ if $f \in F_{1}$ , $b ∙_{i} f = b ⋆_{2} f$ if $f \in F_{2}$ , and $b ∙_{i} f = f$ otherwise.

Let $H = hom (B_{1} & 1_{R_{2} \cup R_{3}}, (D_{2} & 1_{R_{1} \cup R_{3}})^{*})$ . Let $(D_{1} & 1_{R_{2} \cup R_{3}}) \otimes (D_{2} & 1_{R_{1} \cup R_{3}}) = (B_{1} \times B_{2}, H, ∙_{4})$ ), where

\begin{matrix} (b_{1}, b_{2}) ∙_{4} (g, h) & = b_{1} ∙_{1} h (b_{2}) = b_{2} ∙_{2} g (b_{1}) . \end{matrix}

Observe that for any $f_{1} \in F_{1}$ , there is a $(g_{f_{1}}, h_{f_{1}}) \in H$ , given by $g_{f_{1}} (b_{1}) = b_{1} \cdot_{1} f_{1}$ and $h_{f_{1}} (b_{2}) = f_{1}$ . This is clearly a morphism, since

\begin{matrix} b_{1} ∙_{1} h_{f_{1}} (b_{2}) & = b_{1} ∙_{1} f_{1} = b_{1} \cdot_{1} f_{1} = g_{f_{1}} (b_{1}) = b_{2} ∙_{2} g_{f_{1}} (b_{1}) . \end{matrix}

Similarly, for any $f_{2} \in F_{2}$ , there is a morphism $(g_{f_{2}}, h_{f_{2}}) \in H$ given by $g_{f_{2}} (b_{1}) = f_{2}$ and $h_{f_{2}} (b_{2}) = b_{2} \cdot_{2} f_{2}$ . Finally, for any $r \in R_{3}$ , there is a morphism $(g_{r}, h_{r}) \in H$ , given by $g_{r} (b_{1}) = h_{r} (b_{2}) = r$ , which is also clearly a morphism.

We show that these are in fact all of the morphisms in $H$ . Indeed, let $(g, h)$ be a morphism in $H$ , let $b_{1}$ be an element of $B_{1}$ , and let $b_{2}$ be an element of $b_{2}$ . Let

\begin{matrix} r & = b_{2} ∙_{2} g (b_{1}) = b_{1} ∙_{1} h (b_{2}) . \end{matrix}

If $r \in R_{3}$ , then $g (b_{1}) = h (b_{2}) = r$ , so given any $b_{1}^{'} \in B_{1}$ ,

\begin{matrix} b_{2} ∙_{2} g (b_{1}^{'}) & = b_{1}^{'} ∙_{1} h (b_{2}) = r \end{matrix}

$\in R_{3}$ , and so

\begin{matrix} g (b_{1}^{'}) & = b_{2} ∙_{2} g (b_{1}^{'}) = r . \end{matrix}

Similarly, for any $b_{2}^{'} \in B_{2}$ , $h (b_{2}^{'}) = r$ and so $(g, h) = (g_{r}, h_{r})$ .

If $r \in R_{1}$ , then $g (b_{1}) = r$ , and $h (b_{2}) \in F_{1}$ . Let $f_{1} = h (b_{2})$ . Given any $b_{1}^{'} \in B_{1}$ ,

\begin{matrix} b_{2} ∙_{2} g (b_{1}^{'}) & = b_{1}^{'} ∙_{1} h (b_{2}) = b_{1}^{'} ∙_{1} f_{1} \end{matrix}

$\in R_{1}$ , so

\begin{matrix} g (b_{1}^{'}) & = b_{2} ∙_{2} g (b_{1}^{'}) = b_{1}^{'} ∙_{1} f_{1} = b_{1}^{'} \cdot_{1} f_{1} . \end{matrix}

Given any $b_{2}^{'} \in B_{2}$ ,

\begin{matrix} b_{1} ∙_{1} h (b_{2}^{'}) & = b_{2}^{'} ∙_{2} g (b_{1}) = b_{2}^{'} ∙_{2} r = r \end{matrix}

$\in R_{1}$ , and so

\begin{matrix} h (b_{2}^{'}) & = b_{1} ∙_{1} h (b_{2}^{'}) = r . \end{matrix}

Thus, $(g, h) = (g_{f_{1}}, h_{f_{1}})$ .

Finally, if $r \in R_{2}$ , we similarly have $(g, h) = (g_{f_{2}}, h_{f_{2}})$ , where $f_{2} = g (b_{1}) \in F_{2}$ .

We construct a pair of morphisms

\begin{matrix} (g_{0}, h_{0}) : (B_{1} \times B_{2}, H, ∙_{4}) \to (B_{1} \times B_{2}, F_{1} ⊔ F_{2} ⊔ R_{3}, ∙_{3}) \end{matrix}

and

\begin{matrix} (g_{1}, h_{1}) : (B_{1} \times B_{2}, F_{1} ⊔ F_{2} ⊔ R_{3}, ∙_{3}) \to (B_{1} \times B_{2}, H, ∙_{4}), \end{matrix}

by letting $g_{0}$ and $g_{1}$ be the identity on $B_{1} \times B_{2}$ , letting $h_{0} : F_{1} ⊔ F_{2} ⊔ R_{3} \to H$ be given by $h_{0} (f) = (g_{f}, h_{f})$ as above. Since we have shown that $h_{0}$ is surjective, we let $h_{1}$ be any right inverse to $h_{0}$ . It is easy to show that both of these are morphisms by the construction of $(g_{f}, h_{f})$ , and they compose to something homotopic to the identity in both orders since $g_{0} \circ g_{1}$ and $g_{1} \circ g_{0}$ are the identity of $B_{1} \times B_{2}$ .

Thus $(D_{1} & 1_{R_{2} \cup R_{3}}) \otimes (D_{2} & 1_{R_{1} \cup R_{3}}) ≃ D_{1} & D_{2} & 1_{R_{3}}$ , so $C$ 's agent can observe $V$ according to the constructive multiplicative definition, completing the proof. $□$

You may have noticed that the last part of the proof would have been much simpler if $\otimes$ distributed over $&$ , but $\otimes$ does not in general distribute over $&$ . ( $\otimes$ distributes over $\oplus$ and $⅋$ distributes over $&$ .)

In this case, however, $\otimes$ does distribute over $&$ . I do not plan on going over it now, but there is actually an interesting relationship between observables and cases where $\otimes$ distributes over $&$ .

4.3. Example

Let $C_{0}$ be defined as in the previous examples, with $R = {u r, n r}$ and $S = {u s, n s}$ . Let $T_{X} = (W ∖ X) \cap Image (C_{0})$ , so that $1_{T_{R}} = 1_{S}$ and $1_{T_{S}} = 1_{R}$ . By the multiplicative definitions of observables, there then exist two frames

$C_{1} = {Assume}_{R} (C) & 1_{S} = \begin{matrix} \begin{matrix} r & u s & n s \end{matrix} \begin{matrix} r \to u r \to n \end{matrix} & (\begin{matrix} u r & u s & n s n r & u s & n s \end{matrix}) \end{matrix}$

and

$C_{2} = {Assume}_{S} (C) & 1_{R} = \begin{matrix} \begin{matrix} s & u r & n r \end{matrix} \begin{matrix} s \to u s \to n \end{matrix} & (\begin{matrix} u s & u r & n r n s & u r & n r \end{matrix}) \end{matrix}$

such that $C_{0} ≃ C_{1} \otimes C_{2}$ .

Here, $C_{1}$ is an agent that treats the "makes decisions when it's sunny" part of itself as though it were an external process. Similarly, $C_{2}$ externalizes its ability to make decisions when it's rainy.

This example illustrates both multiplicative definitions, and also shows the construction used in the constructive multiplicative definition.

Appealing again to the fact that ${Assume}_{S} (C) ≅ 1_{S} \otimes C$ , we also have the option of restating $C_{0} ≃ C_{1} \otimes C_{2}$ here as $C_{0} ≃ ((1_{R} \otimes C_{0}) & 1_{S}) \otimes ((1_{S} \otimes C_{0}) & 1_{R})$ . In words, this says that $Agent (C_{0})$ is (biextensionally equivalent to) a team consisting of:

that very agent, picking an action after the environment either (a) gives it a promise it will rain or (b) makes it powerless and doesn't rain; and
that very agent, picking an action after the environment either (a) gives it a promise it won't rain or (b) makes it powerless and rains.

4.4. Updatelessness

The relationship between observables' additive and multiplicative definitions is interesting. You can think of the additive definition as updateful, while the multiplicative definition is updateless.

The $C_{i}$ in the additive definition are basically given a promise that the world will end up in $S_{i}$ . The $C_{i}$ in the multiplicative definition, however, are instead given a promise that their choices have no effect on worlds outside of $S_{i}$ .

I think the updateless factorization is better, and thus prefer the multiplicative definition in spite of the fact that it is more complicated.

When an updateless agent observes something, it becomes the version of itself that only affects the worlds in which it makes that observation. When an updateful agent observes something, we assume that all the worlds in which it does not make that observation do not exist. The fact that the additive and multiplicative definitions above are equivalent illustrates the equivalence of the updateful and updateless views in the simple cases where there is true observation. However, they diverge as soon as you want to try to approximate observation. The updateless view approximates better, as it makes sense to think of a subagent that has only a very small effect on worlds in which it does not make the observation that it makes.

Also, note that the $C_{i}$ in the additive definition are not subagents of $C$ , but they are additive sub-environments. The $C_{i}$ in the multiplicative definition are multiplicative subagents of $C$ .

5. Internalizing-Externalizing Definitions

Next, we have the nonconstructive internalizing-externalizing definition of observables.

Definition: We say that $C$ 's agent can observe a finite partition $V$ of $W$ if either $A = {}$ or $C$ is biextensionally equivalent to something in the image of ${External}_{V} \circ {Internal}_{V}$ .

Again, we have a constructive version of this definition.

Definition: We say that $C$ 's agent can observe a finite partition $V$ of $W$ if either $A = {}$ or $C ≃ {External}_{V} ({Internal}_{V} (C))$ .

Claim: These definitions are equivalent to each other and to the definitions above.

Proof: The case where $A = {}$ is trivial, so we assume that $A$ is nonempty. Clearly if $C$ 's agent can observe $V$ under the constructive internalizing-externalizing definition, then $V$ is also observable in $C$ under the non-constructive version.

Next, assume that $C$ is in the image of ${External}_{V} \circ {Internal}_{V}$ (up to biextensional equivalence). Recall that the image of ${Internal}_{V}$ up to biextensional equivalence is exactly those Cartesian frames $(B, F, ⋆)$ such then $F$ is nonempty and for all $f_{0}, f_{1} \in F$ and $b \in B,$ we have $v (b ⋆ f_{0}) = v (b ⋆ f_{1})$ . Thus, $C ≃ {External}_{V} (B, F, ⋆)$ , where $(B, F, ⋆)$ is of this form. Let $v_{B} : B \to V$ send each element $b \in B$ to the unique $v_{b} \in V$ such that $v (b ⋆ f) = v_{b}$ for all $f \in F$ , and let $V_{B}$ be the image of $v_{B}$ . Then, ${External}_{V} (B, F, ⋆) = (B / X, X \times F, ⋄)$ , where $X = {{b \in B | v_{B} (b) = v^{'}} | v^{'} \in V_{B}}$ , and $q ⋄ (x, f) = q (x) ⋆ f$ .

Let $V_{B} = {v_{1}, \dots, v_{m}}$ , and let $B_{i} = {b \in B | v_{B} (b) = v_{i}}$ . Then, we clearly have that ${External}_{V} (B, F, ⋆) ≅ (B_{1} \times \dots \times B_{m}, V_{B} \times F, ∙)$ , where $(b_{1}, \dots, b_{m}) ∙ (v_{i}, f) = b_{i} ⋆ f$ . But this is clearly isomorphic to $D_{1} & \dots & D_{m}$ , where $D_{i} = (B_{i}, F, ⋆_{i})$ , where $b ⋆_{i} f = b ⋆ f$ . Thus, $C$ 's agent can observe $V$ according to the nonconstructive additive definition of observables.

Finally, we assume that $C$ 's agent can observe $V$ according to the nonconstructive additive definition of observables, and we show that $C$ 's agent can observe $V$ according to the constructive internalizing-externalizing definition. Let $C ≃ C_{1} & \dots & C_{n}$ , where $C_{i} ◃ ⊥_{S_{i}}$ . Let $C_{i} = (A_{i}, E_{i}, \cdot_{i})$ , and without loss of generality, let $C = C_{1} & \dots & C_{n} = (A, E, \cdot)$ , where $A = A_{1} \times \dots \times A_{n}$ and $E = E_{1} ⊔ \dots ⊔ E_{n}$ .

First, we show that ${Internal}_{V} (C) ≃ C_{1} \oplus \dots \oplus C_{n}$ . Let $C_{1} \oplus \dots \oplus C_{n} = (A_{1} ⊔ \dots ⊔ A_{n}, E_{1} \times \dots \times E_{n}, ⋆)$ . Observe that (since $A$ is nonempty), ${Internal}_{V} (C) ≅ (A \times F, B / F, ⋆^{'})$ , where $F = {E_{1}, \dots, E_{n}}$ , where $(a, f) ⋆ q = a \cdot q (f)$ .

We construct

\begin{matrix} (g_{0}, h_{0}) : (A_{1} ⊔ \dots ⊔ A_{n}, E_{1} \times \dots \times E_{n}, ⋆) \to (A \times F, B / F, ⋆^{'}) \end{matrix}

and

\begin{matrix} (g_{1}, h_{1}) : (A \times F, B / F, ⋆^{'}) \to (A_{1} ⊔ \dots ⊔ A_{n}, E_{1} \times \dots \times E_{n}, ⋆) \end{matrix}

as follows. Let $g_{1} ((a_{1}, \dots, a_{n}), E_{i}) = a_{i}$ . Let $g_{0} (a_{i}) = ((a_{1}, \dots, a_{i}, \dots, a_{n}), E_{i})$ , where $a_{i} \in A_{i}$ , and $a_{j} \in A_{j}$ is chosen arbitrarily for $j \neq i$ . Let $h_{0} (q) = (q (E_{1}), \dots, q (E_{n}))$ , and $h_{1} (e_{1}, \dots, e_{n}) = q$ , where $q (E_{i}) = e_{i}$ . Clearly, $h_{0}$ and $h_{1}$ are inverses.

To see that $(g_{0}, h_{0})$ is a morphism, observe that for all $a_{i} \in A_{1} ⊔ \dots ⊔ A_{n}$ and $q \in B / F$ , we have

\begin{matrix} g_{0} (a_{i}) ⋆^{'} q & = ((a_{1}, \dots, a_{i}, \dots, a_{n}), E_{i}) ⋆^{'} q = (a_{1}, \dots, a_{i}, \dots, a_{n}) \cdot q (E_{i}) = a_{i} \cdot_{i} q (E_{i}) = a_{i} ⋆ (q (E_{1}), \dots, q (E_{n})) = a_{i} ⋆ h_{0} (q), \end{matrix}

where $a_{i} \in A_{i}$ .

To see that $(g_{1}, h_{1})$ is a morphism, observe that for all $((a_{1}, \dots, a_{n}), E_{i}) \in A \times F$ , and for all $(e_{1}, \dots e_{n}) \in E_{1} \times \dots \times E_{n}$ , we have

\begin{matrix} g_{1} ((a_{1}, \dots, a_{n}), E_{i}) ⋆ (e_{1}, \dots, e_{n}) & = a_{i} ⋆ (e_{1} \dots, e_{n}) = a_{i} \cdot_{i} e_{i} = (a_{1}, \dots, a_{n}) \cdot e_{i} = (a_{1}, \dots, a_{n}) \cdot h_{1} (e_{1}, \dots, e_{n}) (E_{i}) = ((a_{1}, \dots, a_{n}), E_{i}) ⋆ h_{1} (e_{1}, \dots, e_{n}) . \end{matrix}

It is clear that $(g_{0}, h_{0}) \circ (g_{1}, h_{1})$ and $(g_{1}, h_{1}) \circ (g_{0}, h_{0})$ are both homotopic to the identity, since $h_{0} \circ h_{1}$ and $h_{1} \circ h_{0}$ are both the identity.

Now, we have that ${Internal}_{V} (C_{1} & \dots & C_{n}) ≃ C_{1} \oplus \dots \oplus C_{n}$ , and so we also have dually that ${External}_{V} (C_{1} \oplus \dots \oplus C_{n}) ≃ C_{1} & \dots & C_{n}$ . Thus, $C ≃ {External}_{V} ({Internal}_{V} (C))$ . $□$

The thing that is going on here is that when $C$ internalizes $V$ , the agent of $C$ then has the full ability to choose how $V$ goes (among ways of $V$ going that were possible in $C$ ). ${Internal}_{V} (C)$ might have other choices than just choosing how $V$ goes. If it does, then it can freely entangle those other choices with the choice of $V$ however it wants.

When $C$ then externalizes $V$ , it loses all control over $V$ . However, it preserves the ability to entangle all of its other choices with the way that $V$ goes. This ability for the agent to entangle its choices with $V$ is exactly what it means to say " $V$ is observable."

5.1. Example

Let $C_{0}$ be defined as in the previous examples, with $V = {{u r, n r}, {u s, n s}}$ . By the internalizing-externalizing definitions, there exists a frame

${Internal}_{V} (C_{0}) ≅ \begin{matrix} \begin{matrix} \end{matrix} \begin{matrix} (u, r) (u, s) (n, r) (n, s) (u \leftrightarrow r, r) (u \leftrightarrow r, s) (u \leftrightarrow s, r) (u \leftrightarrow s, s) \end{matrix} & ⎛ ⎜ ⎜ ⎜ ⎜ ⎜ ⎜ ⎜ ⎜ ⎜ ⎜ ⎜ ⎜ ⎜ ⎝ \begin{matrix} u r u s n r n s u r n s n r u s \end{matrix} ⎞ ⎟ ⎟ ⎟ ⎟ ⎟ ⎟ ⎟ ⎟ ⎟ ⎟ ⎟ ⎟ ⎟ ⎠ \end{matrix}$ ,

which is biextensionally equivalent to

$C_{1} = \begin{matrix} \begin{matrix} \end{matrix} \begin{matrix} u r n r u s n s \end{matrix} & ⎛ ⎜ ⎜ ⎜ ⎝ \begin{matrix} u r n r u s n s \end{matrix} ⎞ ⎟ ⎟ ⎟ ⎠ \end{matrix}$ .

We then have that

${External}_{V} (C_{1}) ≅ \begin{matrix} \begin{matrix} r & s \end{matrix} \begin{matrix} (r \to u r, s \to u s) (r \to n r, s \to n s) (r \to u r, s \to n s) (r \to n r, s \to u r) \end{matrix} & ⎛ ⎜ ⎜ ⎜ ⎝ \begin{matrix} u r & u s n r & n s u r & n s n r & u s \end{matrix} ⎞ ⎟ ⎟ ⎟ ⎠ \end{matrix}$ ,

which is isomorphic to $C_{0}$ .

This example illustrates both internalizing-externalizing definitions, and also shows the construction used in the constructive definition.

In our next post, we'll conclude the sequence by showing how to formalize agents that learn and act over time using Cartesian frames.

26 comments

Comments sorted by top scores.

comment by Ramana Kumar (ramana-kumar) · 2021-02-05T09:55:33.395Z · LW(p) · GW(p)

How is this supposed to work (focusing on the claim specifically)?

and so

\begin{matrix} h (b_{2}^{'}) & = b_{1} ∙_{1} h (b_{2}^{'}) = r . \end{matrix}

Thus, $(g, h) = (g_{f_{1}}, h_{f_{1}})$ .

Earlier, $h_{f_{1}}$ was defined as follows:

given by $g_{f_{1}} (b_{1}) = b_{1} \cdot_{1} f_{1}$ and $h_{f_{1}} (b_{2}) = f_{1}$

but there is no reason to suppose $f_{1} = r$ above.

Replies from: rohinmshah

↑ comment by Rohin Shah (rohinmshah) · 2021-02-10T17:04:24.876Z · LW(p) · GW(p)

The problem is a bit earlier actually:

This isn't true, because $∙_{1}$ doesn't just ignore $b_{1}$ here (since $r \in R_{1}$ ). I think the route is to say "Let $h (b_{2}^{'}) = f_{2}$ . Then $∙_{1}$ must treat $f_{1}$ and $f_{2}$ identically, meaning that either they are equal, or the frame is biextensionally equivalent to one where they are equal."

Replies from: ramana-kumar

↑ comment by Ramana Kumar (ramana-kumar) · 2021-02-11T08:25:29.721Z · LW(p) · GW(p)

Using the idea we talked about offline, I was able to fix the proof - thanks Rohin!
Summary of the fix:
When and $D_{2}$ are defined, additionally assume they are biextensional (take their biextensional collapse), which is fine since we are trying to prove a biextensional equivalence. (By the way this is why we can't take $b_{1} = b_{2}$ , since we might have $A \supseteq B_{1} \neq B_{2} \subseteq A$ after biextensional collapse.) Then to prove $h = h_{f_{1}}$ , observe that for all $b \in B_{1}$ , $b ∙_{1} h (b_{2}^{'}) = b ∙_{1} h (b_{2})$ which means $b ⋆_{1} h (b_{2}^{'}) = b ⋆_{1} f_{1}$ , hence $h (b_{2}^{'}) = f_{1}$ since a biextensional frame has no duplicate columns.

comment by Ramana Kumar (ramana-kumar) · 2021-01-29T19:23:42.718Z · LW(p) · GW(p)

this is clearly isomorphic to , where $D_{i} = (B_{i}, F, ⋆_{i})$ , where $b ⋆_{i} f = b ⋆ f$ . Thus, $C$ 's agent can observe $V$ according to the nonconstructive additive definition of observables.

I think this is only true if $V_{B}$ partitions $W$ , or, equivalently, if $v_{B}$ is surjective. This isn't shown in the proof. Is it supposed to be obvious?

EDIT: may be able to fix this by assigning any $s \in V$ that is not in $V_{B}$ to the frame $⊤$ so it is harmless in the product of $D_{i}$ s -- I will try this.

comment by Ramana Kumar (ramana-kumar) · 2021-01-26T18:53:34.223Z · LW(p) · GW(p)

and observe that

This cannot be true. I can prove in general ${A s s u m e}_{S_{1} \cup S_{2}} (C) ≆ {A s s u m e}_{S_{1}} (C) & {A s s u m e}_{S_{2}} (C)$ whenever $| A g e n t (C) | > 1$ by observing that the agent cardinalities on each side differ.

Replies from: Scott Garrabrant

↑ comment by Scott Garrabrant · 2021-01-27T08:09:31.556Z · LW(p) · GW(p)

Yep, changed it to .

Replies from: ramana-kumar

↑ comment by Ramana Kumar (ramana-kumar) · 2021-01-28T12:40:18.696Z · LW(p) · GW(p)

I haven't yet figured out why it's true under - I'll keep trying, but let me know if there's a quick argument for why this holds. (Default next step for me would be to see if I can restrict attention to the world $S_{1} \cup S_{2}$ then do something similar to my other comment [LW(p) · GW(p)].)

Replies from: Scott Garrabrant

↑ comment by Scott Garrabrant · 2021-01-28T23:24:01.157Z · LW(p) · GW(p)

I am confused, why is it not identical to your other comment?

Replies from: ramana-kumar

↑ comment by Ramana Kumar (ramana-kumar) · 2021-01-29T08:26:06.384Z · LW(p) · GW(p)

Because and $S_{2}$ are not a partition of the world here.

EDIT: but what we actually need in the proof is ${A s s u m e}_{S_{1}} (C) & {A s s u m e}_{S_{2}} (C) & \dots ≃ {A s s u m e}_{S_{1} \cup S_{2}} (C) & \dots$ where the $\dots$ do result in a partition, so I think this will work out the same as the other comment. I'm still not convinced about biextensional equivalence between the frames without the rest of the product.

Replies from: ramana-kumar

↑ comment by Ramana Kumar (ramana-kumar) · 2021-01-29T10:52:33.342Z · LW(p) · GW(p)

And it seems we do actually need in the proof to justify:

Thus it suffices to show that $(D_{1} & 1_{R_{2} \cup R_{3}}) \otimes (D_{2} & 1_{R_{1} \cup R_{3}}) ≃ D_{1} & D_{2} & 1_{R_{3}}$ .

Without it, we have to show $(D_{1} & 1_{R_{2} \cup R_{3}}) \otimes (D_{2} & 1_{R_{1} \cup R_{3}}) ≃ {A s s u m e}_{S_{1} \cup S_{2}} (C) & 1_{R_{3}}$ instead.

Replies from: ramana-kumar

↑ comment by Ramana Kumar (ramana-kumar) · 2021-01-30T00:12:52.641Z · LW(p) · GW(p)

UPDATE: I was able to prove in general whenever $S_{1}$ and $S_{2}$ are disjoint and both in $O b s (C)$ , with help from Rohin Shah, following the "restrict attention to world $S_{1} \cup S_{2}$ " approach I hinted at earlier.

comment by Ramana Kumar (ramana-kumar) · 2021-01-26T18:22:36.405Z · LW(p) · GW(p)

where (C), $D_{1} = {Assume}_{S_{1}} (C)$

Presumably two of those indices should be $2$

Replies from: Scott Garrabrant

↑ comment by Scott Garrabrant · 2021-01-27T08:10:59.596Z · LW(p) · GW(p)

Fixed, thanks.

comment by Ramana Kumar (ramana-kumar) · 2021-01-26T14:19:37.813Z · LW(p) · GW(p)

Let . Thus, we also have that $C ≃ {Assume}_{S_{1} \cup S_{2}} (C) & {Assume}_{S_{3}} (C) & \dots & {Assume}_{S_{n}} (C)$

I'm not seeing why this follows. I'll look for a counterexample, but in the meantime maybe there's a simple explanation for why we can combine the product of two assumes as an assume of the union? (I think the only relevant assumption in this context is that the $S_{i}$ s partition the world; but I might be missing some other important assumption.)

EDIT: I can see how maybe this will follow from the definition of observability of a partition from subsets (which we are also assuming) and the fact that $O b s$ is closed under union... will try to figure that out. -- Yep I think this works out. Sorry for the confusion.

comment by Ramana Kumar (ramana-kumar) · 2021-01-26T10:44:59.899Z · LW(p) · GW(p)

Let , and let $D_{1} = (B_{1}, F_{1}, ⋆_{1}) = C_{1} \otimes \dots \otimes C_{n}$

Is $D_{1}$ supposed to be $C_{2} \otimes \dots$ here, rather than including $C_{1}$ ?

Replies from: Scott Garrabrant

↑ comment by Scott Garrabrant · 2021-01-27T08:13:27.442Z · LW(p) · GW(p)

Fixed, thanks.

comment by Ramana Kumar (ramana-kumar) · 2021-01-24T15:50:51.452Z · LW(p) · GW(p)

Next, assume that 's agent can observe $V$ according to the additive definition. We will show that $C$ 's agent can observe $S_{1}$ .

I might be misunderstanding this, but the proof suggests you're actually assuming the assuming definition here, not the additive definition. In which case we may be missing the proof of implication of any of the other definitions from the additive definition.

Replies from: Scott Garrabrant

↑ comment by Scott Garrabrant · 2021-01-27T08:46:37.001Z · LW(p) · GW(p)

I think I fixed it. Thanks.

comment by Ramana Kumar (ramana-kumar) · 2021-01-24T08:11:29.386Z · LW(p) · GW(p)

Definition: We say that 's agent can observe a finite partition $V$ of $W$ if for all functions $f : V \to A$ , there exists an element $a_{f} \in A$ such that for all $e \in E$ , $f (v (a_{f} \cdot e)) \cdot e = a_{f} \cdot e$ .
Claim: This definition is equivalent to the definition from subsets.

This doesn't hold in the degenerate case $W = \emptyset$ , since then we have an empty function $f$ but no elements of $A$ . (But the definition from subsets holds trivially.)

Replies from: Scott Garrabrant

↑ comment by Scott Garrabrant · 2021-01-27T08:27:45.524Z · LW(p) · GW(p)

This was annoying to fix, so I just made nonempty in the intro to the post.

comment by adamShimi · 2020-11-11T19:51:48.497Z · LW(p) · GW(p)

This is the first post in the sequence that I fully read since the Introduction. So I'm not going to be able to say anything really useful about the proofs. Still, I was curious about the philosophical aspects of these definitions, so I read this post anyway.

That being said, I still think that I understood some part of the definitions, after checking terms from previous posts. My handwavy understanding of your definitions is

The definitions about subset and conditional policy just rephrase that an observable is something on which the agent can condition it's own policy. So the agent can observe the partition if it can condition policy on the set of the partition in which it finds itself.
The additive definitions say that that the agent can observe the partition if the cartesian frame can be decomposed into mutually exclusive cartesian frames, one for each set of the partition, in which the agent acts as if it is in a world of this set.
The multiplicative definitions say that the agent can observe the partition if the cartesian frame can be decomposed into a product of cartesian frames, one for each subset of the partition, such that the agent is unable to impact the world if it is outside its subset of the partition. The interpretation of the product is that there's a supervisor agent that control all agents at the same time, and so here, it controls all of them until the observation, after which he morally only controls the one in the observed set (because the other are powerless).
The internalizing-externalizing definitions say the agent can observe the partition if the cartesian frame can be decomposed into the composition of making the agent able to choose in which set of V it is, but then removing this choice from it, which amount to letting it condition on V, without actually giving it the power to do so.

Is there something really wrong here?

Also, I'm curious if you have an interpretation of the differences between internalizing-externalizing definitions and the others, just like your section on updatelessness compared additive and multiplicative definitions. (Really cool section philosophically, by the way!)

Replies from: Scott Garrabrant

↑ comment by Scott Garrabrant · 2020-11-11T20:29:33.970Z · LW(p) · GW(p)

Seems right, except I don't use the word "product" for the multiplicative definition.

I don't have much to say about the internalizing-externalizing definition philosophically. One thing to say is that I think the condition that observes $S$ is a weaker notion of observability, that might actually agree with philosophical intuition more, and the internalizing-externalizing definition might be easier to interpret if you are thinking in terms of this condition.

comment by Ramana Kumar (ramana-kumar) · 2021-01-30T09:21:39.636Z · LW(p) · GW(p)

Let

nit: $B_{1}$ should be $D_{1}$ here

and let $b_{2}$ be an element of $b_{2}$ .

and the second $b_{2}$ should be $B_{2}$ . I think for these $b_{1}$ and $b_{2}$ to exist you might need to deal with the $A = \emptyset$ case separately (as in Section 5). (Also couldn't you just use the same $b$ twice?)

Replies from: ramana-kumar

↑ comment by Ramana Kumar (ramana-kumar) · 2021-01-30T13:50:57.192Z · LW(p) · GW(p)

Indeed I think the case may be the basis of a counterexample to the claim in 4.2. I can prove for any (finite) $W$ with $| W | > 1$ that there is a finite partition $V$ of $W$ such that $C$ 's agent observes $V$ according to the assuming definition but does not observe $V$ according to the constructive multiplicative definition, if I take $C = n u l l .$

Replies from: ramana-kumar

↑ comment by Ramana Kumar (ramana-kumar) · 2021-02-05T10:59:31.881Z · LW(p) · GW(p)

I presume the fix here will be to add an explicit escape clause to the multiplicative definitions. I haven't been able to confirm this works out yet (trying to work around this [LW(p) · GW(p)]), but it at least removes the $n u l l$ counterexample.

Replies from: ramana-kumar

↑ comment by Ramana Kumar (ramana-kumar) · 2021-02-11T08:29:17.053Z · LW(p) · GW(p)

With the other problem resolved, I can confirm that adding an escape clause to the multiplicative definitions works out.

Eight Definitions of Observability

Contents

1. Definition from Subsets

2. Conditional Policies Definition

3. Additive Definitions

4. Multiplicative Definitions

5. Internalizing-Externalizing Definitions

26 comments