Additive Operations on Cartesian Frames

post by Scott Garrabrant · 2020-10-26T15:12:14.556Z · LW · GW · 6 comments

  1. What Do These Morphisms Represent?
    1.1. Morphisms as Interfaces
    1.2. Morphisms as Differences in Agents' Strength
    1.3. Simple Examples of Morphisms
    1.4. Examples of Morphisms Going Both Ways
  2. Self-Duality
  3. Sums of Cartesian Frames
  4. Products of Cartesian Frames
  Footnotes

The mathematical object (but not the philosophical interpretation) of a Cartesian frame is studied under the name "Chu space."

(In category theory, Chu spaces are usually studied in the special case of . To learn more about Chu spaces, see Vaughan Pratt's guide to papers and nLab's page on the Chu construction.)

In this post and the next one, I'll mostly be discussing standard facts about Chu spaces. I'll also discuss how to interpret the standard definitions as statements about agency.

Chu spaces form a category as a special case of the Chu construction. You may notice a strong similarity between operations on Cartesian frames and operations in linear logic, coming from the fact that the Chu construction is also intimately related to linear logic, and is used in the semantics for linear logic.

Linear logic has a large number of operations, such as additive conjunction ( $&$ ) and multiplicative conjunction ( $\otimes$ ), and many of those symbols turn out to have meaningful interpretations for Cartesian frames. For that reason, we'll be stealing much of our notation from linear logic, though this sequence won't assume familiarity with linear logic.

Definition: $Chu (W)$ is the category whose objects are Cartesian frames over $W$ , whose morphisms from $C = (A, E, \cdot)$ to $D = (B, F, ⋆)$ are pairs of functions $(g : A \to B, h : F \to E)$ , such that $a \cdot h (f) = g (a) ⋆ f$ for all $a \in A$ and $f \in F$ , and whose composition of morphisms is given by $(g_{1}, h_{1}) \circ (g_{0}, h_{0}) = (g_{1} \circ g_{0}, h_{0} \circ h_{1})$ .

The composition of two morphisms $C_{0} \to C_{1}$ and $C_{1} \to C_{2}$ , then, sends the agent of $C_{0}$ to $C_{2}$ and sends the environment of $C_{2}$ to $C_{0}$ .
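As a minimal sketch of this definition, the snippet below stores a frame as a triple `(agents, envs, ev)`, where `ev[(a, e)]` is the resulting world, and stores each function as a dict. The helper names `is_morphism` and `compose` and the three small frames are assumptions made for illustration; they are not from the post.

```python
def is_morphism(C, D, g, h):
    """True iff a . h(f) == g(a) * f for all a in Agent(C), f in Env(D)."""
    A, _, dot = C
    _, F, star = D
    return all(dot[(a, h[f])] == star[(g[a], f)] for a in A for f in F)


def compose(m1, m0):
    """(g1, h1) o (g0, h0) = (g1 o g0, h0 o h1), with maps stored as dicts."""
    (g1, h1), (g0, h0) = m1, m0
    g = {a: g1[g0[a]] for a in g0}      # agents are pushed forward
    h = {e: h0[h1[e]] for e in h1}      # environments are pulled back
    return g, h


# Three hypothetical frames over W = {"w0", "w1"}.
C0 = (("a",), ("e0", "e1"), {("a", "e0"): "w0", ("a", "e1"): "w1"})
C1 = (("b0", "b1"), ("f0", "f1"),
      {("b0", "f0"): "w0", ("b0", "f1"): "w1",
       ("b1", "f0"): "w1", ("b1", "f1"): "w1"})
C2 = (("c0", "c1"), ("k",), {("c0", "k"): "w1", ("c1", "k"): "w0"})

m0 = ({"a": "b0"}, {"f0": "e0", "f1": "e1"})   # C0 -> C1
m1 = ({"b0": "c1", "b1": "c0"}, {"k": "f0"})   # C1 -> C2
m2 = compose(m1, m0)                           # C0 -> C2

print(is_morphism(C0, C1, *m0), is_morphism(C1, C2, *m1), is_morphism(C0, C2, *m2))
```

Note that the agent map of the composite runs forward from $C_{0}$ to $C_{2}$ while the environment map runs backward from $C_{2}$ to $C_{0}$, which is why `compose` applies `h1` before `h0`.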

Claim: $Chu (W)$ is a category.

Proof: It suffices to show that composition is well-defined and associative and there exist identity morphisms. For identity, $({id}_{A}, {id}_{E})$ is clearly an identity on $C = (A, E, \cdot)$ , where ${id}_{X}$ is the identity map from $X$ to itself.

The composition $(g_{1}, h_{1}) \circ (g_{0}, h_{0})$ of $(g_{0}, h_{0}) : C_{0} \to C_{1}$ with $(g_{1}, h_{1}) : C_{1} \to C_{2}$ is $(g_{1} \circ g_{0}, h_{0} \circ h_{1}) : C_{0} \to C_{2}$ . To verify that this is a morphism, we just need that $a_{0} \cdot_{0} h_{0} (h_{1} (e_{2})) = g_{1} (g_{0} (a_{0})) \cdot_{2} e_{2}$ for all $a_{0} \in Agent (C_{0})$ , and $e_{2} \in Env (C_{2})$ , where $\cdot_{i} = Eval (C_{i})$ . Indeed,

$a_{0} \cdot_{0} h_{0} (h_{1} (e_{2})) = g_{0} (a_{0}) \cdot_{1} h_{1} (e_{2}) = g_{1} (g_{0} (a_{0})) \cdot_{2} e_{2},$

since each component is a morphism.

Associativity of the composition follows from the fact that it is just a pair of compositions of functions on sets, and composition is associative for sets. $□$

1. What Do These Morphisms Represent?

1.1. Morphisms as Interfaces

A Cartesian frame is a first-person perspective. The agent $A$ finds itself in a certain situation or game, where it expects to encounter an environment $E$ . The morphisms in $Chu (W)$ allow the agent of one Cartesian frame to play the game of another Cartesian frame.

We can think of the morphisms from $C = (A, E, \cdot)$ to $D = (B, F, ⋆)$ as ways of fitting the agent of $C$ into the environment of $D$ . Indeed, for every morphism $(g, h) : C \to D$ , one can construct a Cartesian frame $(A, F, ⋄)$ , whose agent matches $C$ 's agent, and whose environment matches $D$ 's environment, with $⋄$ given by $a ⋄ f = a \cdot h (f) = g (a) ⋆ f$ . (The morphism from $C$ to $D$ can actually be viewed as the composition of $({id}_{A}, h) : C \to (A, F, ⋄)$ with $(g, {id}_{F}) : (A, F, ⋄) \to D$ .)

Two random large Cartesian frames will typically have no morphisms between them. When there is a morphism, the morphism functions as an interface that allows the agent $A$ to interact with some other environment $F$ . However, we aren't just randomly throwing $A$ and $F$ together. $A$ 's interaction with $F$ factors through the function $h : F \to E$ , so $A$ can in a sense still be thought of as using an interface where it interacts with $E$ . It just interacts with an $e \in E$ that is of the form $h (f)$ for some $f \in F$ . But this is happening simultaneously with the dual view in which $F$ can be thought of as still interacting with $B$ !

Since a Cartesian frame is a first-person perspective, you can imagine $A$ having the internal experience of interacting with $E$ , while $F$ has the "experience" of interacting with $B$ . The morphism's job is to be the translation interface that allows this $A$ and $F$ to interact with each other, while preserving their respective internal experiences in such a way that they feel like they're interacting with $E$ and $B$ respectively. $A$ gets to play $B$ 's game, while still thinking that it is playing its own game.

1.2. Morphisms as Differences in Agents' Strength

We can also interpret the existence of a morphism from $C = (A, E, \cdot)$ to $D = (B, F, ⋆)$ as saying something like " $D$ 's agent is at least as strong as $C$ 's agent."

This is easiest to see for a morphism $(g, h) : C \to D$ where $g$ and $h$ are both injective. In this case, it is as though $A \subseteq B$ and $F \subseteq E$ , so $D$ 's agent has more options to choose between and fewer environments it has to worry about.

Since some of the environments in $E ∖ F$ might have been good for the agent, the agent isn't necessarily strictly better off in $D$ ; but in a zero-sum game, the agent will indeed be strictly better off. I think this justifies saying that $C$ 's agent is in some sense weaker than $D$ 's agent.

If $g$ or $h$ is not injective, we could duplicate elements of $B$ and $E$ to make it injective, so the interpretation " $C$ 's agent is no stronger than $D$ 's agent" is reasonable in that case as well. In particular, the existence of a morphism from $C$ to $D$ implies that $Ensure (C) \subseteq Ensure (D)$ (and thus $Ctrl (C) \subseteq Ctrl (D)$ ).
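The inclusion of ensurables can be checked directly on small frames. The reasoning is short: if $a$ ensures $S$ in $C$ , then $g (a) ⋆ f = a \cdot h (f) \in S$ for every $f \in F$ , so $g (a)$ ensures $S$ in $D$ . The sketch below assumes the definition of $Ensure$ from the introductory post (the sets $S \subseteq W$ such that some agent option lands in $S$ against every environment); the function name `ensurables` and the two frames are hypothetical.

```python
from itertools import combinations


def ensurables(frame, worlds):
    """All S within `worlds` such that some agent a has a . e in S for every e."""
    agents, envs, ev = frame
    row_images = [frozenset(ev[(a, e)] for e in envs) for a in agents]
    subsets = (frozenset(s)
               for r in range(len(worlds) + 1)
               for s in combinations(worlds, r))
    return {S for S in subsets if any(img <= S for img in row_images)}


W = ("w0", "w1")
# Hypothetical frames with a morphism C -> D: g(a) = b0, h(f0) = e0, h(f1) = e1.
C = (("a",), ("e0", "e1"), {("a", "e0"): "w0", ("a", "e1"): "w1"})
D = (("b0", "b1"), ("f0", "f1"),
     {("b0", "f0"): "w0", ("b0", "f1"): "w1",
      ("b1", "f0"): "w1", ("b1", "f1"): "w1"})

# Every set ensurable in C is also ensurable in D.
print(ensurables(C, W) <= ensurables(D, W))
```

Enumerating all subsets of $W$ is exponential in $|W|$ , so this is only meant for toy examples.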

However, the existence of a morphism is stronger than just saying the set of ensurables is larger. The morphism from $C$ to $D$ can be thought of as telling $D$ 's agent how to strategy-steal from $C$ 's agent, and thus do anything that $C$ 's agent can do.

We now provide a few examples to illustrate morphisms between Cartesian frames. (If you're ready to forge ahead, skip to §2 instead.)

1.3. Simple Examples of Morphisms

Imagine a student who is deciding between staying up late studying for a test ( $a_{s}$ ) or ignoring the test ( $a_{i}$ ). We will represent the student with a Cartesian frame over letter grades, where $W = \{A+, A, A-, B+, B, B-, C+, C, C-, D+, D, D-, F\}$ .

If the student doesn't study, her final grade is always a C+, represented by the possible world $C+$ . If she does study, she may oversleep and get a bad grade (represented by the environment selecting $e_{o}$ and putting her in $D-$ ). If she studies and doesn't oversleep, she is uncertain about whether her teacher is typical ( $e_{t}$ , resulting in $A-$ ) or unusually demanding ( $e_{d}$ , resulting in $B+$ ). We represent this with the frame

$C_{T} = \begin{matrix} & \begin{matrix} e_{t} & e_{d} & e_{o} \end{matrix} \\ \begin{matrix} a_{s} \\ a_{i} \end{matrix} & \left( \begin{matrix} A- & B+ & D- \\ C+ & C+ & C+ \end{matrix} \right) \end{matrix}$ .

Let us also suppose that yesterday, the student had the extra option of copying another student's answers on test day to get a sure A+. However, she decided not to cheat. We represent the student's options yesterday, prior to precommitting, with the frame

$C_{Y} = \begin{matrix} & \begin{matrix} f_{t} & f_{d} & f_{o} \end{matrix} \\ \begin{matrix} b_{s} \\ b_{i} \\ b_{c} \end{matrix} & \left( \begin{matrix} A- & B+ & D- \\ C+ & C+ & C+ \\ A+ & A+ & A+ \end{matrix} \right) \end{matrix}$ .

There is a morphism from the student's frame today to her frame yesterday, representing the fact that $Agent (C_{T})$ can be plugged into $Agent (C_{Y})$ 's game, or that the student was "stronger" yesterday than she is today.

Let us also suppose that the student's teacher is in fact demanding. If the student today knew this fact, we would instead represent her perspective with the frame

$C_{T^{'}} = \begin{matrix} & \begin{matrix} e_{d}^{'} & e_{o}^{'} \end{matrix} \\ \begin{matrix} a_{s}^{'} \\ a_{i}^{'} \end{matrix} & \left( \begin{matrix} B+ & D- \\ C+ & C+ \end{matrix} \right) \end{matrix}$ .

Here, we have a morphism from the student today ( $C_{T}$ ) to her perspective if she had an additional promise from the environment ( $C_{T^{'}}$ ). This represents the fact that $C_{T^{'}}$ can strategy-steal from a version of herself who knows strictly less.

Given two Cartesian frames $C_{0}$ and $C_{1}$ , I am not aware of an efficient universal method for determining whether there exists a morphism from $C_{0}$ to $C_{1}$ . Indeed, I conjecture that this problem might be NP-complete. In the above cases, however, we can see that there exist morphisms from $C_{T}$ to the other two frames by observing that $C_{T}$ is effectively $C_{Y}$ with a row deleted, or $C_{T^{'}}$ with a column added.

While $Agent (C_{Y})$ and $Agent (C_{T^{'}})$ are both stronger than $Agent (C_{T})$ , we have no morphisms between $C_{Y}$ and $C_{T^{'}}$ ; their options are different enough that we can't compare their strength directly.
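For frames this small, the existence claims above can be checked by exhaustive search. The following is a brute-force sketch, not an efficient method: it tries every pair of functions $(g, h)$ and keeps those satisfying the morphism condition. The helper names `frame` and `morphisms`, and the string labels for agents, environments, and grades, are assumptions made for illustration.

```python
from itertools import product


def frame(agents, envs, rows):
    """rows[i][j] is the world produced by agents[i] against envs[j]."""
    ev = {(a, e): rows[i][j]
          for i, a in enumerate(agents) for j, e in enumerate(envs)}
    return tuple(agents), tuple(envs), ev


def morphisms(C, D):
    """Enumerate every (g, h) with a . h(f) == g(a) * f, by exhaustive search."""
    (A, E, dot), (B, F, star) = C, D
    found = []
    for g_vals in product(B, repeat=len(A)):        # all functions g : A -> B
        g = dict(zip(A, g_vals))
        for h_vals in product(E, repeat=len(F)):    # all functions h : F -> E
            h = dict(zip(F, h_vals))
            if all(dot[(a, h[f])] == star[(g[a], f)] for a in A for f in F):
                found.append((g, h))
    return found


C_T = frame(["a_s", "a_i"], ["e_t", "e_d", "e_o"],
            [["A-", "B+", "D-"], ["C+", "C+", "C+"]])
C_Y = frame(["b_s", "b_i", "b_c"], ["f_t", "f_d", "f_o"],
            [["A-", "B+", "D-"], ["C+", "C+", "C+"], ["A+", "A+", "A+"]])
C_Tp = frame(["a_s'", "a_i'"], ["e_d'", "e_o'"],
             [["B+", "D-"], ["C+", "C+"]])

print(bool(morphisms(C_T, C_Y)), bool(morphisms(C_T, C_Tp)))
print(bool(morphisms(C_Y, C_Tp)), bool(morphisms(C_Tp, C_Y)))
```

The search visits $|B|^{|A|} \cdot |E|^{|F|}$ candidate pairs, so it is only workable for tiny frames, which is in keeping with the remark above that no efficient general method is known.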

1.4. Examples of Morphisms Going Both Ways

Every Cartesian frame has an identity morphism pointing to itself; and as we'll discuss in the next post, whenever two Cartesian frames $C$ and $D$ are equivalent (in a sense to be defined), there will be a morphism going from $C$ to $D$ and another going from $D$ to $C$ . But not all pairs of Cartesian frames with morphisms going both ways are equivalent. Consider, for example,

$C_{1} = \begin{matrix} & \begin{matrix} e_{0} & e_{1} \end{matrix} \\ \begin{matrix} a_{0} \\ a_{1} \end{matrix} & \left( \begin{matrix} w_{0} & w_{0} \\ w_{0} & w_{1} \end{matrix} \right) \end{matrix}$ and $D_{1} = \begin{matrix} & \begin{matrix} f_{0} \end{matrix} \\ \begin{matrix} b_{0} \end{matrix} & \left( \begin{matrix} w_{0} \end{matrix} \right) \end{matrix}$ .

In $C_{1} = (A, E, \cdot)$ , the default outcome is $w_{0}$ , but the agent and environment can handshake to produce $w_{1}$ . In $D_{1} = (B, F, ⋆)$ , there are no choices, and there's only one possible world, $w_{0}$ .

It turns out that there is a morphism $(g, h) : C_{1} \to D_{1}$ , where $g$ is the constant function $b_{0}$ and $h$ is the constant function $e_{0}$ ; and there is a second morphism $(g^{'}, h^{'}) : D_{1} \to C_{1},$ where $g^{'}$ is the constant function $a_{0}$ and $h^{'}$ is the constant function $f_{0}$ . We can interpret these like so:

There is a morphism $C_{1} \to D_{1}$ because $D_{1}$ is effectively $C_{1}$ plus a promise from the environment "I'll choose $e_{0}$ ." The agent in $D_{1}$ is "stronger" in the sense that it has fewer possible environments to worry about. There is less the environment can do to interfere with the agent's choices.
There is a morphism $D_{1} \to C_{1}$ because $C_{1}$ 's agent has strictly more options than $D_{1}$ 's agent: moving from $D_{1}$ to $C_{1}$ lets you retain the option to produce $w_{0}$ if you want, but it also lets you try for $w_{1}$ .

So we can view the smaller matrix as the larger matrix plus a promise from the environment "I'll choose $e_{0}$ ," or we can view it as the larger matrix plus a commitment from the agent "I'll choose $a_{0}$ ."

This example demonstrates that my intuitive statement "wherever there's a morphism from $C$ to $D$ , $D$ is at least as strong as $C$ " conflates two different notions of "stronger." These notions often go together, but come apart in situations such as the handshake example. Like the hypothetical student in $C_{T^{'}}$ , the agent of $D_{1}$ is "stronger" in the sense that the environment can't do as much to get in the way. But like the not-yet-precommitted student in $C_{Y}$ , the agent of $C_{1}$ is "stronger" in the sense that it has more options.
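The two morphisms in the handshake example are small enough to verify by hand, or with a few lines of code. The sketch below encodes $C_{1}$ and $D_{1}$ as dicts and checks the morphism condition for the constant maps named above; it also shows that replacing $e_{0}$ with $e_{1}$ , or $a_{0}$ with $a_{1}$ , breaks the condition. The helper `is_morphism` and the string labels are illustrative assumptions.

```python
def is_morphism(C, D, g, h):
    """True iff a . h(f) == g(a) * f for all a in Agent(C), f in Env(D)."""
    A, _, dot = C
    _, F, star = D
    return all(dot[(a, h[f])] == star[(g[a], f)] for a in A for f in F)


C1 = (("a0", "a1"), ("e0", "e1"),
      {("a0", "e0"): "w0", ("a0", "e1"): "w0",
       ("a1", "e0"): "w0", ("a1", "e1"): "w1"})
D1 = (("b0",), ("f0",), {("b0", "f0"): "w0"})

forward = ({"a0": "b0", "a1": "b0"}, {"f0": "e0"})    # C1 -> D1, h constant e0
backward = ({"b0": "a0"}, {"e0": "f0", "e1": "f0"})   # D1 -> C1, g' constant a0

print(is_morphism(C1, D1, *forward))    # the promise "I'll choose e0"
print(is_morphism(D1, C1, *backward))   # the commitment "I'll choose a0"

# The other constant choices do not give morphisms, because a1 . e1 = w1.
print(is_morphism(C1, D1, forward[0], {"f0": "e1"}))
print(is_morphism(D1, C1, {"b0": "a1"}, backward[1]))
```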

2. Self-Duality

A key property of $Chu (W)$ is that it is self-dual.

Definition: Let $-^{*} : Chu (W) \to Chu (W)^{op}$ be the functor given by $(A, E, \cdot)^{*} = (E, A, ⋆)$ , where $e ⋆ a = a \cdot e$ , and $(g, h)^{*} = (h, g)$ .

The more standard notation for dual in linear logic would be $-^{⊥}$ , but this is horrible notation.¹

Claim: $-^{*}$ is an isomorphism between $Chu (W)$ and $Chu (W)^{op}$ .

Proof: First, we show $-^{*}$ is a functor. The objects in $Chu (W)^{op}$ are the same as in $Chu (W)$ , the morphisms from $D$ to $C$ in $Chu (W)^{op}$ are the morphisms from $C$ to $D$ in $Chu (W)$ , and composition is the same, but with the order reversed. $-^{*}$ clearly preserves identity morphisms. To show that $-^{*}$ preserves composition, we have

$(g_{0}, h_{0})^{*} \circ^{op} (g_{1}, h_{1})^{*} = (h_{1}, g_{1}) \circ (h_{0}, g_{0}) = (h_{1} \circ h_{0}, g_{0} \circ g_{1}) = ((g_{0}, h_{0}) \circ (g_{1}, h_{1}))^{*} .$

To see that it is an isomorphism, we need a left and right inverse. We will abuse notation and also write $-^{*}$ for the functor from $Chu (W)^{op}$ to $Chu (W)$ given by $(E, A, ⋆)^{*} = (A, E, \cdot)$ , where $a \cdot e = e ⋆ a$ , and $(h, g)^{*} = (g, h)$ . Clearly, we have $-^{*} : Chu (W) \to Chu (W)^{op}$ and $-^{*} : Chu (W)^{op} \to Chu (W)$ composing to the identity in both orders, so $-^{*}$ is an isomorphism. $□$

Going back to our visualization of Cartesian frames as matrices, $-^{*}$ just takes the transpose of the matrix, swapping agent with environment. " $Chu (W)$ is self-dual" is another way of saying that transposing a Cartesian frame always gives you another Cartesian frame.
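In code, the transpose view is a one-liner. The sketch below assumes a frame is stored as `(agents, envs, ev)` with `ev[(a, e)]` the resulting world; `dual` swaps the two sets and flips each key, and a morphism $(g, h)$ is dualized by swapping its components. It reuses the handshake frames from §1.4 as test data; the function names are hypothetical.

```python
def dual(frame):
    """(A, E, .)* = (E, A, *) with e * a = a . e, i.e. the transposed matrix."""
    agents, envs, ev = frame
    return envs, agents, {(e, a): w for (a, e), w in ev.items()}


def dual_morphism(m):
    """(g, h)* = (h, g)."""
    g, h = m
    return h, g


def is_morphism(C, D, g, h):
    """True iff a . h(f) == g(a) * f for all a in Agent(C), f in Env(D)."""
    A, _, dot = C
    _, F, star = D
    return all(dot[(a, h[f])] == star[(g[a], f)] for a in A for f in F)


C1 = (("a0", "a1"), ("e0", "e1"),
      {("a0", "e0"): "w0", ("a0", "e1"): "w0",
       ("a1", "e0"): "w0", ("a1", "e1"): "w1"})
D1 = (("b0",), ("f0",), {("b0", "f0"): "w0"})
m = ({"a0": "b0", "a1": "b0"}, {"f0": "e0"})    # a morphism C1 -> D1

# Dualizing reverses the direction: m* is a morphism D1* -> C1*.
print(is_morphism(dual(D1), dual(C1), *dual_morphism(m)))
# Transposing twice returns the original frame.
print(dual(dual(C1)) == C1)
```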

Philosophically, depending on our interpretation, this may be doing something weird. We talk about possible agents and possible environments, but we may mean something different by "possible" in those two cases.

Since we are imagining events from the point of view of the agents, "possible agents" is referring to all of the ways the agent can choose to be by exercising its "free will." We could think of "possible environments" similarly, or we could think of possible environments as representing the agent's uncertainty.

Under the view where possible environments represent uncertainty, $-^{*}$ is pointing to an interesting duality that swaps choices with uncertainty, swaps the "could" of "I could do X" with the "could" of "The world could have property Y," and (if we add probability to the mix) swaps mixed strategies with probabilistic uncertainty. "What will I do?" becomes "What game am I playing?", or "What is the world-as-a-function-of-my-action like?"

I will introduce many operations on Cartesian frames, so it will help to highlight even the basic properties as I go. Here, I'll note:

Claim: For any Cartesian frame $C$ , $(C^{*})^{*} = C$ .

Proof: Trivial. $□$

3. Sums of Cartesian Frames

The first binary operation on Cartesian frames I want to introduce is the sum, $\oplus$ .

Definition: For Cartesian frames $C = (A, E, \cdot)$ and $D = (B, F, ⋆)$ over $W$ , $C \oplus D$ is the Cartesian frame $(A ⊔ B, E \times F, ⋄)$ , where $a ⋄ (e, f) = a \cdot e$ if $a \in A$ , and $a ⋄ (e, f) = a ⋆ f$ if $a \in B$ .

The sum takes the disjoint union of the agents and the Cartesian product of the environments, and does the obvious thing with the evaluation function. The agent can choose any strategy from $A$ or from $B$ , and the environment has to respond to that strategy. We can interpret this as an agent that can choose between two different first-person perspectives: it can decide to interact with the environment as the agent of $C$ , or as the agent of $D$ .

Maybe "Rebecca the chess player" is considering which chess opening to employ, whereas "Rebecca the food-eater" is considering putting her plate down on the chess board and having lunch instead. "Rebecca the agent that can choose between playing chess and having lunch" is the sum of the other two Rebeccas.

If Rebecca tunnel-visions on the chess game, she may not consider her other options. Likewise if she tunnel-visions on lunch. If she inhabits the perspective of the third Rebecca, she can instead decide between chess moves and decide whether she wants to be playing chess at all.

Meanwhile, the environment must use a policy that selects an option from $E$ if the agent chooses from $A$ , and selects an option from $F$ if the agent chooses from $B$ .

In the chess example: The environment must be able to respond to different chess moves, but it must also be able to respond to Rebecca deciding to play a different game.

To give a formal example, let $C_{2} = (A, E, \cdot)$ and $D_{2} = (B, F, ⋆)$ be given by the matrices

$C_{2} = \begin{matrix} & \begin{matrix} e_{0} & e_{1} \end{matrix} \\ \begin{matrix} a_{0} \\ a_{1} \end{matrix} & \left( \begin{matrix} w_{0} & w_{1} \\ w_{2} & w_{3} \end{matrix} \right) \end{matrix}$ and $D_{2} = \begin{matrix} & \begin{matrix} f_{0} & f_{1} & f_{2} \end{matrix} \\ \begin{matrix} b_{0} \\ b_{1} \\ b_{2} \end{matrix} & \left( \begin{matrix} w_{4} & w_{5} & w_{6} \\ w_{7} & w_{8} & w_{9} \\ w_{10} & w_{11} & w_{12} \end{matrix} \right) \end{matrix}$ .

Here, $C_{2} \oplus D_{2}$ is given by

$C_{2} \oplus D_{2} = \begin{matrix} & \begin{matrix} e_{0} f_{0} & e_{0} f_{1} & e_{0} f_{2} & e_{1} f_{0} & e_{1} f_{1} & e_{1} f_{2} \end{matrix} \\ \begin{matrix} a_{0} \\ a_{1} \\ b_{0} \\ b_{1} \\ b_{2} \end{matrix} & \left( \begin{matrix} w_{0} & w_{0} & w_{0} & w_{1} & w_{1} & w_{1} \\ w_{2} & w_{2} & w_{2} & w_{3} & w_{3} & w_{3} \\ w_{4} & w_{5} & w_{6} & w_{4} & w_{5} & w_{6} \\ w_{7} & w_{8} & w_{9} & w_{7} & w_{8} & w_{9} \\ w_{10} & w_{11} & w_{12} & w_{10} & w_{11} & w_{12} \end{matrix} \right) \end{matrix}$ .

If we wish to interpret $C_{2} \oplus D_{2}$ temporally, we can say: The agent first chooses what game to play. The environment then, as a function of which game was chosen, "chooses" what it does; and the agent simultaneously chooses its own move within the game it picked.
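The matrix for $C_{2} \oplus D_{2}$ can be reproduced mechanically from the definition. In this sketch the disjoint union is modeled by tagging agents with `0` or `1`, and the product of environments by pairs; the helper names `frame`, `frame_sum`, and `matrix` and the string labels are assumptions made for illustration.

```python
def frame(agents, envs, rows):
    """rows[i][j] is the world produced by agents[i] against envs[j]."""
    ev = {(a, e): rows[i][j]
          for i, a in enumerate(agents) for j, e in enumerate(envs)}
    return tuple(agents), tuple(envs), ev


def frame_sum(C, D):
    """C (+) D: disjoint union of agents, Cartesian product of environments."""
    (A, E, dot), (B, F, star) = C, D
    agents = [(0, a) for a in A] + [(1, b) for b in B]
    envs = [(e, f) for e in E for f in F]
    ev = {}
    for e, f in envs:
        for a in A:
            ev[((0, a), (e, f))] = dot[(a, e)]     # agent from C ignores f
        for b in B:
            ev[((1, b), (e, f))] = star[(b, f)]    # agent from D ignores e
    return tuple(agents), tuple(envs), ev


def matrix(fr):
    agents, envs, ev = fr
    return [[ev[(a, e)] for e in envs] for a in agents]


w = [f"w{i}" for i in range(13)]
C2 = frame(["a0", "a1"], ["e0", "e1"], [w[0:2], w[2:4]])
D2 = frame(["b0", "b1", "b2"], ["f0", "f1", "f2"], [w[4:7], w[7:10], w[10:13]])

for row in matrix(frame_sum(C2, D2)):
    print(" ".join(row))
```

Each row coming from $C_{2}$ depends only on the first coordinate of the environment pair, and each row coming from $D_{2}$ depends only on the second, which is the repeated-block pattern visible in the matrix above.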

Definition: Let $0$ be given by the Cartesian frame $0 = (\{\}, \{e\}, \cdot)$ , where $Agent (0)$ is the empty set, $Env (0) = \{e\}$ is any singleton set, and $Eval (0)$ is trivial, since it has empty domain.

Claim: $\oplus$ is commutative and associative, and $0$ is the identity of $\oplus$ (up to isomorphism).

Proof: Trivial. $□$

Returning to our interpretation of morphisms as differences in agents' strength: The agent of $C \oplus D$ can choose between being the agent from $C$ or the agent from $D$ , and so is stronger than either. Indeed, we can think of $C \oplus D$ 's agent as the weakest agent that is stronger than both $C$ 's agent and $D$ 's agent. Mathematically, this translates to $\oplus$ being the categorical coproduct in $Chu (W)$ .

Theorem: $C_{0} \oplus C_{1}$ is the coproduct of $C_{0}$ and $C_{1}$ in $Chu (W)$ , and $0$ is initial in $Chu (W)$ .

Proof: First, we show that $0$ is initial. We want to show that there exists a unique morphism from $0$ to a given $C$ . Indeed, a morphism from $0$ to $C = (A, E, \cdot)$ is a function from $\{\}$ to $A$ along with a function from $E$ to $\{e\}$ , and there is always exactly one such pair of functions, regardless of what $A$ and $E$ are. This pair of functions is a morphism because the morphism condition quantifies over $Agent (0)$ , which is empty, and so holds vacuously. Thus $0$ is initial.

Let $C_{i} = (A_{i}, E_{i}, \cdot_{i})$ , and let $C_{0} \oplus C_{1} = (A_{0} ⊔ A_{1}, E_{0} \times E_{1}, ⋄)$ . We want to show that there exist inclusion morphisms $ι_{0} : C_{0} \to C_{0} \oplus C_{1}$ and $ι_{1} : C_{1} \to C_{0} \oplus C_{1}$ such that for any Cartesian frame $D = (B, F, ⋆)$ , and any pair of morphisms $ϕ_{0} : C_{0} \to D$ and $ϕ_{1} : C_{1} \to D,$ we have that there exists a unique morphism $ϕ : C_{0} \oplus C_{1} \to D$ such that $ϕ \circ ι_{0} = ϕ_{0}$ and $ϕ \circ ι_{1} = ϕ_{1}$ .

First, we need to specify $ι_{i} : (A_{i}, E_{i}, \cdot_{i}) \to (A_{0} ⊔ A_{1}, E_{0} \times E_{1}, ⋄)$ . We let $ι_{i} = (j_{i}, k_{i})$ , where $j_{i} : A_{i} \to A_{0} ⊔ A_{1}$ is just the obvious inclusion of $A_{i}$ into $A_{0} ⊔ A_{1}$ , and $k_{i} : E_{0} \times E_{1} \to E_{i}$ is just the obvious projection. This is clearly a morphism.

Given $ϕ_{0} = (g_{0}, h_{0}) : C_{0} \to D$ and $ϕ_{1} = (g_{1}, h_{1}) : C_{1} \to D$ , we let $ϕ = (g, h)$ , where $g : A_{0} ⊔ A_{1} \to B$ is given by $g (a) = g_{i} (a)$ where $i$ is such that $a \in A_{i}$ , and $h : F \to E_{0} \times E_{1}$ is given by $h (f) = (h_{0} (f), h_{1} (f))$ . This is a morphism because for all $a \in A_{0} ⊔ A_{1}$ and $f \in F$ , we have

$a ⋄ h (f) = a \cdot_{i} h_{i} (f) = g_{i} (a) ⋆ f = g (a) ⋆ f,$

where $i$ is such that $a \in A_{i}$ . It is clear from the definitions that $ϕ \circ ι_{i} = ϕ_{i}$ .

Finally, we need to show the uniqueness of this $ϕ$ . Let $ϕ^{'} = (g^{'}, h^{'}) : C_{0} \oplus C_{1} \to D$ be a morphism such that $ϕ^{'} \circ ι_{i} = ϕ_{i}$ for both $i = 0, 1$ . This means that $g^{'} (a) = g_{i} (a)$ when $a \in A_{i}$ , so $g^{'} (a) = g (a)$ for all $a \in A_{0} ⊔ A_{1}$ . Similarly, $h^{'} (f)$ must project to $h_{0} (f)$ and $h_{1} (f)$ , so

$h^{'} (f) = (h_{0} (f), h_{1} (f)) = h (f)$

for all $f \in F$ . Thus $ϕ^{'} = ϕ$ . $□$
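The construction in this proof can be traced on a concrete case. The sketch below builds the inclusions $ι_{i}$ and the induced morphism $ϕ$ for two morphisms into the student's frame $C_{Y}$ from §1.3: the morphism from $C_{T}$ , and a morphism from a hypothetical one-option "cheat only" frame that is not in the post. All helper names are assumptions made for illustration.

```python
def frame(agents, envs, rows):
    ev = {(a, e): rows[i][j]
          for i, a in enumerate(agents) for j, e in enumerate(envs)}
    return tuple(agents), tuple(envs), ev


def is_morphism(C, D, g, h):
    A, _, dot = C
    _, F, star = D
    return all(dot[(a, h[f])] == star[(g[a], f)] for a in A for f in F)


def compose(m1, m0):
    (g1, h1), (g0, h0) = m1, m0
    return {a: g1[g0[a]] for a in g0}, {e: h0[h1[e]] for e in h1}


def frame_sum(C, D):
    (A, E, dot), (B, F, star) = C, D
    agents = [(0, a) for a in A] + [(1, b) for b in B]
    envs = [(e, f) for e in E for f in F]
    ev = {((0, a), p): dot[(a, p[0])] for a in A for p in envs}
    ev.update({((1, b), p): star[(b, p[1])] for b in B for p in envs})
    return tuple(agents), tuple(envs), ev


def injection(C0, C1, i):
    """iota_i = (j_i, k_i): tag the agent, project the environment pair."""
    j = {a: (i, a) for a in (C0, C1)[i][0]}
    k = {(e0, e1): (e0, e1)[i] for e0 in C0[1] for e1 in C1[1]}
    return j, k


def copair(phi0, phi1):
    """phi = (g, h) with g(a) = g_i(a) on A_i and h(f) = (h_0(f), h_1(f))."""
    (g0, h0), (g1, h1) = phi0, phi1
    g = {(0, a): b for a, b in g0.items()}
    g.update({(1, a): b for a, b in g1.items()})
    return g, {f: (h0[f], h1[f]) for f in h0}


C_T = frame(["a_s", "a_i"], ["e_t", "e_d", "e_o"],
            [["A-", "B+", "D-"], ["C+", "C+", "C+"]])
C_cheat = frame(["a_c"], ["e"], [["A+"]])            # hypothetical frame
C_Y = frame(["b_s", "b_i", "b_c"], ["f_t", "f_d", "f_o"],
            [["A-", "B+", "D-"], ["C+", "C+", "C+"], ["A+", "A+", "A+"]])

phi0 = ({"a_s": "b_s", "a_i": "b_i"}, {"f_t": "e_t", "f_d": "e_d", "f_o": "e_o"})
phi1 = ({"a_c": "b_c"}, {"f_t": "e", "f_d": "e", "f_o": "e"})

S = frame_sum(C_T, C_cheat)
phi = copair(phi0, phi1)
print(is_morphism(S, C_Y, *phi))
print(compose(phi, injection(C_T, C_cheat, 0)) == phi0)
print(compose(phi, injection(C_T, C_cheat, 1)) == phi1)
```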

4. Products of Cartesian Frames

Dual to sum, we have the product operation, $&$ . This operation is a product. It is also in the section on additive operations. There are many counterintuitive things about the notation of Chu spaces and linear logic.

Definition: For Cartesian frames $C = (A, E, \cdot)$ and $D = (B, F, ⋆)$ over $W$ , $C & D$ is the Cartesian frame $(A \times B, E ⊔ F, ⋄)$ , where $(a, b) ⋄ e = a \cdot e$ if $e \in E$ , and $(a, b) ⋄ e = b ⋆ e$ if $e \in F$ .

$C & D$ means that the agent might have to be the agent of $C$ , and might have to be the agent of $D$ , but does not get to decide which one. Thus, it will have to choose a pair, $(a, b)$ , where $a$ says how to behave in a $C$ situation, and $b$ says how to behave in a $D$ situation. The environment will "choose" to either be $C$ 's environment or $D$ 's environment. When the agent and environment interact, the agent uses the component of its pair that matches the environment's choice.

Instead of thinking of the agent as choosing a pair, we could again think about the situation temporally. $C & D$ is equivalent to an interaction where the environment first chooses which Cartesian frame, $C$ or $D$ , to play; then the agent observes this choice, and the agent and environment simultaneously behave as though they were in the chosen frame, either $C$ or $D$ .

(In fact, if $Image (C)$ and $Image (D)$ are disjoint, we can see this interpretation in the formalism by noting that $Image (C) \in Obs (C & D)$ —that is, the agent can change its behavior on the basis of whether the environment selected from $C$ or from $D$ .)

For example, if we let $C_{2}$ and $D_{2}$ be as in the example in §3, then $C_{2} & D_{2}$ is given by

$C_{2} & D_{2} = \begin{matrix} & \begin{matrix} e_{0} & e_{1} & f_{0} & f_{1} & f_{2} \end{matrix} \\ \begin{matrix} a_{0} b_{0} \\ a_{0} b_{1} \\ a_{0} b_{2} \\ a_{1} b_{0} \\ a_{1} b_{1} \\ a_{1} b_{2} \end{matrix} & \left( \begin{matrix} w_{0} & w_{1} & w_{4} & w_{5} & w_{6} \\ w_{0} & w_{1} & w_{7} & w_{8} & w_{9} \\ w_{0} & w_{1} & w_{10} & w_{11} & w_{12} \\ w_{2} & w_{3} & w_{4} & w_{5} & w_{6} \\ w_{2} & w_{3} & w_{7} & w_{8} & w_{9} \\ w_{2} & w_{3} & w_{10} & w_{11} & w_{12} \end{matrix} \right) \end{matrix}$ .

A second example: Suppose that we have two Cartesian frames, $C_{3}$ and $D_{3}$ . $C_{3}$ is a frame in which it's raining, and the agent chooses whether to carry an umbrella. $D_{3}$ is a frame in which it's sunny, and the agent chooses whether to carry an umbrella.

$C_{3} = \begin{matrix} & \begin{matrix} r \end{matrix} \\ \begin{matrix} u \\ n \end{matrix} & \left( \begin{matrix} u r \\ n r \end{matrix} \right) \end{matrix}$ and $D_{3} = \begin{matrix} & \begin{matrix} s \end{matrix} \\ \begin{matrix} u \\ n \end{matrix} & \left( \begin{matrix} u s \\ n s \end{matrix} \right) \end{matrix}$

It turns out that the second example we provided in "Introduction to Cartesian Frames" §3.2 (Examples of Controllables) is exactly equal to the product of these two Cartesian frames,

$\begin{matrix} & \begin{matrix} r & s \end{matrix} \\ \begin{matrix} u u = u \\ n n = n \\ u n = u \leftrightarrow r \\ n u = u \leftrightarrow s \end{matrix} & \left( \begin{matrix} u r & u s \\ n r & n s \\ u r & n s \\ n r & u s \end{matrix} \right) \end{matrix}$ .

The environment is the disjoint union of the rain and sun environments, and the policies of the agent can be viewed as "I get to choose what to do as a function of what game we're playing," where "what game we're playing" is "what the weather is."
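The umbrella product can be rebuilt from the definition of $&$ . In this sketch the agent's options are pairs (what to do if it rains, what to do if it is sunny), and the disjoint union of environments is modeled by tagging with `0` or `1`. The helper name `frame_product` and the string labels are assumptions made for illustration.

```python
def frame_product(C, D):
    """C & D: Cartesian product of agents, disjoint union of environments."""
    (A, E, dot), (B, F, star) = C, D
    agents = [(a, b) for a in A for b in B]
    envs = [(0, e) for e in E] + [(1, f) for f in F]
    ev = {}
    for a, b in agents:
        for e in E:
            ev[((a, b), (0, e))] = dot[(a, e)]     # environment picked C: use a
        for f in F:
            ev[((a, b), (1, f))] = star[(b, f)]    # environment picked D: use b
    return tuple(agents), tuple(envs), ev


C3 = (("u", "n"), ("r",), {("u", "r"): "ur", ("n", "r"): "nr"})
D3 = (("u", "n"), ("s",), {("u", "s"): "us", ("n", "s"): "ns"})

agents, envs, ev = frame_product(C3, D3)
for a in agents:
    print(a, [ev[(a, e)] for e in envs])
```

The pairs `("u", "u")` and `("n", "n")` are the constant policies $u$ and $n$ , while `("u", "n")` and `("n", "u")` are the weather-dependent policies $u \leftrightarrow r$ and $u \leftrightarrow s$ . The rows come out in a different order than in the matrix above, but they are the same four rows.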

Definition: Let $⊤$ be given by the Cartesian frame $⊤ = (\{a\}, \{\}, \cdot)$ , where $Agent (⊤)$ is a singleton, $Env (⊤)$ is the empty set, and $Eval (⊤)$ is trivial, since it has empty domain.

Claim: $&$ is commutative and associative, and $⊤$ is the identity of $&$ (up to isomorphism).

Proof: Trivial. $□$

$&$ is essentially just $\oplus$ from the point of view of the environment. Thus, since $-^{*}$ swaps agent and environment, we can express $&$ using $\oplus$ and $-^{*}$ .

Claim: $C & D = (C^{*} \oplus D^{*})^{*}$ , $⊤ = 0^{*}$ , $C \oplus D = (C^{*} & D^{*})^{*}$ , and $0 = ⊤^{*}$ .

Proof: Trivial. $□$

In other words, $\oplus$ and $&$ are De Morgan dual with respect to $-^{*}$ .
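With a suitable encoding, the De Morgan identities hold on the nose and can be checked by comparing data structures. The sketch below assumes disjoint unions are tagged with `0` and `1` and products are pairs, and tests both identities on the umbrella frames $C_{3}$ and $D_{3}$ . The helper names are hypothetical.

```python
def dual(frame):
    agents, envs, ev = frame
    return envs, agents, {(e, a): w for (a, e), w in ev.items()}


def frame_sum(C, D):
    (A, E, dot), (B, F, star) = C, D
    agents = tuple([(0, a) for a in A] + [(1, b) for b in B])
    envs = tuple((e, f) for e in E for f in F)
    ev = {((0, a), p): dot[(a, p[0])] for a in A for p in envs}
    ev.update({((1, b), p): star[(b, p[1])] for b in B for p in envs})
    return agents, envs, ev


def frame_product(C, D):
    (A, E, dot), (B, F, star) = C, D
    agents = tuple((a, b) for a in A for b in B)
    envs = tuple([(0, e) for e in E] + [(1, f) for f in F])
    ev = {(p, (0, e)): dot[(p[0], e)] for p in agents for e in E}
    ev.update({(p, (1, f)): star[(p[1], f)] for p in agents for f in F})
    return agents, envs, ev


C3 = (("u", "n"), ("r",), {("u", "r"): "ur", ("n", "r"): "nr"})
D3 = (("u", "n"), ("s",), {("u", "s"): "us", ("n", "s"): "ns"})

# C & D = (C* (+) D*)*
print(frame_product(C3, D3) == dual(frame_sum(dual(C3), dual(D3))))
# C (+) D = (C* & D*)*
print(frame_sum(C3, D3) == dual(frame_product(dual(C3), dual(D3))))
```

The equalities are literal here only because the two constructions use mirrored encodings; with a different representation of disjoint union one would expect equality up to relabeling instead.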

In the same way that we interpreted $C \oplus D$ as having the weakest agent that is stronger than the agents of $C$ and $D$ , we can interpret $C & D$ 's agent as the strongest agent that is weaker than the agents of $C$ and $D$ .

Theorem: $C_{0} & C_{1}$ is the product of $C_{0}$ and $C_{1}$ in $Chu (W)$ , and $⊤$ is terminal in $Chu (W)$ .

Proof: Since $\oplus$ is the coproduct in $Chu (W)$ , it is the product in $Chu (W)^{op}$ . Since $-^{*}$ is an isomorphism between $Chu (W)$ and $Chu (W)^{op}$ , we can take a product in $Chu (W)$ of $C_{0}$ and $C_{1}$ by sending them to $Chu (W)^{op}$ via this isomorphism, taking a product, and sending them back. Thus $(C_{0}^{*} \oplus C_{1}^{*})^{*} = C_{0} & C_{1}$ is the product in $Chu (W)$ of $C_{0}$ and $C_{1}$ .

Similarly, since $0$ is initial in $Chu (W)$ , it is terminal in $Chu (W)^{op}$ . Thus, $0^{*} = ⊤$ is terminal in $Chu (W)$ . $□$
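The initial and terminal properties can be sanity-checked by counting morphisms exhaustively on a small frame. The sketch below reuses a brute-force enumeration (a hypothetical helper named `morphisms`) and the handshake frame $C_{1}$ from §1.4; the labels `"e"` and `"a"` for the singleton elements are arbitrary.

```python
from itertools import product


def morphisms(C, D):
    """Enumerate every (g, h) with a . h(f) == g(a) * f, by exhaustive search."""
    (A, E, dot), (B, F, star) = C, D
    found = []
    for g_vals in product(B, repeat=len(A)):        # all functions g : A -> B
        g = dict(zip(A, g_vals))
        for h_vals in product(E, repeat=len(F)):    # all functions h : F -> E
            h = dict(zip(F, h_vals))
            if all(dot[(a, h[f])] == star[(g[a], f)] for a in A for f in F):
                found.append((g, h))
    return found


zero = ((), ("e",), {})     # no agents, one environment
top = (("a",), (), {})      # one agent, no environments

C1 = (("a0", "a1"), ("e0", "e1"),
      {("a0", "e0"): "w0", ("a0", "e1"): "w0",
       ("a1", "e0"): "w0", ("a1", "e1"): "w1"})

print(len(morphisms(zero, C1)))   # morphisms 0 -> C1
print(len(morphisms(C1, top)))    # morphisms C1 -> top
print(len(morphisms(C1, zero)))   # nothing maps agents into an empty set
print(len(morphisms(top, C1)))    # nothing maps environments into an empty set
```

In both of the first two cases the morphism condition is vacuous, because it quantifies over an empty set of agents or an empty set of environments.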

Our next post will discuss equivalence relations between Cartesian frames. We will introduce a homotopy equivalence on Cartesian frames, and employ these relations to classify small Cartesian frames up to homotopy.

Footnotes

1. One important reason $-^{⊥}$ is bad notation for dual is that $A^{B}$ normally represents $B \to A$ , where $\to$ is your category's internal hom functor. For Chu spaces, $\to$ is $⊸$ . Since $⊥$ will be the name for an object in our category, one would reasonably expect $C^{⊥}$ to represent $⊥ ⊸ C$ , but it doesn't. Worse still, $C^{*}$ does happen to be equivalent to $C ⊸ ⊥$ , and this will be an important fact to understand. To minimize confusion, we instead use the common notation $-^{*}$ for dual.

6 comments

comment by Rob Bensinger (RobbBB) · 2020-10-26T20:37:47.460Z · LW(p) · GW(p)

Scott's Sunday talk, covering content from this post and the Intro post: https://www.youtube.com/watch?v=H1tJdaCvcck

comment by Charlie Steiner · 2020-10-30T19:42:27.507Z · LW(p) · GW(p)

Typo in the definition of product: b cdot e should be b star e.

Replies from: Scott Garrabrant

↑ comment by Scott Garrabrant · 2020-10-30T20:58:01.368Z · LW(p) · GW(p)

Yep. Fixed. Thanks.

comment by MikkW (mikkel-wilson) · 2020-10-30T21:30:54.031Z · LW(p) · GW(p)

I like that when you sum two agents together, the resulting environment is the product of each agent's environments, and since the dual of a frame just swaps an agent with its environment, that means A&B = (A! + B!)!

(I've used ! to represent the dual, since the editor isn't giving me what I want when I type *)

comment by jollybard · 2020-10-27T09:33:21.179Z · LW(p) · GW(p)

Fantastic work!

How do we express the way that the world might be carved up into different agent-environment frames while still remaining "the same world"? The dual functor certainly works, but how about other ways to carve up the world? Suppose I notice a subagent of the environment, can I switch perspective to it?

Also, I am guessing that an "embedded" cartesian frame might be one where i.e. where the world is just the agent along with the environment. Or something. Then, since we can iterate the choice function, it could represent time steps. Though we might in fact need sequences of agents and environments. Anyway, I can't wait to see what you came up with.

comment by romeostevensit · 2020-10-26T23:53:45.487Z · LW(p) · GW(p)

The main intuition this sparks in me is that it gives us concrete data structures to look for when talking broadly about the brain doing 'compression' by rotating a high dimensional object and carving off recognized chunks (simple distributions) in order to make the messy inputs more modular, composable, accessible, error correctable, etc. Sort of the way that predictive coding gives us a target to hunt for in looking for structures that look like they might be doing something like the atomic predictive coding unit.
