Multiplicative Operations on Cartesian Frames

scott-garrabrant

Multiplicative Operations on Cartesian Frames

post by Scott Garrabrant · 2020-11-03T19:27:15.489Z · LW · GW · 24 comments

  1. Tensor
    1.1. Example
  2. Properties of Tensor
    2.1. Commutativity, Associativity, and Identity
    2.2. Biextensional Equivalence
    2.3. Distributivity
    2.4. Tensor is for Disjoint Agents
  3. Tensor is Relative to a Coarse World Model
  4. Par
  5. Lollipop
None
24 comments

This is the seventh post in the Cartesian frames [LW · GW] sequence.

Here, we introduce three new binary operations on Cartesian frames, and discuss their properties.

1. Tensor

Our first multiplicative operation is the tensor product, .

One way we can visualize our additive operations [LW · GW] from before, $\oplus$ and $&$ , is to imagine two robots (say, a mining robot $Agent (C)$ and a drilling robot $Agent (D)$ ) that have an override mode allowing an AI supervisor to take over that robot's decisions.

$C \oplus D$ represents the supervisor deciding which robot to take control of, then selecting that robot's action. (The other robot continues to run autonomously.)
$C & D$ represents something in the supervisor's environment (e.g., its human operator) deciding which robot the supervisor will take control of. Then the supervisor selects that robot's action (while the other robot runs autonomously).

$C \otimes D$ represents an AI supervisor that controls both robots simultaneously. This lets $Agent (C \otimes D)$ direct $Agent (C)$ and $Agent (D)$ to work together as a team.

Definition: Let $C = (A, E, \cdot)$ and $D = (B, F, ⋆)$ be Cartesian frames over $W$ . The tensor product of $C$ and $D$ , written $C \otimes D$ , is given by $C \otimes D = (A \times B, hom (C, D^{*}), ⋄)$ , where $hom (C, D^{*})$ is the set of morphisms $(g, h) : C \to D^{*}$ (i.e., the set of all pairs $(g : A \to F, h : B \to E)$ such that $b ⋆ g (a) = a \cdot h (b)$ for all $a \in A$ , $b \in B$ ), and $⋄$ is given by $(a, b) ⋄ (g, h) = b ⋆ g (a) = a \cdot h (b)$ .

Let us meditate for a moment on why this definition represents two agents working together on a team. The following will be very informal.

Let Alice be an agent with Cartesian frame $C = (A, E, \cdot)$ , and let Bob be an agent with Cartesian frame $D = (B, F, ⋆)$ . The team consisting of Alice and Bob should have agent $A \times B$ , since the team's choices consist of deciding what Alice does and also deciding what Bob does.

The environment is a bit more complicated. Starting from Alice, to construct the team, we want to internalize Bob's choices: instead of just being choices in $A$ 's environment, Bob's choices will now be additional options for the team $A \times B$ .

To do this, we want to first see Bob as being embedded in Alice's environment. This embedding is given by a function $h : B \to E$ , which extends each $b \in B$ to a full environment $e \in E$ . We will view Alice's possible environments as being constructed by combining a choice by Bob (that is, a $b \in B$ ) with a function from Bob's choices to possible environments ( $h : B \to E$ ). Then, we will move the $B$ part across the Cartesian boundary into the agent.

Now, the agent looks like $A \times B$ , while the environment looks like $B \to E$ . However, we must have been able to do this starting from Bob as well, so a possible environment can also be viewed as function $g : A \to F$ .

Since we should get the same world regardless of whether we think of the team as starting with Alice or with Bob, these functions $g$ and $h$ should agree with each other. This looks a bit like currying. The environment for an Alice-Bob team should be able to take in a Bob to create an environment for Alice, and it should also be able to take in an Alice to create an environment for Bob.

1.1. Example

We will illustrate this new operation using a simple formal example.

Jack, Kate, and Luke are simultaneously casting votes on whether to have a party. Each agent can vote for or against the party. The possible worlds are encoded as strings listing which people vote for the party, $W = {ε, J, K, L, JK, JL, KL, JKL}$ . Jack's perspective is given by the frame

$C_{J} = (\begin{matrix} J & JK & JL & JKL ε & K & L & KL \end{matrix})$ ,

Kate's perspective is given by the frame

$C_{K} = (\begin{matrix} K & JK & KL & JKL ε & J & L & JL \end{matrix})$ ,

and Luke's perspective is given by the frame

$C_{L} = (\begin{matrix} L & JL & KL & JKL ε & J & K & JK \end{matrix})$ .

Since Luke's environment can be thought of as the team consisting of Jack and Kate, one might expect that $C_{J} \otimes C_{K} ≅ C_{L}^{*}$ . Indeed, we will show this is the case.

Let $C_{J} = (A, E, \cdot)$ , and let $C_{K} = (B, F, ⋆)$ . We label the elements of $A$ , $E$ , $B$ , and $F$ as follows:

$C_{J} = \begin{matrix} \begin{matrix} e_{ε} & e_{K} & e_{L} & e_{K L} \end{matrix} \begin{matrix} a_{J} a_{ε} \end{matrix} & (\begin{matrix} J & JK & JL & JKL ε & K & L & KL \end{matrix}) \end{matrix}$ , and $C_{K} = \begin{matrix} \begin{matrix} f_{ε} & f_{J} & f_{L} & f_{J L} \end{matrix} \begin{matrix} b_{K} b_{ε} \end{matrix} & (\begin{matrix} K & JK & KL & JKL ε & J & L & JL \end{matrix}) \end{matrix}$ .

We will first enumerate all of the morphisms from $C_{J}$ to $C_{K}^{*}$ . A morphism $(g, h) : C_{J} \to C_{K}^{*}$ consists of a function $g : A \to F$ and a function $h : B \to E$ . There are 16 functions from $A$ to $F$ and 16 functions from $B$ to $E$ , but most of the 256 pairs do not form morphisms.

Let us break the possibilities into cases based on $g (a_{J})$ . Observe that $b_{K} ⋆ g (a_{J}) = a_{J} \cdot h (b_{K})$ : the possible worlds where (from Kate's perspective) Kate votes for the party and Jake-interfacing-with-Kate's-perspective votes for the party too, are the same as the possible worlds where (from Jake's perspective) Jake votes for the party and Kate-interfacing-with-Jake's-perspective does too. These possible worlds must have a $J$ in them, so $g (a_{J})$ must be either $f_{J}$ or $f_{J L}$ .

If $g (a_{J}) = f_{J}$ , then

\begin{matrix} a_{J} \cdot h (b_{K}) & = b_{K} ⋆ g (a_{J}) = JK, \end{matrix}

so $h (b_{K}) = e_{K}$ . Similarly,

\begin{matrix} a_{J} \cdot h (b_{ε}) & = b_{ε} ⋆ g (a_{J}) = J, \end{matrix}

so $h (b_{ε}) = e_{ε}$ , and

\begin{matrix} b_{K} ⋆ g (a_{ε}) & = a_{ε} \cdot h (b_{K}) = K, \end{matrix}

so $g (a_{ε}) = f_{ε}$ .

Similarly, if $g (a_{J}) = f_{J L}$ , then $h (b_{K}) = e_{K L}$ , $h (b_{ε}) = e_{L}$ , and $g (a_{ε}) = f_{L}$ .

Thus, there are only two candidate morphisms:

The first, which we will call $ϕ_{ε} = (g_{ε}, h_{ε})$ , is given by $g_{ε} (a_{ε}) = f_{ε}$ , $g_{ε} (a_{J}) = f_{J}$ , $h_{ε} (b_{ε}) = e_{ε}$ , and $h_{ε} (b_{K}) = e_{K}$ .
The second, $ϕ_{L} = (g_{L}, h_{L})$ , is given by $g_{L} (a_{ε}) = f_{L}$ , $g_{L} (a_{J}) = f_{J L}$ , $h_{L} (b_{ε}) = e_{L}$ , and $h_{L} (b_{K}) = e_{K L}$ .

It is easy to see that these are both indeed morphisms, by checking the definition of morphism on all four pairs in $A \times B$ .

Thus, $Env (C_{J} \otimes C_{K}) = {ϕ_{ε}, ϕ_{L}}$ , and $Agent (C_{J} \otimes C_{K}) = A \times B$ , and we can compute $Eval (C_{J} \otimes C_{K})$ from the definitions of the morphisms. The result is as follows:

$C_{J} \otimes C_{K} = \begin{matrix} \begin{matrix} ϕ_{ε} & ϕ_{L} \end{matrix} \begin{matrix} (a_{J}, b_{K}) (a_{J}, b_{ε}) (a_{ε}, b_{K}) (a_{ε}, b_{ε}) \end{matrix} & ⎛ ⎜ ⎜ ⎜ ⎝ \begin{matrix} JK & JKL J & JL K & KL ε & L \end{matrix} ⎞ ⎟ ⎟ ⎟ ⎠ \end{matrix}$ .

This is clearly $C_{L}^{*}$ , up to reordering and relabeling rows and columns.

2. Properties of Tensor

Tensor introduces a lot of categorical structure to Chu spaces, in fact giving us a star-autonomous category. This post and the ones to come will be ignoring connections to larger topics in category theory, but only because my time and my familiarity with category theory are limited, not because these connections are unimportant.

I encourage the interested reader to learn more about the structure of Chu spaces on the excellent category theory wiki nLab, beginning with their article on the Chu construction.

2.1. Commutativity, Associativity, and Identity

Claim: $\otimes$ is commutative and associative, and $1$ is the identity of $\otimes$ (up to isomorphism).

Proof: Commutativity is clear from the definition of $\otimes$ , once one unpacks the definition of $hom (C, D^{*})$ .

To see that 1 is the identity of $\otimes$ , let $C = (A, E, \cdot)$ , let $1 = ({b}, W, ⋆)$ , and let $C \otimes 1 = (A \times {b}, hom (C, 1^{*}), ⋄)$ .

Consider the isomorphism $(ι_{0}, ι_{1}) : C \to C \otimes 1$ given by $ι_{0} (a) = (a, b)$ and $ι_{1} (g, h) = h (b) .$ We need to show that $(ι_{0}, ι_{1})$ is a morphism, and that both $ι_{0}$ and $ι_{1}$ are bijective. To see that $(ι_{0}, ι_{1})$ is a morphism, observe that for all $a \in A$ and $(g, h) : C \to 1^{*}$ ,

\begin{matrix} ι_{0} (a) ⋄ (g, h) & = a \cdot h (b) = a \cdot ι_{1} (g, h) . \end{matrix}

Clearly, $ι_{0}$ is a bijection, so all that remains to show is that $ι_{1}$ is bijective.

To see that $ι_{1}$ is injective, observe that if $ι_{1} (g_{0}, h_{0}) = ι_{1} (g_{1}, h_{1})$ , then $h_{0} (b) = h_{1} (b)$ , so $h_{0} = h_{1}$ , and

\begin{matrix} g_{0} (a) & = b ⋆ g_{0} (a) = a \cdot h_{0} (b) = a \cdot h_{1} (b) = b ⋆ g_{1} (a) = g_{1} (a) \end{matrix}

for all $a \in A$ , so $g_{0} = g_{1}$ .

To see that $ι_{1}$ is surjective, observe that for every $e \in E$ , there exists a morphism $(g_{e}, h_{e}) : C \to 1^{*}$ , given by $h_{e} (b) = e$ and $g_{e} (a) = a \cdot e$ . This is clearly a morphism, since

\begin{matrix} b ⋆ g_{e} (a) & = g_{e} (a) = a \cdot e = a \cdot h_{e} (b), \end{matrix}

and $ι_{1} (g_{e}, h_{e}) = e$ .

Next, we need to show that $\otimes$ is associative, which will be much more tedious. Let $C_{i} = (A_{i}, E_{i}, \cdot)$ . Since we have already established commutativity, it suffices to show that $(C_{0} \otimes C_{1}) \otimes C_{2} ≅ (C_{0} \otimes C_{2}) \otimes C_{1}$ .

Let $D = (A_{0} \times A_{1} \times A_{2}, F, ⋆)$ , where $F$ is the set of all triples of functions $(g_{0} : A_{1} \times A_{2} \to E_{0}, g_{1} : A_{0} \times A_{2} \to E_{1}, g_{2} : A_{0} \times A_{1} \to E_{2})$ , such that for all $a_{i} \in A_{i}$ , we have

\begin{matrix} a_{0} \cdot_{0} g_{0} (a_{1}, a_{2}) & = a_{1} \cdot_{1} g_{1} (a_{0}, a_{2}) = a_{2} \cdot_{2} g_{2} (a_{0}, a_{1}), \end{matrix}

and $⋆$ is given by

\begin{matrix} (a_{0}, a_{1}, a_{2}) ⋆ (g_{0}, g_{1}, g_{2}) & = a_{0} \cdot_{0} g_{0} (a_{1}, a_{2}) = a_{1} \cdot_{1} g_{1} (a_{0}, a_{2}) = a_{2} \cdot_{2} g_{2} (a_{0}, a_{1}) . \end{matrix}

We will show that $(C_{0} \otimes C_{1}) \otimes C_{2} ≅ D$ , and since the definition of $D$ is symmetric in swapping $C_{1}$ and $C_{2}$ , it will follow that $(C_{0} \otimes C_{2}) \otimes C_{1} ≅ D$ , so $(C_{0} \otimes C_{1}) \otimes C_{2} ≅ (C_{0} \otimes C_{2}) \otimes C_{1}$ .

We construct a morphism $(ι_{0}, ι_{1})$ from $(C_{0} \otimes C_{1}) \otimes C_{2}$ to $D$ as follows. $ι_{0}$ is just the identity on $A_{0} \times A_{1} \times A_{2}$ . We will let $ι_{1} (g_{0}, g_{1}, g_{2})$ be the morphism $(g_{2}, h) : C_{0} \otimes C_{1} \to C_{2}^{*}$ , where $h : A_{2} \to hom (C_{0}, C_{1}^{*})$ is given by $h (a_{2}) = (h_{0}^{a_{2}}, h_{1}^{a_{2}}) : C_{0} \to C_{1}^{*}$ , where $h_{0}^{a_{2}} (a_{0}) = g_{1} (a_{0}, a_{2})$ , and $h_{1}^{a_{2}} (a_{1}) = g_{0} (a_{1}, a_{2})$ .

First, we need to show that $ι_{1}$ is well-defined, by showing that $h (a_{2})$ is a morphism from $C_{0}$ to $C_{1}^{*}$ , and that $(g_{2}, h)$ is a morphism from $C_{0} \otimes C_{1} \to C_{2}^{*}$ . To see that $h (a_{2}) = (h_{0}^{a_{2}}, h_{1}^{a_{2}})$ is a morphism, observe that for $a_{0} \in A_{0}$ and $a_{1} \in A_{1}$ ,

\begin{matrix} a_{1} \cdot_{1} h_{0}^{a_{2}} (a_{0}) & = a_{1} \cdot_{1} g_{1} (a_{0}, a_{2}) = a_{0} \cdot_{0} g_{0} (a_{1}, a_{2}) = a_{0} \cdot_{0} h_{1}^{a_{2}} (a_{1}) . \end{matrix}

To see that $(g_{2}, h)$ is a morphism, observe for all $(a_{0}, a_{1}) \in A_{0} \times A_{1}$ and all $a_{2} \in A_{2}$ ,

\begin{matrix} a_{2} \cdot_{2} g_{2} (a_{0}, a_{1}) & = a_{0} \cdot_{0} g_{0} (a_{1}, a_{2}) = a_{0} \cdot_{0} h_{1}^{a_{2}} (a_{1}) = (a_{0}, a_{1}) ⋄ (h_{0}^{a_{2}}, h_{1}^{a_{2}}) = (a_{0}, a_{1}) ⋄ h (a_{2}), \end{matrix}

where $⋄ = Eval (C_{0} \otimes C_{1})$ .

Now that we know $ι_{1}$ is well-defined, we need to show that $(ι_{0}, ι_{1})$ is a morphism. Indeed, for all $(a_{0}, a_{1}, a_{2}) \in A_{0}, A_{1}, A_{2}$ , and for all $(g_{0}, g_{1}, g_{2}) \in F$ , we have

\begin{matrix} ι_{0} (a_{0}, a_{1}, a_{2}) ⋆ (g_{0}, g_{1}, g_{2}) & = a_{2} \cdot_{2} g_{2} (a_{0}, a_{1}) = (a_{0}, a_{1}, a_{2}) ∙ (g_{2}, h) = (a_{0}, a_{1}, a_{2}) ∙ ι_{1} (g_{0}, g_{1}, g_{2}), \end{matrix}

where $∙ = Eval ((C_{0} \otimes C_{1}) \otimes C_{2})$ .

Finally, to show that $(ι_{0}, ι_{1})$ is an isomorphism, we need to show that $ι_{0}$ and $ι_{1}$ are bijective. $ι_{0}$ is trivial, since it is the identity, so it suffices to show that $ι_{1}$ is bijective.

To see that $ι_{1}$ is surjective, let $(g, h)$ be a morphism from $C_{0} \otimes C_{1}$ to $C_{2}^{*}$ , so $g : A_{0} \times A_{1} \to E_{2}$ , and $h : A_{2} \to hom (C_{0}, C_{1}^{*})$ . Again, let $h (a_{2}) = (h_{0}^{a_{2}}, h_{1}^{a_{2}})$ . We will define $(g_{0}, g_{1}, g_{2})$ by $g_{2} = g$ , $g_{1} (a_{0}, a_{2}) = h_{0}^{a_{2}} (a_{0})$ , and $g_{0} (a_{1}, a_{2}) = h_{1}^{a_{2}} (a_{1})$ .

We need to show that $(g_{0}, g_{1}, g_{2}) \in F$ , by showing that for all $(a_{0}, a_{1}, a_{2}) \in A_{0} \times A_{1} \times A_{2}$ , we have

\begin{matrix} a_{0} \cdot_{0} g_{0} (a_{1}, a_{2}) & = a_{1} \cdot_{1} g_{1} (a_{0}, a_{2}) = a_{2} \cdot_{2} g_{2} (a_{0}, a_{1}) . \end{matrix}

Observe that since $(g, h)$ is a morphism,

\begin{matrix} a_{2} \cdot_{2} g_{2} (a_{0}, a_{1}) & = a_{2} \cdot_{2} g (a_{0}, a_{1}) = (a_{0}, a_{1}) ⋆ h (a_{2}) = (a_{0}, a_{1}) ⋆ (h_{0}^{a_{2}}, h_{1}^{a_{2}}), \end{matrix}

where $⋆ = Eval (C_{0} \otimes C_{1})$ . Also, by the definition of $C_{0} \otimes C_{1}$ , we have that

\begin{matrix} (a_{0}, a_{1}) ⋆ (h_{0}^{a_{2}}, h_{1}^{a_{2}}) & = a_{0} \cdot_{0} h_{1}^{a_{2}} (a_{1}) = a_{0} \cdot_{0} g_{0} (a_{1}, a_{2}), \end{matrix}

and similarly

\begin{matrix} (a_{0}, a_{1}) ⋆ (h_{0}^{a_{2}}, h_{1}^{a_{2}}) & = a_{1} \cdot_{1} h_{0}^{a_{2}} (a_{1}) = a_{1} \cdot_{1} g_{1} (a_{0}, a_{2}) . \end{matrix}

Thus,

\begin{matrix} a_{0} \cdot_{0} g_{0} (a_{1}, a_{2}) & = a_{1} \cdot_{1} g_{1} (a_{0}, a_{2}) = a_{2} \cdot_{2} g_{2} (a_{0}, a_{1}), \end{matrix}

so $(g_{0}, g_{1}, g_{2}) \in F$ . Finally, observe that $ι_{1} (g_{0}, g_{1}, g_{2})$ is in fact $(g, h)$ .

To show that $ι_{1}$ is injective, assume $ι_{1} (g_{0}, g_{1}, g_{2}) = ι_{1} (g_{0}^{'}, g_{1}^{'}, g_{2}^{'}) = (g, h)$ , and given an $a_{2} \in A_{2}$ , let $h (a_{2}) = (h_{0}^{a_{2}}, h_{1}^{a_{2}})$ . Clearly, this means $g_{2} = g = g_{2}^{'}$ . Further, for all $a_{0} \in A_{0}$ , $a_{1} \in A_{1}$ , and $a_{2} \in A_{2}$ ,

\begin{matrix} g_{0} (a_{1}, a_{2}) & = h_{1}^{a_{2}} (a_{1}) = g_{0}^{'} (a_{1}, a_{2}) \end{matrix}

and

\begin{matrix} g_{1} (a_{0}, a_{2}) & = h_{0}^{a_{2}} (a_{0}) = g_{1}^{'} (a_{0}, a_{2}) . \end{matrix}

Thus $(g_{0}, g_{1}, g_{2}) = (g_{0}^{'}, g_{1}^{'}, g_{2}^{'})$ . Thus, $ι_{1}$ is bijective, so $(ι_{0}, ι_{1})$ is an isomorphism, so $(C_{0} \otimes C_{1}) \otimes C_{2} ≅ D ≅ (C_{0} \otimes C_{2}) \otimes C_{1}$ . $□$

2.2. Biextensional Equivalence

Since many of our intuitions about Cartesian frames are up to biextensional equivalence, we should verify that tensor is well-defined up to biextensional equivalence.

Claim: If $C_{0} ≃ C_{1}$ and $D_{0} ≃ D_{1}$ , then $C_{0} \otimes D_{0} ≃ C_{1} \otimes D_{1}$ .

Proof: It suffices to show that for all $D$ , $C_{0} \otimes D ≃ C_{1} \otimes D$ . Then, by commutativity of tensor,

\begin{matrix} C_{0} \otimes D_{0} & ≃ C_{0} \otimes D_{1} ≅ D_{1} \otimes C_{0} ≃ D_{1} \otimes C_{1} \equiv C_{1} \otimes D_{1} . \end{matrix}

Let $C_{i} = (A_{i}, E_{i}, \cdot_{i})$ , and let $D = (B, F, ⋆)$ . Since $C_{0} ≃ C_{1}$ , there must exist morphisms $(g_{0}, h_{0}) : C_{0} \to C_{1}$ and $(g_{1}, h_{1}) : C_{1} \to C_{0}$ such that $(g_{1} \circ g_{0}, {id}_{E_{0}}) : C_{0} \to C_{0}$ and $(g_{0} \circ g_{1}, {id}_{E_{1}}) : C_{1} \to C_{1}$ are both morphisms.

Let $C_{i} \otimes D = (A_{i} \times B, hom (C_{i}, D^{*}), ⋄_{i})$ . Consider the morphisms $(g_{i}^{'}, h_{i}^{'}) : C_{i} \otimes D \to C_{1 - i} \otimes D$ , where $g_{i}^{'} : A_{i} \times B \to A_{1 - i} \times B$ is given by $g_{i}^{'} (a, b) = (g_{i} (a), b)$ and $h_{i}^{'} : hom (C_{1 - i}, D^{*}) \to hom (C_{I}, D^{*})$ is given by $h_{i}^{'} (g, h) = (g, h) \circ (g_{i}, h_{i})$ .

To see that these are morphisms, observe that for any $(a, b) \in A_{i} \times B$ and $(g, h) : C_{1 - i} \to D^{*}$ , we have

\begin{matrix} g_{i}^{'} (a, b) ⋄_{1 - i} (g, h) & = (g_{i} (a), b) ⋄_{1 - i} (g, h) = b ⋆ g (g_{i} (a)) = b ⋆ (g \circ g_{i}) (a)) = (a, b) ⋄_{i} (g \circ g_{i}, h_{i} \circ h) = (a, b) ⋄_{i} h_{i}^{'} (g, h) . \end{matrix}

Finally, we need to show that $(g_{0}^{'}, h_{0}^{'})$ and $(g_{1}^{'}, h_{1}^{'})$ compose to something homotopic to the identity in both orders. This is equivalent to saying that $(g_{0}^{'} \circ g_{1}^{'}, {id}_{hom (C_{1}, D^{*})})$ and $(g_{1}^{'} \circ g_{0}^{'}, {id}_{hom (C_{0}, D^{*})})$ are both morphisms. Indeed, for all $(a, b) \in A_{i} \times B$ and $(g, h) : C_{i} \to D^{*}$ , since $(g_{1 - i} \circ g_{i}, {id}_{E_{i}})$ is a morphism, we have

\begin{matrix} g_{1 - i}^{'} (g_{i}^{'} (a, b)) ⋄_{i} (g, h) & = (g_{1 - i} (g_{i} (a)), b) ⋄_{i} (g, h) = g_{1 - i} (g_{i} (a)) \cdot_{i} h (b) = a \cdot_{i} h (b) = (a, b) ⋄_{i} (g, h) . \end{matrix}

$□$

2.3. Distributivity

Claim: $\otimes$ distributes over $\oplus$ , so for all Cartesian frames $C_{0}$ , $C_{1}$ , and $D$ , $(C_{0} \oplus C_{1}) \otimes D ≅ (C_{0} \otimes D) \oplus (C_{1} \otimes D)$ .

Proof: Since $\oplus$ is the categorical coproduct, there exist morphisms $ι_{0} : C_{0} \to C_{0} \oplus C_{1}$ and $ι_{1} : C_{1} \to C_{0} \oplus C_{1}$ such that for any morphisms $ϕ_{0} : C_{0} \to D^{*}$ and $ϕ_{1} : C_{1} \to D^{*}$ , there exists a unique morphism $ϕ : C_{0} \otimes C_{1} \to D^{*}$ such that $ϕ_{i} = ϕ \circ ι_{i}$ .

Let $C_{i} = (A_{i}, E_{i}, \cdot_{i})$ , and let $D = (B, F, ⋆)$ . Consider the isomorphism $(g, h) : (C_{0} \otimes D) \oplus (C_{1} \otimes D) \to (C_{0} \oplus C_{1}) \otimes D$ , where $g : (A_{0} \times B) ⊔ (A_{1} \times B) \to (A_{0} ⊔ A_{1}) \times B$ is the natural bijection that sends $(a, b)$ to $(a, b)$ , and $h : hom (C_{0} \oplus C_{1}, D^{*}) \to hom (C_{0}, D^{*}) \times hom (C_{1}, D^{*})$ is given by $h (ϕ) = (ϕ \circ ι_{0}, ϕ \circ ι_{1})$ .

Clearly, $g$ is an bijection. $h$ is also a bijection, since it is inverse to the function that sends $(ϕ_{0}, ϕ_{1})$ to the unique $ϕ$ as above. Thus, all that remains to show is that $(g, h)$ is a morphism.

Let $⋄ = Eval ((C_{0} \otimes D) \oplus (C_{1} \otimes D))$ and let $∙ = Eval ((C_{0} \oplus C_{1}) \otimes D)$ . Given $(a, b) \in (A_{0} \times B) ⊔ (A_{1} \times B)$ and $(g^{'}, h^{'}) \in hom (C_{0} \oplus C_{1}, D^{*})$ , without loss of generality, assume that $a \in A_{0}$ . Let $(g_{0}^{'}, h_{0}^{'}) = (g^{'}, h^{'}) \circ ι_{0}$ . Observe that since the function on agents in $ι_{0}$ is the inclusion of $A_{0}$ into $A_{0} ⊔ A_{1}$ , we have that $g_{0}^{'}$ is $g^{'}$ restricted to $A_{0}$ . Thus, we have

\begin{matrix} g (a, b) ∙ (g^{'}, h^{'}) & = (a, b) ∙ (g^{'}, h^{'}) = b ⋆ g^{'} (a) = b ⋆ g_{0}^{'} (a) = (a, b) ⋄ (g_{0}^{'}, h_{0}^{'}) = (a, b) ⋄ h (g^{'}, h^{'}) . \end{matrix}

$□$

2.4. Tensor is for Disjoint Agents

It doesn't really make sense to talk about $C \otimes D$ when $C$ and $D$ 's agents are the same agent, or otherwise overlap. This is because $C \otimes D$ 's agent can make choices for both $C$ and $D$ , and if $C$ and $D$ overlap, $C \otimes D$ 's agent could make choices for the intersection in two contradictory ways.

If you try to take the tensor of two frames whose agents overlap, you get a frame with an agent but no possible worlds.

Claim: If $Ensure (C) \cap Prevent (D)$ is nonempty, then $C \otimes D ≃ ⊤$ .

Proof: Let $C = (A, E, \cdot)$ , and let $D = (B, F, ⋆)$ . Consider some $S \in Ensure (C) \cap Prevent (D)$ . There is some $a \in A$ such that $a \cdot e \in S$ for all $e \in E$ , and some $b \in B$ such that $b ⋆ f \notin S$ for all $f \in F$ . First, observe that $Agent (C \otimes D)$ is nonempty, since it contains $(a, b)$ . Next, observe that $Env (C \otimes D)$ is empty, since if there were a morphism $(g, h) : C \to D^{*}$ , it would need to satisfy $b ⋆ g (a) = a \cdot h (b)$ , which is impossible since the left hand side is not in $S$ , while the right hand side is in $S$ . Thus, $C \otimes D$ has empty environment and nonempty agent, so $C \otimes D ≃ ⊤$ . $□$

Tensoring an agent with itself lets you play "both" agents, which has the neat consequence that if the agent has any control, you can have the agent make two different choices that put you in two different possible worlds, which is a contradiction. The result is that the agent has no possible worlds.

Corollary: If $Ctrl(C)$ is nonempty, then $C \otimes C ≃ ⊤$ .

Proof: Trivial. $□$

3. Tensor is Relative to a Coarse World Model

Recall that for any function $p : W \to V$ , the functor [LW · GW] $p^{\circ} : Chu (W) \to Chu (V)$ preserves sums and products, meaning that for any Cartesian frames $C$ and $D$ over $W$ , $p^{\circ} (C \oplus D) = p^{\circ} (C) \oplus p^{\circ} (D)$ and $p^{\circ} (C & D) = p^{\circ} (C) & p^{\circ} (D)$ . However, the same is not true for $\otimes$ . To see this, let's go back to the voting example above.

Let's assume that Jack, Kate, and Luke have a party if and only if a majority vote in favor, and let $V = {Y, N}$ be the two-element world that only tracks whether or not they have a party. Let $p : W \to V$ be the function such that $p (ε) = p (J) = p (K) = p (L) = N$ and $p (JK) = p (JL) = p (KL) = p (JKL) = Y$ . Then,

$p^{\circ} (C_{J}) ≅ p^{\circ} (C_{K}) ≅ (\begin{matrix} N & Y & Y & Y N & N & N & Y \end{matrix}) ≃ (\begin{matrix} N & Y & Y N & N & Y \end{matrix})$ ,

and

$p^{\circ} (C_{J} \otimes C_{K}) ≅ p^{\circ} (C_{L}^{*}) ≅ ⎛ ⎜ ⎜ ⎜ ⎝ \begin{matrix} Y & Y N & Y N & Y N & N \end{matrix} ⎞ ⎟ ⎟ ⎟ ⎠ ≃ ⎛ ⎜ ⎝ \begin{matrix} Y & Y N & Y N & N \end{matrix} ⎞ ⎟ ⎠$ ,

but

$(\begin{matrix} N & Y & Y & Y N & N & N & Y \end{matrix}) \otimes (\begin{matrix} N & Y & Y & Y N & N & N & Y \end{matrix}) / ≄ ⎛ ⎜ ⎜ ⎜ ⎝ \begin{matrix} Y & Y N & Y N & Y N & N \end{matrix} ⎞ ⎟ ⎟ ⎟ ⎠$ .

We can see that $p^{\circ} (C_{J} \otimes C_{K})$ is not equivalent to $p^{\circ} (C_{J}) \otimes p^{\circ} (C_{K})$ by observing that the latter has a constant $N$ environment while the former doesn't.

Let $p^{\circ} (C_{J}) ≅ p^{\circ} (C_{K}) ≅ (A, E, \cdot)$ , and let $e_{N} \in E$ denote the environment such that $a \cdot e_{N} = N$ for both $a \in A$ . (In the matrix representation above, this is the first column.) Observe that there exists a morphism $(g, h) : (A, E, \cdot) \to (A, E, \cdot)^{*}$ , where $g$ and $h$ are both the constant $e_{N}$ function. This is a morphism because for all $a_{0}, a_{1} \in A$ , $a_{0} \cdot h (a_{1}) = a_{1} \cdot g (a_{0}) = N$ . This gives an environment in $p^{\circ} (C_{J}) \otimes p^{\circ} (C_{K})$ , all of whose entries must be $N$ . $p^{\circ} (C_{J} \otimes C_{K})$ has no such environment, so $p^{\circ} (C_{J} \otimes C_{K})$ cannot be isomorphic to $p^{\circ} (C_{J}) \otimes p^{\circ} (C_{K})$ , or even biextensionally equivalent. Indeed:

$p^{\circ} (C_{J}) \otimes p^{\circ} (C_{K}) ≃ ⎛ ⎜ ⎜ ⎜ ⎝ \begin{matrix} N & Y & Y & Y & Y & Y N & N & N & Y & Y & Y N & N & Y & N & Y & Y N & N & N & N & N & Y \end{matrix} ⎞ ⎟ ⎟ ⎟ ⎠$ .

To see what is going on here, consider another example where Jack and Kate and Luke vote on whether to have a party, but whether or not the party happens is not just a function of the majority's vote. Instead, after the three people cast their votes, a coin is flipped:

If heads, the votes are tallied and majority wins as normal.
If tails, one of the three voters is selected at random to be dictator, and the party happens if and only if they voted in favor.

Let us work up to biextensional collapse. Let $D_{J}$ be the Cartesian frame over $V$ representing Jack's perspective. We have

$D_{J} ≃ (\begin{matrix} N & Y & Y N & N & Y \end{matrix})$ ,

where the top row represents voting for the party, and the bottom row represents voting against.

The first column represents environments where the party does not happen and Jack's vote didn't matter—either the coin came up heads and the others both voted against, or Kate or Luke became dictator and voted against. The third column similarly represents outcomes where the party happens regardless of how Jack votes. The second column represents all environments in which Jack's vote matters, so either he is dictator, or Kate and Luke's votes were split.

Similarly, let $D_{K}$ be the Cartesian frame over $V$ representing Kate's perspective,

$D_{K} ≃ (\begin{matrix} N & Y & Y N & N & Y \end{matrix})$ .

Then,

$D_{J} \otimes D_{K} ≃ ⎛ ⎜ ⎜ ⎜ ⎝ \begin{matrix} N & Y & Y & Y & Y & Y N & N & N & Y & Y & Y N & N & Y & N & Y & Y N & N & N & N & N & Y \end{matrix} ⎞ ⎟ ⎟ ⎟ ⎠$ .

The rows represent, in order: both voting in favor; Jack voting in favor but Kate voting against; Kate voting in favor but Jack voting against; and both voting against.

The columns represent, in order: Luke is dictator and votes against; majority rules and Luke votes against; Kate is dictator; Jack is dictator; majority rules and Luke votes in favor; and Luke is dictator and votes in favor.

Here, $D_{J} \otimes D_{K}$ looks more like what we would expect Jack and Kate working together on a team to look like. However, up to biextensional equivalence, $D_{J}$ and $D_{K}$ are the same as $p^{\circ} (C_{J})$ and $p^{\circ} (C_{K})$ .

When we forget the actual votes and only look at whether the party happens, then up to biextensional collapse, the Cartesian frame representing Jack's perspective no longer has any way to distinguish between the simple majority rule vote and the complicated voting system with coins and dictators.

In general, just looking at two Cartesian frames does not tell you all of the information about the relationships between the people we might be using the frames to model. The Cartesian frames over $V$ representing Jack and Kate's perspectives do not have any information that distinguishes between the two vote counting schemes.

When taking a tensor, we automatically include all of the possible ways the two agents can embed in each other's environments, even if a given embedding doesn't make sense in a given interpretation.

4. Par

Our next multiplicative operation is $⅋$ , which is pronounced "par."

Definition: Let $C = (A, E, \cdot)$ and $D = (B, F, ⋆)$ be Cartesian frames over $W$ . $C ⅋ D = (hom (C^{*}, D), E \times F, ⋄)$ , where $(g, h) ⋄ (e, f) = g (e) ⋆ f = h (f) \cdot e$ .

Claim: $⅋$ is De Morgan dual to $\otimes$ , so $C ⅋ D = (C^{*} \otimes D^{*})^{*}$ .

Proof: Trivial. $□$

$⅋$ has much less of an intuitive interpretation than $\otimes$ . One reason for this is that in order to par two agents together, they have to be large enough that each other's environments embed within them. If $C$ and $D$ are not large enough, we will have that $C ⅋ D ≃ 0$ . (I am being informal with the word "large" here.)

One way that $C$ and $D$ can fail to be large enough is if $Ensure (C^{*}) \cap Prevent (D^{*})$ is nonempty, which is dual to the above result about tensor being for disjoint agents. It is actually pretty difficult for $C$ and $D$ to be large enough. If there is any fact about the world that is determined outside of both agents, $C ⅋ D$ will be trivial.

We had a dual restriction for $\otimes$ , but it didn't get in the way nearly as often: simple intuitive examples tend to be about small agents interacting with a large environment, so it is easy to imagine two agents that are disjoint. It is much harder to imagine simple examples of two agents that cover, which (informally) is what you would have to have for $⅋$ to be nontrivial.

I expect to not use $⅋$ very often, but I am including it here for completeness.

Claim: $⅋$ is commutative and associative, and $⊥$ is the identity of $⅋$ (up to isomorphism).

Proof: Trivial from the fact that $⅋$ is De Morgan dual to $\otimes$ and $1^{*} ≅ ⊥$ . $□$

Claim: If $C_{0} ≃ C_{1}$ and $D_{0} ≃ D_{1}$ , then $C_{0} ⅋ D_{0} ≃ C_{1} ⅋ D_{1}$ .

Proof: Trivial from the fact that $⅋$ is De Morgan dual to $\otimes$ , and $≃$ is preserved by $-^{*}$ . $□$

Claim: $⅋$ distributes over $&$ , so for all Cartesian frames $C_{0}$ , $C_{1}$ , and $D$ , we have $(C_{0} & C_{1}) ⅋ D ≅ (C_{0} ⅋ D) & (C_{1} ⅋ D)$ .

Proof: Trivial from the fact that $⅋$ is De Morgan dual to $\otimes$ , and $&$ is De Morgan dual to $\oplus .$ $□$

5. Lollipop

We have one more operation to introduce, $⊸$ (pronounced "lollipop"), which is a Cartesian frame that can be thought of as representing the collection of morphisms between two Cartesian frames.

Definition: Given two Cartesian frames over $W$ , $C = (A, E, \cdot)$ and $D = (B, F, ⋆)$ , we let $C ⊸ D$ denote the Cartesian frame $C ⊸ D = (hom (C, D), A \times F, ⋄)$ , where $⋄$ is given by $(g, h) ⋄ (a, f) = g (a) ⋆ f = a \cdot h (f)$ .

One way to interpret $C ⊸ D$ is as " $D$ with a $C$ -shaped hole in it." Indeed, let us think about $Agent (C ⊸ D)$ . and $Env (C ⊸ D)$ separately.

$Agent (C ⊸ D) = hom (C, D)$ is the collection of morphisms from $C$ to $D$ . Morphisms from $C$ to $D$ are exactly interfaces through which the agent of $C$ can interact with the environment of $D$ . We can also think of this as the collection of interfaces that allow the agent of $C$ to fill the role of the agent of $D$ . This makes sense. The collection of ways that a " $D$ with a $C$ -shaped hole in it" can be is exactly the collection of interfaces that allow us to get a possible agent of $D$ from a possible agent of $C$ .

Similarly, $Env (C ⊸ D) = A \times F$ makes sense as the environment of a " $D$ with a $C$ -shaped hole in it." The environment needs to supply an environment for $D$ , and also fill in the hole with an agent for $C$ .

Previously, $C$ 's agent might have been part of $D$ 's agent; in $C ⊸ D$ , however, this part of $D$ gets moved into the environment.

Imagine a football team $D$ with one team member, $C$ , removed—the team with a football-player-shaped hole in it. Its environment, naturally, is pairs of "the kind of environment you get for a football team" and "the removed teammate".

Lollipop can be easily constructed from our other operations.

Claim: $C ⊸ D ≅ C^{*} ⅋ D ≅ (C \otimes D^{*})^{*}$ .

Proof: Trivial. $□$

Lollipop is well-defined up to biextensional equivalence.

Claim: If $C_{0} ≃ C_{1}$ and $D_{0} ≃ D_{1}$ , then $C_{0} ⊸ D_{0} ≃ C_{1} ⊸ D_{1}$ .

Proof: Trivial. $□$

Lollipop also has some identity-like properties.

Claim: For all Cartesian Frames $C$ , $C ≅ 1 ⊸ C$ and $C^{*} ≅ C ⊸ ⊥$ .

Proof: $1 ⊸ C ≅ (1 \otimes C^{*})^{*} ≅ {C^{*}}^{*} ≅ C$ and $C ⊸ ⊥ ≅ (C \otimes 1)^{*} ≅ C^{*}$ . $□$

This last result is especially interesting because we can actually think of $C ⊸ ⊥$ as an alternative definition for $C^{*}$ .

In "Tensor is Relative to a Coarse World Model" above, we noted that two agents working together might sometimes have strictly fewer possible environments than show up in the tensor. In the next post, we will introduce the concept of a sub-tensor, which allows us to represent teams that have fewer possible environments than the tensor. Similarly, sub-sum will be sum with spurious possible environments removed.

24 comments

Comments sorted by top scores.

comment by Gunnar_Zarncke · 2020-11-08T00:12:41.274Z · LW(p) · GW(p)

Lollipop intuitively seems to map well to the concept of a role. A role in a process to be filled by a compatible agent. Roles play a big role in how real-life organizations solve coordination problems. I'd like to model that and see whether insights can be gained in the framework of CF. But I have trouble abstracting the role itself. It seems with Lollipop all I can get is an agent-shaped hole in a specific frame. I'd also like to compose roles but my experiments all seem to lead to null i.e. contradictions.

comment by DanielFilan · 2020-11-04T07:55:11.676Z · LW(p) · GW(p)

Claim: For all Cartesian Frames C, C≃1⊸C and C∗≃C⊸⊥.

These should be isomorphisms, not biextensional equivalences, right? The proofs establish isomorphism.

Replies from: Scott Garrabrant

↑ comment by Scott Garrabrant · 2020-11-04T19:13:58.671Z · LW(p) · GW(p)

Yep, fixed, thanks.

comment by DanielFilan · 2020-11-03T23:19:26.360Z · LW(p) · GW(p)

It seems like there has to be some kind of relationship between lollipop and the sub-agent relation, right? Like, they're both about one 'agent' sending info to another to combine to send something to the environment. But I'm not quite sure what the relationship is going to be: presumably it's going to be that if C ◃ D, then C ⊸ D is {equal, isomorphic, biextensionally equivalent} to some special object, but IDK what that object would be.

Replies from: DanielFilan, DanielFilan

↑ comment by DanielFilan · 2020-11-04T00:58:37.858Z · LW(p) · GW(p)

So, I'd have thought there would be a morphism from C ⊸ ⊥ to (C ⊸ D) ⊗ (D ⊸ ⊥). You get a function from hom(C,⊥) to hom(C,D) x hom(D,⊥) from C being a subagent of D. But you also need a convenient function from hom(C ⊸ D, dual(D ⊸ ⊥)) to env(C). Now, dual(D ⊸ ⊥) is just D, so it's really a function from hom(C ⊸ D, D) to env(C). But I have no idea what that function would be.

Replies from: Scott Garrabrant, DanielFilan, DanielFilan

↑ comment by Scott Garrabrant · 2020-11-05T18:28:03.664Z · LW(p) · GW(p)

I believe, if and $D = null$ , then $C ◃ D$ , $C ⊸ ⊥ ≅ 0$ , $C ⊸ D ≅ null$ and $D ⊸ ⊥ ≅ null$ . Thus $(C ⊸ D) \otimes (D ⊸ ⊥) ≅ null \otimes null ≅ ⊤$ , but there is no morphism from 0 to $⊤$

Replies from: DanielFilan

↑ comment by DanielFilan · 2020-11-05T19:16:39.888Z · LW(p) · GW(p)

Reader's note: I wish there were somewhere I could look to see the definitions of 0, 1, null, top, and bottom, all in the same place.

Replies from: RobbBB

↑ comment by Rob Bensinger (RobbBB) · 2020-11-05T19:42:25.415Z · LW(p) · GW(p)

For my personal use when I was helping review Scott's drafts, I made some mnemonics (complete with silly emojis to keep track of the small Cartesian frames and operations) here: https://docs.google.com/drawings/d/1bveBk5Pta_tml_4ezJ0oWiq-qudzgnsRlfbGJgZ1qv4/.

(Also includes my crude visualizations of morphism composition and homotopy equivalence to help those concepts stick better in my brain.)

Replies from: DanielFilan

↑ comment by DanielFilan · 2020-11-05T21:01:59.472Z · LW(p) · GW(p)

Thanks!

Replies from: RobbBB

↑ comment by Rob Bensinger (RobbBB) · 2020-11-08T12:47:57.694Z · LW(p) · GW(p)

And now I've made a LW post collecting most of the definitions in the sequence so far, so they're easier to find: https://www.lesswrong.com/posts/kLLu387fiwbis3otQ/cartesian-frames-definitions [LW · GW]

↑ comment by DanielFilan · 2020-11-04T01:37:25.354Z · LW(p) · GW(p)

You can prove there's a morphism the other way, but that doesn't rely on any subagency relationship.

Replies from: DanielFilan, DanielFilan

↑ comment by DanielFilan · 2020-11-05T00:12:18.944Z · LW(p) · GW(p)

Actually, you can get a morphism from (C ⊸ D) ⊗ (D ⊸ E) to C ⊸ E for any C, D, and E.

Replies from: Scott Garrabrant

↑ comment by Scott Garrabrant · 2020-11-05T00:26:06.704Z · LW(p) · GW(p)

I haven't put time into thinking about most of your comments yet, but I'm pretty sure the answer to this is yes.

EDIT: Oh, I just realized it wasn't a question.

Replies from: DanielFilan

↑ comment by DanielFilan · 2020-11-05T00:41:53.665Z · LW(p) · GW(p)

This thread is mostly me trying to work something out and reporting the results. To the extent there's a question, it's this: if C ◃ D, is there something interesting to say about C ⊸ D?

Replies from: Scott Garrabrant

↑ comment by Scott Garrabrant · 2020-11-05T18:41:00.968Z · LW(p) · GW(p)

I am not sure, but I think that the answer is that you can't say anything interesting with just , but can maybe say interesting things with $◃_{+}$ and $◃_{\times}$ , which I am about to introduce. In the post that just went up [LW · GW], $◃_{+}$ is the relationship between one of the components and a sub-sum, and $◃_{\times}$ is the relationship between one of the components and a sub-tensor. $◃$ is the transitive closure of $◃_{+}$ and $◃_{\times}$ .

I think that if $C ◃_{+} D$ , then there is a nice morphism from $C$ to $D$ , and if $C ◃_{\times} D$ , there is a set of nice morphisms from $C$ to $D,$ but in some degenerate cases that set is empty, which is how I constructed a counter example in my other comment.

↑ comment by DanielFilan · 2020-11-04T02:25:47.254Z · LW(p) · GW(p)

And in my proof the morphism isn't bijective.

↑ comment by DanielFilan · 2020-11-04T01:12:48.082Z · LW(p) · GW(p)

I think it would have to involve some fixed point?

↑ comment by DanielFilan · 2020-11-03T23:36:40.221Z · LW(p) · GW(p)

I guess one version of this is that if C ◃ D, then for all e in the environment of C, there's some (g,h) in agent(C ⊸ D) and (a,f) in env(C ⊸ D) such that e = h(f). So for all (a,e) in agent(C) x env(C), there's (a', e') in agent(C ⊸ D) x env(C ⊸ D) so that a ⋅ e = a' ⋄ e'. Which I guess is something.

comment by Ramana Kumar (ramana-kumar) · 2020-12-17T14:49:18.012Z · LW(p) · GW(p)

Since we have already established commutativity, it suffices to show that .

For the confused reader, the argument in more detail here is:

$\begin{matrix} C_{0} \otimes (C_{1} \otimes C_{2}) ≅ (C_{1} \otimes C_{2}) \otimes C_{0} & comm ≅ (C_{1} \otimes C_{0}) \otimes C_{2} & lemma ≅ (C_{0} \otimes C_{1}) \otimes C_{2} & comm, iso \end{matrix}$

where $comm$ is commutativity of tensor, $lemma$ is the fact claimed to suffice above, and $iso$ is the implicitly assumed lemma that $C_{0} ≅ C_{1}$ and $D_{0} ≅ D_{1}$ implies $C_{0} \otimes D_{0} ≅ C_{1} \otimes D_{1}$ (this is proved later but only for $≃$ ).

comment by Ramana Kumar (ramana-kumar) · 2020-12-16T08:46:05.321Z · LW(p) · GW(p)

minor typo in the indices here:

We will show that , and since the definition of $D$ is symmetric in swapping $C_{1}$ and $C_{2}$ , it will follow that $(C_{0} \otimes C_{1}) \otimes C_{2} ≅ D$