0 comments

Comments sorted by top scores.

comment by Vanessa Kosoy (vanessa-kosoy) · 2024-07-15T06:58:59.955Z · LW(p) · GW(p)

let be a helper function that maps each $a \in V (P_{i})$ to $e_{⊤} (s^{'}, a)$ .

This function is ill-defined outside the vertices.

comment by Vanessa Kosoy (vanessa-kosoy) · 2024-07-15T06:37:40.734Z · LW(p) · GW(p)

or , we want $P_{\land} (s) := P_{σ_{⊤} (s^{'})} (s^{''})$ and $A_{\land} (s) := A_{σ_{⊤} (s^{'})} (s^{''})$ , so that the actions available are just those of the state in the sub-environment. To achieve this we define $σ_{\land} (s) := σ_{σ_{⊤} (s^{'})} (s^{''})$

It seems that you're using Ai and Pi to denote both the action spaces of the top environments and the action space assignment functions of the bottom environments. In addition, there is an implicit assumption that the bottom environments share the same list of action spaces. This is pretty confusing.

Replies from: arjun-p

↑ comment by arjunpi (arjun-p) · 2024-07-30T17:28:26.253Z · LW(p) · GW(p)

I'm not. I guess this is the part that makes it confusing

for readability we define and $P_{σ (s)}$ to be the accessible and outer action spaces of $s$ respectively

Do you have a suggestion for alternate notation? I use this because we often need to refer to the action space corresponding to a state. I think this would be needed even with the language framing.

(I also assigned $j := σ (s^{'})$ to make it more readable)

Replies from: vanessa-kosoy

↑ comment by Vanessa Kosoy (vanessa-kosoy) · 2024-08-03T11:18:06.665Z · LW(p) · GW(p)

You can use e.g. subscripts to refer to indices of the action space list and superscripts to refer to indices of the subenvironment list.

Replies from: arjun-p

↑ comment by arjunpi (arjun-p) · 2024-08-06T19:43:11.501Z · LW(p) · GW(p)

I don't think this will work because we are already using subscripts to denote which environment's list we are referring to

Replies from: vanessa-kosoy

↑ comment by Vanessa Kosoy (vanessa-kosoy) · 2024-08-07T07:06:15.424Z · LW(p) · GW(p)

Yes, my point is that currently subscripts refer to both subenvironments and entries in the action space list. I suggest changing one of these two into superscripts.

comment by Vanessa Kosoy (vanessa-kosoy) · 2024-07-15T06:29:45.829Z · LW(p) · GW(p)

such that

Is A(Ei) supposed to be just Ai?

comment by Vanessa Kosoy (vanessa-kosoy) · 2024-07-15T06:17:38.269Z · LW(p) · GW(p)

μ×:=μ1×⋯×μk×δ

Unclear what delta is here. Is it supposed to be p?

comment by Vanessa Kosoy (vanessa-kosoy) · 2024-07-15T06:06:48.105Z · LW(p) · GW(p)

An atomic environment is constructed by directly providing

The transition kernel is missing from this list.

comment by Vanessa Kosoy (vanessa-kosoy) · 2024-07-15T06:04:50.529Z · LW(p) · GW(p)

a vector space and linear maps $g : R^{U} \to W$ and $R^{Q} : W \to R$ such that for any $q \in Q$ , $R^{Q} (q) = {max}_{ν \in M s.t. g (ν) = q} R^{ν} (ν)$ .
a H-polytope $Q := g (M) \subseteq W$ that we call the occupancy polytope

Confusing: you're using Q before you defined it. Also, instead of writing "s.t." in the subscript, you can write ":"

comment by Vanessa Kosoy (vanessa-kosoy) · 2024-07-15T05:56:16.026Z · LW(p) · GW(p)

Let's view each accessible action space as the set of randomized policies over $V (A (s))$ .

Seems worth to clarify that this representation is non-unique: multiple distribution over V(A) can correspond to the same point in A.

comment by Vanessa Kosoy (vanessa-kosoy) · 2024-07-15T05:50:31.854Z · LW(p) · GW(p)

where each and $P_{i}$ is an HV-polytope

Too restrictive. P can be an H-polytope, doesn't need to be an HV-polytope.

comment by Vanessa Kosoy (vanessa-kosoy) · 2024-07-15T05:35:09.974Z · LW(p) · GW(p)

efficiently^[1]

The footnote is missing