comment by Vanessa Kosoy (vanessa-kosoy) · 2024-03-04T13:06:01.266Z
.
What is ? Also, we should allow adding some valid reward function of .
Replies from: OliverHayman
↑ comment by OliverHayman · 2024-03-04T17:59:19.794Z
The k-th element of q.
comment by Vanessa Kosoy (vanessa-kosoy) · 2024-03-04T12:21:57.921Z
is a polytope with , corresponding to allowed action distributions at that state.
I think it's mathematically cleaner to get rid of A and have those be abstract polytopes.
Replies from: OliverHayman
↑ comment by OliverHayman · 2024-03-04T17:39:05.640Z
Sounds good