post by [deleted] · · ? · GW · 0 comments

comment by Vanessa Kosoy (vanessa-kosoy) · 2024-03-04T13:06:01.266Z · LW(p) · GW(p)

 .

What is ? Also, we should allow adding some valid reward function of .

Replies from: OliverHayman
comment by OliverHayman · 2024-03-04T17:59:19.794Z · LW(p) · GW(p)

kth element of q

comment by Vanessa Kosoy (vanessa-kosoy) · 2024-03-04T12:21:57.921Z · LW(p) · GW(p)

 is a polytope with , corresponding to allowed action distributions at that state. 

I think it's mathematically cleaner to get rid of A and have those be abstract polytopes.

Replies from: OliverHayman
comment by OliverHayman · 2024-03-04T17:39:05.640Z · LW(p) · GW(p)

Sounds good