Comment by arvganesh on MDPs and the Bellman Equation, Intuitively Explained · 2023-02-16T20:07:22.263Z · LW · GW
I think the expected reward starting from Berkeley should be 22.75; the arithmetic looks incorrect. Thanks!