Comments

Comment by arvganesh on MDPs and the Bellman Equation, Intuitively Explained · 2023-02-16T20:07:22.263Z · LW · GW

The expected reward starting from Berkeley should be 22.75; I think the arithmetic in the post is incorrect. Thanks!