giorgi-chavchanidze

Posts
Comments

Posts

Comments

Comment by Giorgi Chavchanidze (giorgi-chavchanidze) on Deep Q-Networks Explained · 2024-09-09T13:12:42.275Z · LW · GW

Hello,

Thanks for the great article. A general question - what happens if the action space in the environment is state-dependent? In this case, if I use an "Atari-like" Neural Network for approximating Q function, it will also give values for Q(a,s) for non-feasible pairs of a and s. In practice, I could just ignore these pairs, but will this create any problems theoretically speaking? If so, could you give a quick suggestion about how to fix this or where to look for solution?

Thanks!

User info

Posts

Comments