Posts
Comments
Comment by
lukaemon on
Actually, Othello-GPT Has A Linear Emergent World Representation ·
2024-08-07T15:57:04.042Z ·
LW ·
GW
In hindsight, I should have trained on layer 6, which is the point where the board state is fully computed and starts to really be used.
You mean layer 4?