Posts
Open Source Automated Interpretability for Sparse Autoencoder Features
2024-07-30T21:11:36.866Z
Ophiology (or, how the Mamba architecture works)
2024-04-09T19:31:09.975Z
Comments
Comment by
SrGonao (srgonao) on
Evaluating Sparse Autoencoders with Board Game Models ·
2024-08-03T09:59:28.169Z ·
LW ·
GW
I don't know much about chess. Could it be that feature 172 that you are highlighting is related to some kind of chess opening? The distribution of black pawns could be due to different states of the opening, and the position of the black bishop and white horse could also be related to different parts of that opening?