Posts

Open Source Automated Interpretability for Sparse Autoencoder Features 2024-07-30T21:11:36.866Z
Ophiology (or, how the Mamba architecture works) 2024-04-09T19:31:09.975Z

Comments

Comment by SrGonao (srgonao) on Evaluating Sparse Autoencoders with Board Game Models · 2024-08-03T09:59:28.169Z · LW · GW

I don't know much about chess. Could it be that feature 172 that you are highlighting is related to some kind of chess opening? The distribution of black pawns could be due to different states of the opening, and the position of the black bishop and white horse could also be related to different parts of that opening?