Posts
Conditional Importance in Toy Models of Superposition
2025-02-02T20:35:38.655Z
Thoughts on Toy Models of Superposition
2025-02-02T13:52:54.505Z
Reflections on ML4Good
2024-11-25T02:40:32.586Z
Comments
Comment by
james__p on
Conditional Importance in Toy Models of Superposition ·
2025-02-13T13:51:22.817Z ·
LW ·
GW
Yeah I agree that with hindsight, the conclusion could be better explained and motivated from first principles, rather than by running an experiment. I wrote this post in the order in which I actually tried things as I wanted to give an honest walkthrough of the process that lead me to the conclusion, but I can appreciate that it doesn't optimise for ease to follow.