Posts

Conditional Importance in Toy Models of Superposition 2025-02-02T20:35:38.655Z
Thoughts on Toy Models of Superposition 2025-02-02T13:52:54.505Z
Reflections on ML4Good 2024-11-25T02:40:32.586Z

Comments

Comment by james__p on Conditional Importance in Toy Models of Superposition · 2025-02-13T13:51:22.817Z · LW · GW

Yeah I agree that with hindsight, the conclusion could be better explained and motivated from first principles, rather than by running an experiment. I wrote this post in the order in which I actually tried things as I wanted to give an honest walkthrough of the process that lead me to the conclusion, but I can appreciate that it doesn't optimise for ease to follow.