Posts

Using mechanistic interpretability to find in-distribution failure in toy transformers 2022-11-28T19:39:32.603Z

Comments