Posts

Takeaways from a Mechanistic Interpretability project on “Forbidden Facts” 2023-12-15T11:05:23.256Z
Update on Harvard AI Safety Team and MIT AI Alignment 2022-12-02T00:56:45.596Z

Comments