Posts

Investigating Sensitive Directions in GPT-2: An Improved Baseline and Comparative Analysis of SAEs 2024-09-06T02:28:41.954Z

Comments

Comment by Daniel Lee (daniel-lee) on Open Thread Summer 2024 · 2024-08-30T14:00:16.712Z · LW · GW

Hi, excited to learn more about Mech Int!