Posts

Comments

Comment by Jiaxing Wu on Addressing Feature Suppression in SAEs · 2024-11-22T08:20:07.127Z · LW · GW

Hi, thanks for your work. I was wondering why we use scaling to modify the activation here rather than using an analytical solution by compensating for the −cd/2 term.