Posts
Comments
Comment by
Jiaxing Wu on
Addressing Feature Suppression in SAEs ·
2024-11-22T08:20:07.127Z ·
LW ·
GW
Hi, thanks for your work. I was wondering why we use scaling to modify the activation here rather than using an analytical solution by compensating for the −cd/2 term.