Posts
Comments
Comment by
wuthejeff (jeff-wu) on
ProLU: A Nonlinearity for Sparse Autoencoders ·
2024-04-25T15:41:38.937Z ·
LW ·
GW
This is great! We were working on very similar things concurrently at OpenAI but ended up going a slightly different route.
A few questions:
- What does the distribution of learned biases look like?
- For the STE variant, did you find it better to use the STE approximation for the activation gradient, even though the approximation is only needed for the bias?