wuthejeff

Comments

Comment by wuthejeff (jeff-wu) on ProLU: A Nonlinearity for Sparse Autoencoders · 2024-04-25T15:41:38.937Z · LW · GW

This is great! We were working on very similar things concurrently at OpenAI but ended up going a slightly different route.

A few questions:
- What does the distribution of learned biases look like?
- For the STE variant, did you find it better to use the STE approximation for the activation gradient, even though the approximation is only needed for the bias?
