Posts
An interactive introduction to grokking and mechanistic interpretability
2023-08-07T19:09:19.422Z
Comments
Comment by
Adam Pearce (adam-pearce) on
An interactive introduction to grokking and mechanistic interpretability ·
2023-08-16T02:50:00.049Z ·
LW ·
GW
Lots of custom d3 https://github.com/PAIR-code/ai-explorables/tree/master/source/grokking
Comment by
Adam Pearce (adam-pearce) on
Growing Bonsai Networks with RNNs ·
2023-08-07T21:28:03.520Z ·
LW ·
GW
- The optimization section of Learning Transformer Programs might work with your task/model
- You've probably seen David Ha's work, but something like https://es-clip.github.io/ could be a good starting point for dropping backprop.
- The exotic activation function almost feels like cheating? Like I want the model the model to discover these useful structures, then try to understand them. But trying to do everything at once may be too hard.
- Incredibility minor, but changing from
onchange
tooninput
and dropping the animation will make the slider feel much slicker.