Paper Replication Walkthrough: Reverse-Engineering Modular Addition

post by Neel Nanda (neel-nanda-1) · 2023-03-12T13:25:46.400Z · LW · GW · 0 comments

This is a link post for https://neelnanda.io/modular-addition-walkthrough

I'm excited about trying different formats for mechanistic interpretability education! I've made a video walkthrough in which we replicate my paper, Progress Measures for Grokking via Mechanistic Interpretability. Together with Jess Smith, one of my co-authors, I recorded us coding a replication and discussing what we did at each step. This is a three-part walkthrough, and you can see the accompanying code for the walkthrough here:

This is an experiment with a new format, and I'd love to hear about how useful you find it!
