post by [deleted] · · ? · GW · 0 comments

This is a link post for


Comments sorted by top scores.

comment by Jonas Hallgren · 2024-04-02T10:04:23.500Z · LW(p) · GW(p)

This was a dig at interpretability research. I'm pro-interpretability research in general, so if you feel personally attacked by this, it wasn't meant to be too serious. Just be careful with infohazards, ok? :)