Posts

Comments

Comment by zyx zyx (zyx-zyx) on How useful is mechanistic interpretability? · 2024-01-13T15:37:59.064Z

Other approaches to alignment deserve just as much skepticism as mechanistic interpretability when subjected to the same level of scrutiny.