Posts

Adam Optimizer Causes Privileged Basis in Transformer LM Residual Stream 2024-09-06T17:55:34.265Z

Comments

Comment by Diego Caples (diego-caples) on Adam Optimizer Causes Privileged Basis in Transformer LM Residual Stream · 2024-09-08T23:20:51.635Z · LW · GW

If we were to start training with Adam and later switch to SGD, I would guess that the privileged basis would persist.

There is no mechanism in SGD that opposes solutions with basis-aligned features; SGD is simply agnostic to the choice of directions for features in the residual stream. Because there are infinitely many possible directions for features to point, the reason an SGD-trained model does not have a privileged basis is simply that it is exceedingly unlikely to be randomly initialized into one.

On the other hand, Adam collects statistics with respect to each basis dimension, making basis dimensions different from other directions. Somehow, this causes model features to align with basis dimensions.
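
To make the contrast concrete, here is a minimal NumPy sketch (not code from the post) that checks whether a single update step commutes with a rotation of the basis. The function names `sgd_step` and `adam_first_step` are hypothetical, the gradient is just a random vector standing in for a residual-stream gradient, and the Adam step is the standard bias-corrected update from zero-initialized moments, without weight decay. If an optimizer is agnostic to direction, rotating the gradient and then stepping should give the same result as stepping and then rotating.

```python
import numpy as np

def sgd_step(grad, lr=0.1):
    # Plain SGD: the update is a scalar multiple of the gradient.
    return -lr * grad

def adam_first_step(grad, lr=0.1, beta1=0.9, beta2=0.999, eps=1e-8):
    # First Adam step from zero-initialized moments (assumed setup).
    m = (1 - beta1) * grad          # first moment, tracked per coordinate
    v = (1 - beta2) * grad ** 2     # second moment, tracked per coordinate
    m_hat = m / (1 - beta1)         # bias correction
    v_hat = v / (1 - beta2)
    return -lr * m_hat / (np.sqrt(v_hat) + eps)

rng = np.random.default_rng(0)
d = 8
grad = rng.normal(size=d)

# Random orthogonal matrix: a change of basis for the d-dimensional space.
Q, _ = np.linalg.qr(rng.normal(size=(d, d)))

# Gap between "rotate, then step" and "step, then rotate".
sgd_gap = np.linalg.norm(sgd_step(Q @ grad) - Q @ sgd_step(grad))
adam_gap = np.linalg.norm(adam_first_step(Q @ grad) - Q @ adam_first_step(grad))

print(f"SGD gap:  {sgd_gap:.2e}")
print(f"Adam gap: {adam_gap:.2e}")
```

Under these assumptions the SGD gap should be zero up to floating-point error, while the Adam gap should not be, because the element-wise division by `sqrt(v_hat)` depends on which directions are the coordinate axes. This only shows that Adam treats the basis dimensions specially; it does not by itself explain why features end up aligned with them.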