Comments

Comment by Trang Nguyen (trang-nguyen-1) on Implementing activation steering · 2024-10-01T13:53:01.338Z · LW · GW

What would be the use of the gradient of the steering vector?