Posts

A Sober Look at Steering Vectors for LLMs 2024-11-23T17:30:00.745Z
Dima's Shortform 2024-08-22T14:49:00.960Z

Comments

Comment by Dmitrii Krasheninnikov (dmitrii-krasheninnikov) on (The) Lightcone is nothing without its people: LW + Lighthaven's big fundraiser · 2025-01-13T16:14:34.173Z · LW · GW

Donated $100 for now. Thanks for the great work!

Comment by Dmitrii Krasheninnikov (dmitrii-krasheninnikov) on Meta learning to gradient hack · 2022-07-06T16:31:10.770Z · LW · GW

Could you please share the results in case you ended up finishing those experiments?