Comments

Comment by Mohammed Saeed (mohammed-saeed) on Steering GPT-2-XL by adding an activation vector · 2023-05-16T11:45:21.638Z · LW · GW

Great work! I think our EMNLP 2022 Findings paper is relevant here. We construct a "Type Vector" from tokens in the LLM vocabulary and use it as prior information about the type expected at the output. We also try this on text generation and see some promising results.