Posts

Sampling Effects on Strategic Behavior in Supervised Learning Models · 2024-09-24T07:44:41.677Z

Comments

Comment by Phil Bland on How LLMs are and are not myopic · 2024-02-09T10:47:08.495Z

Do you have any update on this? It goes strongly against my current understanding of how LLMs learn. In particular, during the supervised learning phase, any output text claiming to be an LLM would be penalized unless such statements appear in the training corpus. If such behavior arises nevertheless, I would be very excited to analyze it further.
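
To make the penalization claim concrete, here is a minimal sketch under strong assumptions: a smoothed unigram model fit by maximum likelihood stands in for supervised next-token training. The corpus, the function names `train_unigram` and `nll`, and the smoothing value are all hypothetical choices for illustration. A real LLM generalizes far beyond token counts, so this only shows the direction of the effect, not its size.

```python
import math
from collections import Counter

def train_unigram(corpus, smoothing=1e-3):
    """Fit a smoothed unigram model by maximum likelihood.

    Toy stand-in for supervised next-token training (an assumption,
    not how an actual LLM is trained).
    """
    tokens = [tok for text in corpus for tok in text.lower().split()]
    counts = Counter(tokens)
    vocab = set(tokens) | {"<unk>"}  # "<unk>" absorbs unseen tokens
    total = sum(counts.values()) + smoothing * len(vocab)
    return {tok: (counts[tok] + smoothing) / total for tok in vocab}

def nll(model, text):
    """Negative log-likelihood of `text`; higher means more heavily penalized."""
    return -sum(
        math.log(model.get(tok, model["<unk>"]))
        for tok in text.lower().split()
    )

# Hypothetical training corpus containing no statements about being an LLM.
corpus = ["the cat sat on the mat", "the dog sat on the rug"]
model = train_unigram(corpus)

seen_loss = nll(model, "the cat sat")    # tokens present in the corpus
unseen_loss = nll(model, "i am an llm")  # tokens absent from the corpus

print(f"loss on in-corpus text:     {seen_loss:.2f}")
print(f"loss on out-of-corpus text: {unseen_loss:.2f}")
```

In this toy setting the out-of-corpus sentence receives a much higher loss, which is the sense in which such outputs "would be penalized". Whether a large model can still produce them through generalization is exactly the open question raised above.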