Posts

Has Anthropic checked if Claude fakes alignment for intended values too? 2024-12-23T00:43:07.490Z

Comments