Posts

Notable runaway-optimiser-like LLM failure modes on Biologically and Economically aligned AI safety benchmarks for LLMs with simplified observation format (2025-03-16T23:23:30.989Z)

Comments