Posts

New Capabilities, New Risks? - Evaluating Agentic General Assistants using Elements of GAIA & METR Frameworks 2024-09-29T18:58:56.253Z

Comments