Posts
Comments
What are your current AGI timelines?
Are you aware of the recent metr paper which measured AI Ability to Complete Long Tasks and found out it doubles every 7 months?
But then again, it seems like we wouldn’t be able to create accurate plots with any model, since models are inherently different, and each one has slight architectural variations. Even the 2024–2025 plot isn’t entirely accurate, as the models it includes also differ to some extent. Comparing LLMs to LRMs (Large Reasoning Models) is simply a natural step in their evolution, these models will always continue to develop.
When do you expect agents or AI systems to accelerate AI R&D by a good margin? Like 2x from where it’s now for example.
Yes they used a 50% success rate and even then some sub 10min tasks are still troublesome for LLMs as seen in the graph. But I think this will improve aswell if we make the algorithms better