BenchmarkJune 19, 20261 min read
AI Models Fail to Deliver on Real-World Knowledge Work, Solving Just 3% of Tasks
A new benchmark has revealed the stark limitations of even the most advanced AI models, with the top performer solving only 3% of tasks. This raises significant concerns about the ability of AI to handle complex, real-world knowledge work, with far-reaching implications for businesses and developers.
Even the best AI model fails at realistic knowledge work, fully solving just 3 percent of tasks. The article New benchmark exposes how badly AI struggles with real knowledge work appeared first on The Decoder.