About

The Arc Prize Foundation says its new ARC-AGI-2 test stumps most AI models; humans get 60% of the questions right but GPT-4.5 and Claude 3.7 Sonnet score ~1% (Maxwell Zeff/TechCrunch)

The Arc Prize Foundation says its new ARC-AGI-2 test stumps most AI models; humans get 60% of the questions right but GPT-4.5 and Claude 3.7 Sonnet score ~1% (Maxwell Zeff/TechCrunch) The Arc Prize Foundation says its new ARC-AGI-2 test stumps most AI models; humans get 60% of the questions right but GPT-4.5 and Claude 3.7 Sonnet score ~1% (Maxwell Zeff/TechCrunch) Reviewed by swadu on March 25, 2025 Rating: 5

No comments:

Powered by Blogger.