Home / Techmeme / The Arc Prize Foundation says its new ARC-AGI-2 test stumps most AI models; humans get 60% of the questions right but GPT-4.5 and Claude 3.7 Sonnet score ~1% (Maxwell Zeff/TechCrunch)

The Arc Prize Foundation says its new ARC-AGI-2 test stumps most AI models; humans get 60% of the questions right but GPT-4.5 and Claude 3.7 Sonnet score ~1% (Maxwell Zeff/TechCrunch)

March 25, 2025 Techmeme

Maxwell Zeff / TechCrunch:
The Arc Prize Foundation says its new ARC-AGI-2 test stumps most AI models; humans get 60% of the questions right but GPT-4.5 and Claude 3.7 Sonnet score ~1% — The Arc Prize Foundation, a nonprofit co-founded by prominent AI researcher François Chollet, announced in a blog post …

from Techmeme https://ift.tt/PyjSqwX

Reviewed by swadu on March 25, 2025 Rating: 5

No comments:

Subscribe to: Post Comments ( Atom )

Coding Tech

About

The Arc Prize Foundation says its new ARC-AGI-2 test stumps most AI models; humans get 60% of the questions right but GPT-4.5 and Claude 3.7 Sonnet score ~1% (Maxwell Zeff/TechCrunch)

No comments:

Recent Posts

Facebook

Blog Archive

Ads

Popular Posts

Categories

Report Abuse

About Me

Blog Archive

Search This Blog

Labels

Recent Post

Featured

Food

Random Posts

Tags

Recent Posts