Essayer OR - Gratuit
AI Models Collapse in Face of Complex Problems
Hindustan Times Chandigarh
|June 09, 2025
Just days ahead of the much-anticipated Worldwide Developer Conference (WWDC), Apple has released a study titled "The Illusion of Thinking: Understanding the Strengths and Limitations of Reasoning Models via the Lens of Problem Complexity", which saw researchers testing 'reasoning' AI models such as Anthropic's Claude, OpenAI's models, DeepSeek RL, and Google's Thinking models to see how far they can scale to replicate human reasoning.
NEW DELHI: Just days ahead of the much-anticipated Worldwide Developer Conference (WWDC), Apple has released a study titled "The Illusion of Thinking: Understanding the Strengths and Limitations of Reasoning Models via the Lens of Problem Complexity," which saw researchers testing 'reasoning' AI models such as Anthropic's Claude, OpenAI's models, DeepSeek RL, and Google's Thinking models to see how far they can scale to replicate human reasoning. Spoiler alert—not as much, as the entire AI marketing pitch would have you believe. Could this signal what may be in store for Apple's AI conversation ahead of the keynote?
The study questions the current standard evaluation of Large Reasoning Models (LRMs) using established mathematical and coding benchmarks, arguing they suffer from data contamination and don't reveal insights into reasoning trace structure and quality. Instead, it proposes a controlled experimental test-bed using algorithmic puzzle environments. The limitations of AI benchmarking, and need to evolve, is something we had written about earlier.
Cette histoire est tirée de l'édition June 09, 2025 de Hindustan Times Chandigarh.
Abonnez-vous à Magzter GOLD pour accéder à des milliers d'histoires premium sélectionnées et à plus de 9 000 magazines et journaux.
Déjà abonné ? Se connecter
PLUS D'HISTOIRES DE Hindustan Times Chandigarh
Hindustan Times Chandigarh
Boxing nationals: Lovlina, Amit survive tough fights
It was a day when some of India’s top boxers were thoroughly tested.
1 mins
January 07, 2026
Hindustan Times Chandigarh
Goyal heads to Brussels for India-EU FTA talks
Union commerce and industry minister Piyush Goyal will visit Brussels this week to provide “strategic guidance” to negotiators finalising the contours of a mutually beneficial India-European Union free trade deal.
1 mins
January 07, 2026
Hindustan Times Chandigarh
SC raps ‘rich, affluent’ for using writ pleas to dodge PMLA trials
DURING THE TRIAL, THE AFFLUENT MOVE COURT, CHALLENGING THE VIRES OF LEGISLATION. FACE TRIAL LIKE ANY OTHER CITIZEN, THE CJI SAID
2 mins
January 07, 2026
Hindustan Times Chandigarh
OLYMPIAN’S DISQUALIFICATION FOR HEADBUTT SPARKS ROW
Tokyo Olympian Ashish Chaudhary's disqualification for a headbutt has sparked a refereeing controversy at the Boxing Nationals.
1 min
January 07, 2026
Hindustan Times Chandigarh
2 Hindu bizmen killed within hours in B’desh
A 40-year-old Hindu man, owner of a grocery shop, has been murdered after unidentified attackers struck him with a sharp weapon in Bangladesh's Narsingdi city, according to a local media report.
1 min
January 07, 2026
Hindustan Times Chandigarh
Madras HC upholds ruling allowing lighting of deepam on Madurai hilltop
BENGALURU: The Madras high court on Tuesday allowed the lighting of the Karthigai Deepam lamp on the Thiruparankundram Hills in Madurai, and pulled up the Tamil Nadu government for invoking an “imaginary ghost” of disturbance to law and order to resist a long-standing Tamil tradition for “its own convenience.”
2 mins
January 07, 2026
Hindustan Times Chandigarh
The Venezuela test for UN & international law
A long-running discussion at the core of international law has been rekindled by the recent US military strike within Venezuelan territory that resulted in President Nicolas Maduro’s arrest and transfer to New York:
3 mins
January 07, 2026
Hindustan Times Chandigarh
Looking beyond Trump’s rhetoric
Hard-headed economic considerations should drive India’s engagement with the US
2 mins
January 07, 2026
Hindustan Times Chandigarh
Bengal SIR: Amartya Sen gets EC notice
The Election Commission (EC) has sent a notice for hearing to Nobel Laureate Amartya Sen after logical discrepancies were detected in the enumeration form of the eminent economist, a senior official of the poll panel confirmed on Tuesday.
1 mins
January 07, 2026
Hindustan Times Chandigarh
NO RUSSIAN OIL RECEIVED IN 3 WEEKS: RELIANCE
Reliance Industries Ltd, the operator of the world’s largest single site oil refining complex and till recently India’s biggest buyer of Russian oil, on Tuesday said it has not received any Russian barrels in almost three weeks and none are expected in January.
1 min
January 07, 2026
Listen
Translate
Change font size
