Essayer OR - Gratuit

AI Models Collapse in Face of Complex Problems

Hindustan Times Chandigarh

|

June 09, 2025

Just days ahead of the much-anticipated Worldwide Developer Conference (WWDC), Apple has released a study titled "The Illusion of Thinking: Understanding the Strengths and Limitations of Reasoning Models via the Lens of Problem Complexity", which saw researchers testing 'reasoning' AI models such as Anthropic's Claude, OpenAI's models, DeepSeek RL, and Google's Thinking models to see how far they can scale to replicate human reasoning.

- Vishal Mathur

NEW DELHI: Just days ahead of the much-anticipated Worldwide Developer Conference (WWDC), Apple has released a study titled "The Illusion of Thinking: Understanding the Strengths and Limitations of Reasoning Models via the Lens of Problem Complexity," which saw researchers testing 'reasoning' AI models such as Anthropic's Claude, OpenAI's models, DeepSeek RL, and Google's Thinking models to see how far they can scale to replicate human reasoning. Spoiler alert—not as much, as the entire AI marketing pitch would have you believe. Could this signal what may be in store for Apple's AI conversation ahead of the keynote?

The study questions the current standard evaluation of Large Reasoning Models (LRMs) using established mathematical and coding benchmarks, arguing they suffer from data contamination and don't reveal insights into reasoning trace structure and quality. Instead, it proposes a controlled experimental test-bed using algorithmic puzzle environments. The limitations of AI benchmarking, and need to evolve, is something we had written about earlier.

PLUS D'HISTOIRES DE Hindustan Times Chandigarh

Hindustan Times Chandigarh

Hindustan Times Chandigarh

Boxing nationals: Lovlina, Amit survive tough fights

It was a day when some of India’s top boxers were thoroughly tested.

time to read

1 mins

January 07, 2026

Hindustan Times Chandigarh

Goyal heads to Brussels for India-EU FTA talks

Union commerce and industry minister Piyush Goyal will visit Brussels this week to provide “strategic guidance” to negotiators finalising the contours of a mutually beneficial India-European Union free trade deal.

time to read

1 mins

January 07, 2026

Hindustan Times Chandigarh

SC raps ‘rich, affluent’ for using writ pleas to dodge PMLA trials

DURING THE TRIAL, THE AFFLUENT MOVE COURT, CHALLENGING THE VIRES OF LEGISLATION. FACE TRIAL LIKE ANY OTHER CITIZEN, THE CJI SAID

time to read

2 mins

January 07, 2026

Hindustan Times Chandigarh

OLYMPIAN’S DISQUALIFICATION FOR HEADBUTT SPARKS ROW

Tokyo Olympian Ashish Chaudhary's disqualification for a headbutt has sparked a refereeing controversy at the Boxing Nationals.

time to read

1 min

January 07, 2026

Hindustan Times Chandigarh

2 Hindu bizmen killed within hours in B’desh

A 40-year-old Hindu man, owner of a grocery shop, has been murdered after unidentified attackers struck him with a sharp weapon in Bangladesh's Narsingdi city, according to a local media report.

time to read

1 min

January 07, 2026

Hindustan Times Chandigarh

Hindustan Times Chandigarh

Madras HC upholds ruling allowing lighting of deepam on Madurai hilltop

BENGALURU: The Madras high court on Tuesday allowed the lighting of the Karthigai Deepam lamp on the Thiruparankundram Hills in Madurai, and pulled up the Tamil Nadu government for invoking an “imaginary ghost” of disturbance to law and order to resist a long-standing Tamil tradition for “its own convenience.”

time to read

2 mins

January 07, 2026

Hindustan Times Chandigarh

The Venezuela test for UN & international law

A long-running discussion at the core of international law has been rekindled by the recent US military strike within Venezuelan territory that resulted in President Nicolas Maduro’s arrest and transfer to New York:

time to read

3 mins

January 07, 2026

Hindustan Times Chandigarh

Looking beyond Trump’s rhetoric

Hard-headed economic considerations should drive India’s engagement with the US

time to read

2 mins

January 07, 2026

Hindustan Times Chandigarh

Hindustan Times Chandigarh

Bengal SIR: Amartya Sen gets EC notice

The Election Commission (EC) has sent a notice for hearing to Nobel Laureate Amartya Sen after logical discrepancies were detected in the enumeration form of the eminent economist, a senior official of the poll panel confirmed on Tuesday.

time to read

1 mins

January 07, 2026

Hindustan Times Chandigarh

NO RUSSIAN OIL RECEIVED IN 3 WEEKS: RELIANCE

Reliance Industries Ltd, the operator of the world’s largest single site oil refining complex and till recently India’s biggest buyer of Russian oil, on Tuesday said it has not received any Russian barrels in almost three weeks and none are expected in January.

time to read

1 min

January 07, 2026

Listen

Translate

Share

-
+

Change font size