Magzter GOLDで無制限に

Magzter GOLDで無制限に

10,000以上の雑誌、新聞、プレミアム記事に無制限にアクセスできます。

$149.99
 
$74.99/年
The Perfect Holiday Gift Gift Now

AI Models Collapse in Face of Complex Problems

Hindustan Times Bengaluru

|

June 09, 2025

Just days ahead of the much-anticipated Worldwide Developer Conference (WWDC), Apple has released a study titled "The Illusion of Thinking: Understanding the Strengths and Limitations of Reasoning Models via the Lens of Problem Complexity", which saw researchers testing 'reasoning' AI models such as Anthropic's Claude, OpenAI's models, DeepSeek RL, and Google's Thinking models to see how far they can scale to replicate human reasoning.

- Vishal Mathur

NEW DELHI: Just days ahead of the much-anticipated Worldwide Developer Conference (WWDC), Apple has released a study titled "The Illusion of Thinking: Understanding the Strengths and Limitations of Reasoning Models via the Lens of Problem Complexity," which saw researchers testing 'reasoning' AI models such as Anthropic's Claude, OpenAI's models, DeepSeek RL, and Google's Thinking models to see how far they can scale to replicate human reasoning. Spoiler alert—not as much, as the entire AI marketing pitch would have you believe. Could this signal what may be in store for Apple's AI conversation ahead of the keynote?

The study questions the current standard evaluation of Large Reasoning Models (LRMs) using established mathematical and coding benchmarks, arguing they suffer from data contamination and don't reveal insights into reasoning trace structure and quality. Instead, it proposes a controlled experimental test-bed using algorithmic puzzle environments. The limitations of AI benchmarking, and need to evolve, is something we had written about earlier.

Hindustan Times Bengaluru からのその他のストーリー

Hindustan Times Bengaluru

Hindustan Times Bengaluru

'Winning WPL, World Cup made me more positive'

{ RENUKA SINGH THAKUR } INDIA BOWLER

time to read

2 mins

December 19, 2025

Hindustan Times Bengaluru

A chance for Surya to finish South Africa series on a high

A good knock in the fifth and final T201 will soothe the nerves of team management and the Indian fans

time to read

2 mins

December 19, 2025

Hindustan Times Bengaluru

India’s top-performing AI stock faces scrutiny after 55,000% surge

The world’s best-performing stock is turning into a cautionary tale for investors chasing outsized returns from the artificial intelligence boom.

time to read

2 mins

December 19, 2025

Hindustan Times Bengaluru

Female celebs raise concern over Al misuse

The rampant exploitation of AI-generated images on social media has targeted several female celebrities in recent times, and many of them have now come out on Instagram to voice their concerns over the serious matter.

time to read

1 min

December 19, 2025

Hindustan Times Bengaluru

Hindustan Times Bengaluru

Adani Group’s internal project manager to raise $1 billion

A private company owned by billionaire Gautam Adani and his family has been entrusted to oversee the infrastructure projects of all listed firms of the Adani Group as part of the tycoon’s plans to capture margins that would otherwise have gone to external parties

time to read

2 mins

December 19, 2025

Hindustan Times Bengaluru

26 Indians killed while serving in Russian military: Centre in RS

Twenty-six Indian nationals were killed while serving in the Russian armed forces and seven have been reported missing, while efforts are on to ensure the early discharge of 50 more, the government informed Parliament on Thursday.

time to read

1 mins

December 19, 2025

Hindustan Times Bengaluru

Hindustan Times Bengaluru

Cash-rich Airtel pares debt as Vodafone Idea borrows to stay afloat

Two of India’s biggest private telecom operators—Bharti Airtel and Vodafone Idea (Vi)—are looking to shore up their finances and fund network investments from vastly different starting points, pursuing sharply different strategies of equity-led deleveraging and debt-ied survival, respectively.

time to read

1 mins

December 19, 2025

Hindustan Times Bengaluru

OSCARS TO MOVE OFF BROADCAST TV TO YOUTUBE STARTING 2029

The annual Academy Awards telecast will move from the ABC broadcast network to stream live on YouTube around the world starting in 2029, organisers said on Wednesday.

time to read

1 min

December 19, 2025

Hindustan Times Bengaluru

India inks Oman FTA, gains duty-free boost

For India's exports, the deal presents untapped potential across key sectors

time to read

1 min

December 19, 2025

Hindustan Times Bengaluru

SAHITYA AKADEMI AWARD DEFERMENT TRIGGERS FURORE

A decision by the Sahitya Akademi to defer the announcement of its annual prestigious awards on Thursday sparked a controversy, with the Opposition alleging interference by the government and the Union culture ministry saying as per a prior memorandum of understanding, prior approval was required.

time to read

1 min

December 19, 2025

Listen

Translate

Share

-
+

Change font size

Holiday offer front
Holiday offer back