AI Models Collapse in Face of Complex Problems
June 09, 2025
Hindustan Times Mumbai
NEW DELHI: Just days ahead of the much-anticipated Worldwide Developers Conference (WWDC), Apple has released a study titled "The Illusion of Thinking: Understanding the Strengths and Limitations of Reasoning Models via the Lens of Problem Complexity," in which researchers tested 'reasoning' AI models such as Anthropic's Claude, OpenAI's models, DeepSeek R1, and Google's Thinking models to see how far they can scale to replicate human reasoning.
Spoiler alert: not as far as the AI marketing pitch would have you believe. Could this signal what may be in store for Apple's AI conversation ahead of the keynote?
The study questions the current standard practice of evaluating Large Reasoning Models (LRMs) on established mathematical and coding benchmarks, arguing that these benchmarks suffer from data contamination and reveal little about the structure and quality of reasoning traces. Instead, it proposes a controlled experimental test-bed built on algorithmic puzzle environments. The limitations of AI benchmarking, and the need for it to evolve, are something we have written about earlier.
This story is from the June 09, 2025 edition of Hindustan Times Mumbai.
