Passez à l'illimité avec Magzter GOLD

Passez à l'illimité avec Magzter GOLD

Obtenez un accès illimité à plus de 9 000 magazines, journaux et articles Premium pour seulement

$149.99
 
$74.99/Année

Essayer OR - Gratuit

BENCHMARKS IN MEDICINE: THE PROMISE AND PITFALLS OF EVALUATING AI TOOLS WITH MISMATCHED YARDSTICKS

Southern Mail Newspaper

|

June 13, 2025

The core tension is this: medicine is not just about getting answers right. It is about getting people right. Doctors are trained to deal with doubts, handle exceptions, and recognise cultural patterns not taught in books. AI, by contrast, is only as good as the data it has seen and the questions it has been trained on

In May 2024, OpenAI released HealthBench, a new benchmarking system to test the clinical capabilities of large language models (LLMs) such as ChatGPT. On the surface, this may sound like yet another technical update. But for the medical world, it marked an important moment—a quiet acknowledgement that our current ways of evaluating medical AI are fundamentally wrong.

Headlines in the recent past have trumpeted that AI “outperforms doctors” or “aces medical exams.” The impression that’s coming through is these models are smarter, faster, and perhaps even safer. But this hype masks a deeper truth. To put it plainly, the benchmarks used to arrive at these claims are based on exams built for evaluating human memory retention from classroom teachings. They reward fact recall, not clinical judgment.

AI-driven innovations in medicine: devices, data, and diagnosis

A calculator problem

A calculator can multiply two six-digit numbers within seconds. Impressive, no doubt. But does this mean calculators are better than, and understand maths more than mathematics experts ? Or better even than an ordinary person who takes a few minutes to do the calculation with a pen and paper?

Language models are celebrated because they can churn out textbook-style answers to MCQs and fill in the blanks for medical facts and questions faster than medical professors. But the practice of medicine is not a quiz. Real doctors deal with ambiguity, emotion, and decision-making under uncertainty. They listen, observe, and adapt.

The irony is that while AI beats doctors in answering questions, it still struggles to generate the very case vignettes that form the basis of those questions. Writing a good clinical scenario from real patients in clinical practice requires understanding human suffering, filtering irrelevant details, and framing the diagnostic dilemma with context. So far, that remains a deeply human ability.

PLUS D'HISTOIRES DE Southern Mail Newspaper

Southern Mail Newspaper

Southern Mail Newspaper

K.C. VENUGOPAL FLAYS KERALA GOVT FOR 'POLITICISING' LORD AYYAPPA

In a letter to Chief Minister Pinarayi Vijayan, Venugopal says organising Global Ayyappa Sangamam in the name of protecting faith was 'nothing but hypocrisy' on the part of the government

time to read

2 mins

September 20, 2025

Southern Mail Newspaper

Southern Mail Newspaper

PRIVATISATION OF MEDICAL COLLEGES: ANDHRA PRADESH LEGISLATIVE COUNCIL ADJOURNED AMID YSRCP PROTESTS

Council Chairman K. Moshen Raju urged the protesting members to maintain order, assuring them that their demand would be placed before the BAC meeting

time to read

1 min

September 20, 2025

Southern Mail Newspaper

Southern Mail Newspaper

KARNATAKA AIMS TO CREATE 1.5 LAKH JOBS IN HOSPITALITY, ATTRACT ₹8,000 CRORE IN INVESTMENT BY 2029: CM

Union Tourism and Culture Minister Gajendra Singh Shekhawat says that the Union government is committed to granting “industry status” to the hospitality sector

time to read

2 mins

September 20, 2025

Southern Mail Newspaper

VIJAY'S TIRUCHI CAMPAIGN: MADRAS HIGH COURT ORDERS FRAMING OF GUIDELINES TO PENALISE POLITICAL PARTIES FOR DAMAGE TO PUBLIC PROPERTY

Justice N. Sathish Kumar issues the interim direction while hearing a case filed by actor Vijay's TVK, alleging onerous conditions being imposed on it for the grant of permission for his campaign

time to read

2 mins

September 20, 2025

Southern Mail Newspaper

Southern Mail Newspaper

PM MODI SPEAKS TO NEPAL PM SUSHILA KARKI, SUPPORTS EFFORTS TO RESTORE PEACE

PM Modi expressed condolences and support to his Nepalese counterpart Sushila Karki, emphasising India's strong ties with Nepal

time to read

1 min

September 19, 2025

Southern Mail Newspaper

GAZA IS GASPING AND THE WORLD MUST NOT LOOK AWAY: T.N. CM STALIN

In a social media post, Mr. Stalin said he was \"shaken beyond words by what is unfolding in Gaza\" Stating that he was \"shaken beyond words by what is unfolding in Gaza,\" Tamil Nadu Chief Minister M.K. Stalin on Thursday (September 18, 2025) said Gaza city was \"gasping\" and the \"world must not look away.\"In a social media post, Mr. Stalin referred to a media report and said every visual from the war-torn region was \"gut wrenching.

time to read

1 min

September 19, 2025

Southern Mail Newspaper

GST 2.0 IS A GAMECHANGER, BOON FOR COMMON MAN: FINANCE MINISTER NIRMALA SITHARAMAN IN VISAKHAPATNAM

FM said that the GST reforms will bring ₹2 lakh crore into the hands of the common man

time to read

1 min

September 19, 2025

Southern Mail Newspaper

Southern Mail Newspaper

RAHUL GANDHI'S ALLEGATIONS ON VOTE THEFT BASELESS: ELECTION COMMISSION OF INDIA

No deletion can take place without giving an opportunity of being heard to the affected person, the ECI asserted.

time to read

1 min

September 19, 2025

Southern Mail Newspaper

Southern Mail Newspaper

KHARGE ACCUSES ECI OF 'STONE-WALLING' PROBE, BJP CALLS CONGRESS ‘FRUSTRATED'

Rahul Gandhi says targeted deletion of votes was done in Congress's stronghold through planned action.

time to read

3 mins

September 19, 2025

Southern Mail Newspaper

Southern Mail Newspaper

'OUR SOLDIERS BROUGHT PAKISTAN TO ITS KNEES': PM MODI ON OPERATION SINDOOR AT RALLY IN MADHYA PRADESH'S DHAR

Addressing a rally in Dhar district of Madhya Pradesh, Prime Minister Narendra Modi on Wednesday (September 17, 2025) said India's soldiers \"brought Pakistan to its knees in the blink of an eye\", hailing Operation Sindoor which \"obliterated terror launch pads\".

time to read

2 mins

September 18, 2025

Listen

Translate

Share

-
+

Change font size