Intentar ORO - Gratis

BENCHMARKS IN MEDICINE: THE PROMISE AND PITFALLS OF EVALUATING AI TOOLS WITH MISMATCHED YARDSTICKS

Southern Mail Newspaper

|

June 13, 2025

The core tension is this: medicine is not just about getting answers right. It is about getting people right. Doctors are trained to deal with doubts, handle exceptions, and recognise cultural patterns not taught in books. AI, by contrast, is only as good as the data it has seen and the questions it has been trained on

In May 2024, OpenAI released HealthBench, a new benchmarking system to test the clinical capabilities of large language models (LLMs) such as ChatGPT. On the surface, this may sound like yet another technical update. But for the medical world, it marked an important moment—a quiet acknowledgement that our current ways of evaluating medical AI are fundamentally wrong.

Headlines in the recent past have trumpeted that AI “outperforms doctors” or “aces medical exams.” The impression that’s coming through is these models are smarter, faster, and perhaps even safer. But this hype masks a deeper truth. To put it plainly, the benchmarks used to arrive at these claims are based on exams built for evaluating human memory retention from classroom teachings. They reward fact recall, not clinical judgment.

AI-driven innovations in medicine: devices, data, and diagnosis

A calculator problem

A calculator can multiply two six-digit numbers within seconds. Impressive, no doubt. But does this mean calculators are better than, and understand maths more than mathematics experts ? Or better even than an ordinary person who takes a few minutes to do the calculation with a pen and paper?

Language models are celebrated because they can churn out textbook-style answers to MCQs and fill in the blanks for medical facts and questions faster than medical professors. But the practice of medicine is not a quiz. Real doctors deal with ambiguity, emotion, and decision-making under uncertainty. They listen, observe, and adapt.

The irony is that while AI beats doctors in answering questions, it still struggles to generate the very case vignettes that form the basis of those questions. Writing a good clinical scenario from real patients in clinical practice requires understanding human suffering, filtering irrelevant details, and framing the diagnostic dilemma with context. So far, that remains a deeply human ability.

MÁS HISTORIAS DE Southern Mail Newspaper

Southern Mail Newspaper

Southern Mail Newspaper

BIHAR ELECTION 2025 RESULTS: NDA CROSSES 200-SEAT MARK IN LEADS AS MAHAGATHBANDHAN FAILS TO TAKE OFF

RJD's Tejashwi Yadav trails by over 4,000 votes in Raghopur seat; trends show the BJP is set to emerge as the single-largest party.

time to read

4 mins

November 15, 2025

Southern Mail Newspaper

Southern Mail Newspaper

HEAVY RAINFALL LIKELY OVER DELTA, COASTAL BELTS IN TAMIL NADU FOR FOUR DAYS

The prevailing weather system over the Bay of Bengal is expected to bring intense downpour over the delta and coastal districts between November 16 and November 19, signalling the return of Northeast monsoon's active phase after a lull.

time to read

2 mins

November 15, 2025

Southern Mail Newspaper

Southern Mail Newspaper

PROPOSAL TO CREATE 'GREATER MYSURU' TO BE DICUSSED IN CABINET, SAYS CM

Chief Minister Siddaramaiah said the proposal to create a 'Greater Mysuru' will be discussed in the Cabinet before it is taken forward.

time to read

1 min

November 13, 2025

Southern Mail Newspaper

Southern Mail Newspaper

RAHUL'S VOTE CHORI CAMPAIGN WILL HELP INDIA BLOC IN TAMIL NADU TOO: SELVAPERUNTHAGAI

The Special Intensive Revision of the electoral rolls in Tamil Nadu has opened a debate among members of the public on why such an exercise is being carried out by the Election Commission, says the TNCC chief

time to read

1 min

November 13, 2025

Southern Mail Newspaper

Southern Mail Newspaper

A.P. CM REAFFIRMS COMMITMENT TO MINORITY WELFARE

Assures free education to Minority girls up to Intermediate, construction of Haj buildings in Kadapa and Vijayawada, one lakh financial assistance to Haj pilgrims

time to read

2 mins

November 13, 2025

Southern Mail Newspaper

CONGRESS TO COLLECT 5 CRORE SIGNATURES TO HIGHLIGHT 'VOTE CHORI' ACROSS INDIA: MLC MANJUNATH BHANDARY

Manjunath Bhandary said Congress party leader Rahul Gandhi had held press conferences divulging details of the theft of votes during elections

time to read

1 min

November 08, 2025

Southern Mail Newspaper

Southern Mail Newspaper

SENGOTTAIYAN ACCUSES PALANISWAMI OF ALLOWING DYNASTIC POLITICS, WEAKENING AIADMK BY EXPELLING 'ANYONE WHO SPEAKS THEIR MIND'

Speaking to reporters in Gobichettipalayam, Mr. Sengottaiyan said uniting those who had \"drifted away\" from the party and bringing together committed cadres were essential to revive the ideals of M.G.R. and Jayalalithaa

time to read

2 mins

November 08, 2025

Southern Mail Newspaper

Southern Mail Newspaper

SUPREME COURT ORDERS URGENT HEARING ON NOV 11 TO TAKE UP PLEAS CHALLENGING PAN-INDIA SIR EXERCISE

The petitioners in the SIR case have questioned the absolute discretion employed by the Election Commission to conduct the revision of the electoral roll

time to read

4 mins

November 08, 2025

Southern Mail Newspaper

Southern Mail Newspaper

BIHAR ELECTIONS PHASE 1 VOTING: RJD SUPPORTERS ATTACKED MY CONVOY IN LAKHISARAI, ALLEGES DEPUTY CM VIJAY KUMAR SINHA

Over 27.5% voter turnout as of 11 a.m. in Bihar

time to read

4 mins

November 07, 2025

Southern Mail Newspaper

Southern Mail Newspaper

PMK MLA ARUL'S CAR ATTACKED IN SALEM DURING CLASH BETWEEN SUPPORTERS OF ANBUMANI AND RAMADOSS

\"They attacked the car with stones, wooden logs, and iron rods. If I had come out of the car, they would have killed me,\" the MLA claimed.

time to read

1 min

November 06, 2025

Listen

Translate

Share

-
+

Change font size