يحاول ذهب - حر
BENCHMARKS IN MEDICINE: THE PROMISE AND PITFALLS OF EVALUATING AI TOOLS WITH MISMATCHED YARDSTICKS
June 13, 2025
|Southern Mail Newspaper
The core tension is this: medicine is not just about getting answers right. It is about getting people right. Doctors are trained to deal with doubts, handle exceptions, and recognise cultural patterns not taught in books. AI, by contrast, is only as good as the data it has seen and the questions it has been trained on
-
In May 2024, OpenAI released HealthBench, a new benchmarking system to test the clinical capabilities of large language models (LLMs) such as ChatGPT. On the surface, this may sound like yet another technical update. But for the medical world, it marked an important moment—a quiet acknowledgement that our current ways of evaluating medical AI are fundamentally wrong.
Headlines in the recent past have trumpeted that AI “outperforms doctors” or “aces medical exams.” The impression that’s coming through is these models are smarter, faster, and perhaps even safer. But this hype masks a deeper truth. To put it plainly, the benchmarks used to arrive at these claims are based on exams built for evaluating human memory retention from classroom teachings. They reward fact recall, not clinical judgment.
AI-driven innovations in medicine: devices, data, and diagnosis
A calculator problem
A calculator can multiply two six-digit numbers within seconds. Impressive, no doubt. But does this mean calculators are better than, and understand maths more than mathematics experts ? Or better even than an ordinary person who takes a few minutes to do the calculation with a pen and paper?
Language models are celebrated because they can churn out textbook-style answers to MCQs and fill in the blanks for medical facts and questions faster than medical professors. But the practice of medicine is not a quiz. Real doctors deal with ambiguity, emotion, and decision-making under uncertainty. They listen, observe, and adapt.
The irony is that while AI beats doctors in answering questions, it still struggles to generate the very case vignettes that form the basis of those questions. Writing a good clinical scenario from real patients in clinical practice requires understanding human suffering, filtering irrelevant details, and framing the diagnostic dilemma with context. So far, that remains a deeply human ability.
هذه القصة من طبعة June 13, 2025 من Southern Mail Newspaper.
اشترك في Magzter GOLD للوصول إلى آلاف القصص المتميزة المنسقة، وأكثر من 9000 مجلة وصحيفة.
هل أنت مشترك بالفعل؟ تسجيل الدخول
المزيد من القصص من Southern Mail Newspaper

Southern Mail Newspaper
K.C. VENUGOPAL FLAYS KERALA GOVT FOR 'POLITICISING' LORD AYYAPPA
In a letter to Chief Minister Pinarayi Vijayan, Venugopal says organising Global Ayyappa Sangamam in the name of protecting faith was 'nothing but hypocrisy' on the part of the government
2 mins
September 20, 2025

Southern Mail Newspaper
PRIVATISATION OF MEDICAL COLLEGES: ANDHRA PRADESH LEGISLATIVE COUNCIL ADJOURNED AMID YSRCP PROTESTS
Council Chairman K. Moshen Raju urged the protesting members to maintain order, assuring them that their demand would be placed before the BAC meeting
1 min
September 20, 2025

Southern Mail Newspaper
KARNATAKA AIMS TO CREATE 1.5 LAKH JOBS IN HOSPITALITY, ATTRACT ₹8,000 CRORE IN INVESTMENT BY 2029: CM
Union Tourism and Culture Minister Gajendra Singh Shekhawat says that the Union government is committed to granting “industry status” to the hospitality sector
2 mins
September 20, 2025
Southern Mail Newspaper
VIJAY'S TIRUCHI CAMPAIGN: MADRAS HIGH COURT ORDERS FRAMING OF GUIDELINES TO PENALISE POLITICAL PARTIES FOR DAMAGE TO PUBLIC PROPERTY
Justice N. Sathish Kumar issues the interim direction while hearing a case filed by actor Vijay's TVK, alleging onerous conditions being imposed on it for the grant of permission for his campaign
2 mins
September 20, 2025

Southern Mail Newspaper
PM MODI SPEAKS TO NEPAL PM SUSHILA KARKI, SUPPORTS EFFORTS TO RESTORE PEACE
PM Modi expressed condolences and support to his Nepalese counterpart Sushila Karki, emphasising India's strong ties with Nepal
1 min
September 19, 2025
Southern Mail Newspaper
GAZA IS GASPING AND THE WORLD MUST NOT LOOK AWAY: T.N. CM STALIN
In a social media post, Mr. Stalin said he was \"shaken beyond words by what is unfolding in Gaza\" Stating that he was \"shaken beyond words by what is unfolding in Gaza,\" Tamil Nadu Chief Minister M.K. Stalin on Thursday (September 18, 2025) said Gaza city was \"gasping\" and the \"world must not look away.\"In a social media post, Mr. Stalin referred to a media report and said every visual from the war-torn region was \"gut wrenching.
1 min
September 19, 2025
Southern Mail Newspaper
GST 2.0 IS A GAMECHANGER, BOON FOR COMMON MAN: FINANCE MINISTER NIRMALA SITHARAMAN IN VISAKHAPATNAM
FM said that the GST reforms will bring ₹2 lakh crore into the hands of the common man
1 min
September 19, 2025

Southern Mail Newspaper
RAHUL GANDHI'S ALLEGATIONS ON VOTE THEFT BASELESS: ELECTION COMMISSION OF INDIA
No deletion can take place without giving an opportunity of being heard to the affected person, the ECI asserted.
1 min
September 19, 2025

Southern Mail Newspaper
KHARGE ACCUSES ECI OF 'STONE-WALLING' PROBE, BJP CALLS CONGRESS ‘FRUSTRATED'
Rahul Gandhi says targeted deletion of votes was done in Congress's stronghold through planned action.
3 mins
September 19, 2025

Southern Mail Newspaper
'OUR SOLDIERS BROUGHT PAKISTAN TO ITS KNEES': PM MODI ON OPERATION SINDOOR AT RALLY IN MADHYA PRADESH'S DHAR
Addressing a rally in Dhar district of Madhya Pradesh, Prime Minister Narendra Modi on Wednesday (September 17, 2025) said India's soldiers \"brought Pakistan to its knees in the blink of an eye\", hailing Operation Sindoor which \"obliterated terror launch pads\".
2 mins
September 18, 2025
Listen
Translate
Change font size