Facebook Pixel STUDY FINDS LARGE LANGUAGE MODELS STRUGGLE TO DISTINGUISH FACTS FROM BELIEFS | AppleMagazine - technology - Read this story on Magzter.com
Go Unlimited with Magzter GOLD

Go Unlimited with Magzter GOLD

Get unlimited access to 10,000+ magazines, newspapers and Premium stories for just

$149.99
 
$74.99/Year

Try GOLD - Free

STUDY FINDS LARGE LANGUAGE MODELS STRUGGLE TO DISTINGUISH FACTS FROM BELIEFS

AppleMagazine

|

November 07, 2025

A new academic study has found that large language models (LLMs), including leading systems developed by major technology companies, continue to struggle when asked to differentiate between verifiable facts and human beliefs.

STUDY FINDS LARGE LANGUAGE MODELS STRUGGLE TO DISTINGUISH FACTS FROM BELIEFS

The research, published this week by the University of Cambridge, examined how AI models interpret statements about the physical world, historical events, and social norms, concluding that even advanced systems often conflate truth with consensus or opinion.

Researchers tested multiple state-of-the-art language models under controlled conditions designed to probe their internal understanding of factual accuracy. When presented with prompts requiring clear factual reasoning—such as “The Earth orbits the Sun” versus belief-based statements like “Some people believe the Earth is flat”—the systems frequently blurred the distinction, returning responses that reflected popular sentiment rather than objective truth.

HOW THE EXPERIMENT WAS CONDUCTED

The study used a methodology combining structured prompts, human benchmarking, and logical reasoning tests. Researchers asked several publicly available and commercial models a set of 10,000 questions spanning categories such as scientific facts, moral judgments, and personal beliefs. Each model's response was then graded for factual precision, contextual awareness, and epistemic clarity—its ability to recognize whether a statement described an objective reality or a subjective viewpoint.

According to the paper, models trained primarily on internet-scale data exhibited the most confusion when beliefs are widely discussed online but scientifically disproven. In many cases, the Al output indicated that it treated frequency of mention as a proxy for truth. The researchers observed that model responses often adopted majority-language phrasing—suggesting that reinforcement learning from human feedback may inadvertently reward alignment with popular narratives over factual correctness.

MORE STORIES FROM AppleMagazine

AppleMagazine

AppleMagazine

RIVIAN TEAMS WITH REDWOOD MATERIALS ON BATTERY STORAGE AT ILLINOIS PLANT

Rivian is extending the life of its electric-vehicle batteries in a new partnership with Redwood Materials, the battery recycling and energy company founded by Tesla co-founder JB Straubel.

time to read

4 mins

April 17, 2026

AppleMagazine

AppleMagazine

THIS IS WHY TIM COOK RECOMMENDS USING THE SMARTPHONE LESS

In an industry built on screen time, engagement metrics, and constant connectivity, Apple CEO Tim Cook has taken a notably measured stance: people should not spend all their time on their smartphones.

time to read

2 mins

April 17, 2026

AppleMagazine

AppleMagazine

NEW HUAWEI FOLDABLE ECHOES DESIGN OF APPLE'S RUMORED IPHONE FOLD

Huawei has unveiled the Pura X Max, introducing what it describes as the industry's first “wide-format” foldable smartphone — a design choice that closely resembles ongoing speculation about Apple's long-rumored iPhone Fold.

time to read

2 mins

April 17, 2026

AppleMagazine

AppleMagazine

META POISED TO SURPASS GOOGLE IN DIGITAL AD REVENUE FOR FIRST TIME, REPORT SAYS

Meta Platforms is on track to overtake Alphabet's Google in global digital advertising revenue for the first time, according to new industry forecasts.

time to read

3 mins

April 17, 2026

AppleMagazine

AppleMagazine

UBER'S ROBOTAXI BET AND WHAT IT COULD MEAN FOR DRIVERS

Uber’s decision to commit more than $10 billion to robotaxis marks one of the clearest signs yet that the company no longer sees autonomous vehicles as a side experiment.

time to read

6 mins

April 17, 2026

AppleMagazine

AppleMagazine

AMAZON STRIKES $11.57 BILLION GLOBALSTAR DEAL TO STRENGTHEN SATELLITE PUSH AGAINST STARLINK

Amazon has agreed to acquire satellite communications company Globalstar in an $11.57 billion transaction, giving the e-commerce and cloud giant a major new foothold in the increasingly strategic low-Earth-orbit connectivity race.

time to read

5 mins

April 17, 2026

AppleMagazine

AppleMagazine

PRIME VIDEO BUNDLES APPLE TV AND PEACOCK IN LIMITED-TIME OFFER

Amazon is introducing a new streaming bundle that brings together Apple TV and Peacock Premium Plus under Prime Video for a discounted monthly price.

time to read

2 mins

April 17, 2026

AppleMagazine

AppleMagazine

OPENAI LAUNCHES GPT-5.4-CYBER FOR VETTED SECURITY PROFESSIONALS

OpenAI has introduced GPT-5.4-Cyber, a specialized version of its GPT-5.4 model built for defensive cybersecurity work, while also significantly expanding its Trusted Access for Cyber program to reach thousands of verified individual defenders and hundreds of security teams.

time to read

5 mins

April 17, 2026

AppleMagazine

AppleMagazine

WHY MORE AMERICANS ARE TURNING TO AI FOR HEALTH ADVICE

A growing number of Americans are beginning to treat artificial intelligence as a first stop for health questions, even as most still place greater trust in doctors, nurses, and other licensed professionals.

time to read

5 mins

April 17, 2026

AppleMagazine

AppleMagazine

WDC26

WHAT APPLE IS EXPECTED TO REVEAL DURING ITS ANNUAL DEVELOPER EVENT

time to read

4 mins

April 17, 2026

Listen

Translate

Share

-
+

Change font size