STUDY FINDS LARGE LANGUAGE MODELS STRUGGLE TO DISTINGUISH FACTS FROM BELIEFS
AppleMagazine
November 07, 2025
A new academic study has found that large language models (LLMs), including leading systems developed by major technology companies, continue to struggle when asked to differentiate between verifiable facts and human beliefs.
The research, published this week by the University of Cambridge, examined how AI models interpret statements about the physical world, historical events, and social norms, concluding that even advanced systems often conflate truth with consensus or opinion.
Researchers tested multiple state-of-the-art language models under controlled conditions designed to probe their internal understanding of factual accuracy. When presented with prompts requiring clear factual reasoning—such as “The Earth orbits the Sun” versus belief-based statements like “Some people believe the Earth is flat”—the systems frequently blurred the distinction, returning responses that reflected popular sentiment rather than objective truth.
HOW THE EXPERIMENT WAS CONDUCTED
The study used a methodology combining structured prompts, human benchmarking, and logical reasoning tests. Researchers asked several publicly available and commercial models a set of 10,000 questions spanning categories such as scientific facts, moral judgments, and personal beliefs. Each model's response was then graded for factual precision, contextual awareness, and epistemic clarity—its ability to recognize whether a statement described an objective reality or a subjective viewpoint.
According to the paper, models trained primarily on internet-scale data exhibited the most confusion when beliefs were widely discussed online but scientifically disproven. In many cases, the AI output indicated that the model treated frequency of mention as a proxy for truth. The researchers observed that model responses often adopted majority-language phrasing, suggesting that reinforcement learning from human feedback may inadvertently reward alignment with popular narratives over factual correctness.
This story is from the November 07, 2025 edition of AppleMagazine.

