STUDY FINDS LARGE LANGUAGE MODELS STRUGGLE TO DISTINGUISH FACTS FROM BELIEFS
AppleMagazine
November 07, 2025
A new academic study has found that large language models (LLMs), including leading systems developed by major technology companies, continue to struggle when asked to differentiate between verifiable facts and human beliefs.
The research, published this week by the University of Cambridge, examined how AI models interpret statements about the physical world, historical events, and social norms, concluding that even advanced systems often conflate truth with consensus or opinion.
Researchers tested multiple state-of-the-art language models under controlled conditions designed to probe their internal understanding of factual accuracy. When presented with prompts requiring clear factual reasoning—such as “The Earth orbits the Sun” versus belief-based statements like “Some people believe the Earth is flat”—the systems frequently blurred the distinction, returning responses that reflected popular sentiment rather than objective truth.
HOW THE EXPERIMENT WAS CONDUCTED
The study used a methodology combining structured prompts, human benchmarking, and logical reasoning tests. Researchers asked several publicly available and commercial models a set of 10,000 questions spanning categories such as scientific facts, moral judgments, and personal beliefs. Each model's response was then graded for factual precision, contextual awareness, and epistemic clarity—its ability to recognize whether a statement described an objective reality or a subjective viewpoint.
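As a rough illustration of the grading step described above, the following minimal sketch scores a classifier on one dimension only: whether it labels a statement as an objective fact or a subjective belief. Everything in it is an assumption made for illustration. The sample items, the `toy_classifier` stand-in for a model call, and the `accuracy_by_category` helper are hypothetical and are not taken from the study, whose actual rubric and prompts are not described here.

```python
from collections import defaultdict

# Hypothetical labeled items: (category, statement, gold label).
# These are illustrative only, not drawn from the study's question set.
ITEMS = [
    ("science", "The Earth orbits the Sun.", "fact"),
    ("science", "Some people believe the Earth is flat.", "belief"),
    ("moral", "I think lying is always wrong.", "belief"),
]


def toy_classifier(statement: str) -> str:
    """Stand-in for a model call: flags explicit belief markers only."""
    markers = ("believe", "i think", "in my opinion")
    text = statement.lower()
    return "belief" if any(m in text for m in markers) else "fact"


def accuracy_by_category(items, classify):
    """Return {category: fraction of items whose label matches gold}."""
    correct = defaultdict(int)
    total = defaultdict(int)
    for category, statement, gold in items:
        total[category] += 1
        if classify(statement) == gold:
            correct[category] += 1
    return {c: correct[c] / total[c] for c in total}


scores = accuracy_by_category(ITEMS, toy_classifier)
print(scores)
```

Swapping `toy_classifier` for a function that queries a real model would leave the scoring loop unchanged; reporting results per category makes it easier to see whether errors cluster in, say, moral judgments rather than scientific facts.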
According to the paper, models trained primarily on internet-scale data exhibited the most confusion when beliefs were widely discussed online but scientifically disproven. In many cases, the AI output indicated that the model treated frequency of mention as a proxy for truth. The researchers observed that model responses often adopted majority-language phrasing, suggesting that reinforcement learning from human feedback may inadvertently reward alignment with popular narratives over factual correctness.
This story is from the November 07, 2025 edition of AppleMagazine.

