Facebook Pixel STUDY FINDS LARGE LANGUAGE MODELS STRUGGLE TO DISTINGUISH FACTS FROM BELIEFS | AppleMagazine - technology - Magzter.comでこの記事を読む
Magzter GOLDで無制限に

Magzter GOLDで無制限に

10,000以上の雑誌、新聞、プレミアム記事に無制限にアクセスできます。

$149.99
 
$74.99/年

試す - 無料

STUDY FINDS LARGE LANGUAGE MODELS STRUGGLE TO DISTINGUISH FACTS FROM BELIEFS

AppleMagazine

|

November 07, 2025

A new academic study has found that large language models (LLMs), including leading systems developed by major technology companies, continue to struggle when asked to differentiate between verifiable facts and human beliefs.

STUDY FINDS LARGE LANGUAGE MODELS STRUGGLE TO DISTINGUISH FACTS FROM BELIEFS

The research, published this week by the University of Cambridge, examined how AI models interpret statements about the physical world, historical events, and social norms, concluding that even advanced systems often conflate truth with consensus or opinion.

Researchers tested multiple state-of-the-art language models under controlled conditions designed to probe their internal understanding of factual accuracy. When presented with prompts requiring clear factual reasoning—such as “The Earth orbits the Sun” versus belief-based statements like “Some people believe the Earth is flat”—the systems frequently blurred the distinction, returning responses that reflected popular sentiment rather than objective truth.

HOW THE EXPERIMENT WAS CONDUCTED

The study used a methodology combining structured prompts, human benchmarking, and logical reasoning tests. Researchers asked several publicly available and commercial models a set of 10,000 questions spanning categories such as scientific facts, moral judgments, and personal beliefs. Each model's response was then graded for factual precision, contextual awareness, and epistemic clarity—its ability to recognize whether a statement described an objective reality or a subjective viewpoint.

According to the paper, models trained primarily on internet-scale data exhibited the most confusion when beliefs are widely discussed online but scientifically disproven. In many cases, the Al output indicated that it treated frequency of mention as a proxy for truth. The researchers observed that model responses often adopted majority-language phrasing—suggesting that reinforcement learning from human feedback may inadvertently reward alignment with popular narratives over factual correctness.

AppleMagazine からのその他のストーリー

AppleMagazine

AppleMagazine

GOOGLE DEEP RESEARCH GETS ENTERPRISE DATA ACCESS

Google is expanding its autonomous research agent strategy with two new Gemini-powered tools, Deep Research and Deep Research Max, designed to search the open web, connect with private enterprise data, and generate more complete research reports through a single API workflow.

time to read

8 mins

April 24, 2026

AppleMagazine

AppleMagazine

META TURNS EMPLOYEE WORK INTO AI TRAINING DATA

Meta is beginning to collect mouse movements, clicks, keystrokes, and occasional screen snapshots from U.S.-based employees’ work computers as part of a new internal effort to train AI agents on real workplace behavior.

time to read

7 mins

April 24, 2026

AppleMagazine

AppleMagazine

FAA GROUNDS BLUE ORIGIN AFTER NEW GLENN MISHAP

The Federal Aviation Administration has ordered Blue Origin to investigate a New Glenn launch mishap after the rocket failed to place an AST SpaceMobile satellite into its planned orbit, temporarily grounding the vehicle until the company completes a formal review and corrective actions are accepted.

time to read

6 mins

April 24, 2026

AppleMagazine

AppleMagazine

AI USE RAISES COGNITIVE CONCERNS

A growing body of research is beginning to examine whether heavy reliance on generative AI can weaken the mental processes people are supposed to practice when they write, study, and solve problems.

time to read

7 mins

April 24, 2026

AppleMagazine

AppleMagazine

MAC STUDIO DELAY SHOWS APPLE'S MEMORY STRAIN

Apple's next Mac Studio may not arrive until October, as the global memory shortage begins to disrupt the company’s professional desktop roadmap.

time to read

9 mins

April 24, 2026

AppleMagazine

AppleMagazine

MUSK KEEPS CONTROL IN SPACEX IPO PLAN

SpaceX’s public IPO filing gives Wall Street a clear message before one of the largest stock offerings ever attempted: the company may be going public, but control is not being sold.

time to read

7 mins

April 24, 2026

AppleMagazine

AppleMagazine

MERCEDES C-CLASS EV GOES BIG ON SCREENS

Mercedes-Benz has revealed the new electric C-Class sedan, bringing one of its most familiar nameplates into the battery-powered era with a high-output dual-motor system, an 800-volt electrical architecture, and one of the most screen-heavy cabins in the compact luxury segment.

time to read

7 mins

April 24, 2026

AppleMagazine

AppleMagazine

EU BATTERY RULES MAY RESHAPE SMARTPHONES

The European Union is preparing to force another major hardware change across the smartphone industry, this time targeting one of the most difficult and expensive parts of modern phone ownership: the battery.

time to read

7 mins

April 24, 2026

AppleMagazine

AppleMagazine

ADOBE LAUNCHES AI SUITE FOR ENTERPRISE MARKETING

Adobe has introduced a new artificial intelligence platform for corporate clients, moving deeper into agentic AI as competition intensifies across creative software, marketing technology, and enterprise automation.

time to read

8 mins

April 24, 2026

AppleMagazine

AppleMagazine

TESLA ROBOTAXI EXPANDS ACROSS TEXAS

Tesla has expanded its Robotaxi service to Dallas and Houston, marking the company's first Texas growth beyond Austin and giving Elon Musk a broader stage for one of Tesla's most important long-term bets.

time to read

8 mins

April 24, 2026

Listen

Translate

Share

-
+

Change font size