DeepSeek's hidden warning for AI safety

Time | February 24, 2025

The release of DeepSeek R1 stunned Wall Street and Silicon Valley in January, spooking investors and impressing tech leaders.

- BY BILLY PERRIGO

But amid all the talk, many overlooked a critical detail about the way the new Chinese artificial intelligence model functions, a nuance that has researchers worried about humanity's ability to control sophisticated new AI systems.

It's all down to an innovation in how DeepSeek R1 was trained, one that led to surprising behaviors in an early version of the model, which researchers described in the technical documentation accompanying its release.

During testing, researchers noticed that the model would spontaneously switch between English and Chinese while it was solving problems. When they forced it to stick to one language, thus making it easier for users to follow along, they found that the system's ability to solve the same problems would diminish.

That finding rang alarm bells for some AI-safety researchers. Currently, the most capable AI systems "think" in human-legible languages, writing out their reasoning before coming to a conclusion. That has been a boon for safety teams, whose most effective guardrails involve monitoring models' so-called chains of thought for signs of dangerous behaviors. But DeepSeek's results raised the possibility of a decoupling on the horizon: one where new AI capabilities might be gained from freeing models of the constraints of human language altogether.

To be sure, DeepSeek's language switching is not by itself cause for alarm. Instead, what worries researchers is the innovation that caused it. The DeepSeek paper describes a novel training method whereby the model was rewarded purely for getting correct answers, regardless of how comprehensible its thinking process was to humans. The worry is that this incentive-based approach could eventually lead AI systems to develop completely inscrutable ways of reasoning, maybe even creating their own nonhuman languages, if doing so proves to be more effective.
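The incentive at issue can be illustrated with a toy sketch in Python. Nothing here is taken from DeepSeek's code: the function names, the crude ASCII-token legibility proxy, and the 0.2 weight are all assumptions made for illustration. The point is only that a reward based purely on the final answer cannot distinguish legible reasoning from mixed-language reasoning, while a reward that also scores legibility can.

```python
def outcome_reward(final_answer: str, reference: str) -> float:
    """Hypothetical outcome-only reward: 1.0 if the final answer matches,
    0.0 otherwise. The reasoning text is never inspected."""
    return 1.0 if final_answer.strip() == reference.strip() else 0.0


def ascii_word_fraction(reasoning: str) -> float:
    """Crude stand-in for legibility (an assumption, not a real metric):
    the fraction of whitespace-separated tokens that are pure ASCII."""
    tokens = reasoning.split()
    if not tokens:
        return 0.0
    return sum(t.isascii() for t in tokens) / len(tokens)


def blended_reward(reasoning: str, final_answer: str, reference: str,
                   weight: float = 0.2) -> float:
    """Hypothetical alternative: correctness plus a small legibility bonus."""
    return outcome_reward(final_answer, reference) + weight * ascii_word_fraction(reasoning)


english = "Let x = 3 so x + 4 = 7"
mixed = "Let x = 3 所以 x + 4 = 7"

# Outcome-only: both reasoning traces earn the same reward.
print(outcome_reward("7", "7"), outcome_reward("7", "7"))

# Blended: the mixed-language trace scores slightly lower.
print(blended_reward(english, "7", "7") > blended_reward(mixed, "7", "7"))
```

Under the outcome-only function, any pressure toward human-readable reasoning has to come from somewhere other than the reward, which is the gap the researchers quoted above are worried about.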
