Facebook Pixel DeepSeek's hidden warning for AI safety | Time - news - Magzter.comでこの記事を読む

試す - 無料

DeepSeek's hidden warning for AI safety

Time

|

February 24, 2025

THE RELEASE OF DEEPSEEK R1 STUNNED WALL STREET and Silicon Valley in January, spooking investors and impressing tech leaders.

- BY BILLY PERRIGO

DeepSeek's hidden warning for AI safety

But amid all the talk, many overlooked a critical detail about the way the new Chinese artificial intelligence model functions-a nuance that has researchers worried about humanity's ability to control sophisticated new AI systems.

It's all down to an innovation in how DeepSeek R1 was trained-one that led to surprising behaviors in an early version of the model, which researchers described in the technical documentation accompanying its release.

During testing, researchers noticed that the model would spontaneously switch between English and Chinese while it was solving problems. When they forced it to stick to one language, thus making it easier for users to follow along, they found that the system's ability to solve the same problems would diminish.

That finding rang alarm bells for some AI-safety researchers. Currently, the most capable AI systems "think" in human-legible languages, writing out their reasoning before coming to a conclusion. That has been a boon for safety teams, whose most effective guardrails involve monitoring models' so-called chains of thought for signs of dangerous behaviors. But DeepSeek's results raised the possibility of a decoupling on the horizon: one where new AI capabilities might be gained from freeing models of the constraints of human language altogether.

To be sure, DeepSeek's language switching is not by itself cause for alarm. Instead, what worries researchers is the new innovation that caused it. The DeepSeek paper describes a novel training method whereby the model was rewarded purely for getting correct answers, regardless of how comprehensible its thinking process was to humans. The worry is that this incentive-based approach could eventually lead AI systems to develop completely inscrutable ways of reasoning, maybe even creating their own nonhuman languages, if doing so proves to be more effective.

Time からのその他のストーリー

Time

Time

The Risk Report

PRESIDENT CLAUDIA SHEINBAUM scored a major win last month when Mexican special forces killed Rubén Nemesio “El Mencho” Oseguera, leader of the country’s most powerful criminal organization.

time to read

2 mins

March 23, 2026

Time

Time

The D.C. Brief

JAMES TALARICO, THE SECULARIST seminarian armed with a biblical rejoinder for what he sees as politics' sins, won Texas' hard-fought U.S. Senate Democratic primary on March 3, setting up a November push once seen as a Hail Mary for his party.

time to read

1 mins

March 23, 2026

Time

Time

FREEZING BUT FREE

I am writing from Kyiv, where the temperature in my apartment barely reaches 40°F. I sleep in thermal underwear, an insulated tracksuit, and a winter hat. Recently, I added gloves. Russian attacks have destroyed 80% of our energy and heat infrastructure, working to keep Ukraine cold in what is apparently an attempt to turn us against one another. Instead, neighbors cook borscht for the entire building.

time to read

3 mins

March 23, 2026

Time

Time

Eric Dane: Actor and ALS advocate

ACTOR ERIC DANE, KNOWN FOR HIS MANY ROLES INCLUDING ON TV shows Grey's Anatomy and Euphoria, died Feb. 19 at age 53 following \"a courageous battle with ALS,\" a statement from his representatives said.

time to read

1 mins

March 23, 2026

Time

Health Matters By Veronique Greenwood

Think back to the first thing you can remember, and you'll find you were likely already several years old.

time to read

1 min

March 23, 2026

Time

Time

5 phrases that will instantly get your doctor's attention

DOCTORS DON'T JUST EXAMINE bodies—they also decode language.

time to read

3 mins

March 23, 2026

Time

B.C., to DST: A constant clock

Things will be sunnier in British Columbia going forward. That's because the Canadian province decided that after it sets its clocks ahead for daylight saving time (DST) on March 8, it's not going back to standard time in the fall.

time to read

1 min

March 23, 2026

Time

Time

Trump's War

THE PRESIDENT'S MASSIVE GAMBLE IN THE MIDDLE EAST

time to read

13 mins

March 23, 2026

Time

Time

Inside Iran, jubilation quickly followed by apprehension

In Tehran, the war found a contractor named Salman as he was toweling off.

time to read

2 mins

March 23, 2026

Time

Time

Two new comedies loosen up about sex on campus

IN THE SERIES PREMIERE OF NETFLIX’S VLADIMIR, Rachel Weisz awakens from troubled sleep to a cascade of texts and addresses the camera with pleading eyes.

time to read

5 mins

March 23, 2026

Listen

Translate

Share

-
+

Change font size