DeepSeek's hidden warning for AI safety

Time | February 24, 2025

THE RELEASE OF DEEPSEEK R1 STUNNED WALL STREET and Silicon Valley in January, spooking investors and impressing tech leaders.

- BY BILLY PERRIGO

But amid all the talk, many overlooked a critical detail about the way the new Chinese artificial intelligence model functions, a nuance that has researchers worried about humanity's ability to control sophisticated new AI systems.

It's all down to an innovation in how DeepSeek R1 was trained, one that led to surprising behaviors in an early version of the model, which researchers described in the technical documentation accompanying its release.

During testing, researchers noticed that the model would spontaneously switch between English and Chinese while it was solving problems. When they forced it to stick to one language, thus making it easier for users to follow along, they found that the system's ability to solve the same problems would diminish.

That finding rang alarm bells for some AI-safety researchers. Currently, the most capable AI systems "think" in human-legible languages, writing out their reasoning before coming to a conclusion. That has been a boon for safety teams, whose most effective guardrails involve monitoring models' so-called chains of thought for signs of dangerous behaviors. But DeepSeek's results raised the possibility of a decoupling on the horizon: one where new AI capabilities might be gained from freeing models of the constraints of human language altogether.

To be sure, DeepSeek's language switching is not by itself cause for alarm. Instead, what worries researchers is the innovation that caused it. The DeepSeek paper describes a novel training method whereby the model was rewarded purely for getting correct answers, regardless of how comprehensible its thinking process was to humans. The worry is that this incentive-based approach could eventually lead AI systems to develop completely inscrutable ways of reasoning, maybe even creating their own nonhuman languages, if doing so proves to be more effective.
