Essayer OR - Gratuit
DeepSeek's hidden warning for AI safety
Time
|February 24, 2025
THE RELEASE OF DEEPSEEK R1 STUNNED WALL STREET and Silicon Valley in January, spooking investors and impressing tech leaders.
But amid all the talk, many overlooked a critical detail about the way the new Chinese artificial intelligence model functions-a nuance that has researchers worried about humanity's ability to control sophisticated new AI systems.
It's all down to an innovation in how DeepSeek R1 was trained-one that led to surprising behaviors in an early version of the model, which researchers described in the technical documentation accompanying its release.
During testing, researchers noticed that the model would spontaneously switch between English and Chinese while it was solving problems. When they forced it to stick to one language, thus making it easier for users to follow along, they found that the system's ability to solve the same problems would diminish.
That finding rang alarm bells for some AI-safety researchers. Currently, the most capable AI systems "think" in human-legible languages, writing out their reasoning before coming to a conclusion. That has been a boon for safety teams, whose most effective guardrails involve monitoring models' so-called chains of thought for signs of dangerous behaviors. But DeepSeek's results raised the possibility of a decoupling on the horizon: one where new AI capabilities might be gained from freeing models of the constraints of human language altogether.
To be sure, DeepSeek's language switching is not by itself cause for alarm. Instead, what worries researchers is the new innovation that caused it. The DeepSeek paper describes a novel training method whereby the model was rewarded purely for getting correct answers, regardless of how comprehensible its thinking process was to humans. The worry is that this incentive-based approach could eventually lead AI systems to develop completely inscrutable ways of reasoning, maybe even creating their own nonhuman languages, if doing so proves to be more effective.
Cette histoire est tirée de l'édition February 24, 2025 de Time.
Abonnez-vous à Magzter GOLD pour accéder à des milliers d'histoires premium sélectionnées et à plus de 9 000 magazines et journaux.
Déjà abonné ? Se connecter
PLUS D'HISTOIRES DE Time
Time
HOW TO STEAL A NUCLEAR POWER PLANT AND GET AWAY WITH IT
VLADIMIR PUTIN HAD DONE HIS HOMEWORK.
16 mins
November 10, 2025
Time
FAMILY MATTERS
A crop of fall movies search proverbial—and literal— attics to explore what makes a family unit tick
6 mins
November 10, 2025
Time
Padma Lakshmi The culinary television star on centering immigrant stories, taking inspiration from activism, and writing her latest cookbook
You often speak about food through the lens of family. Why is that important to you?
3 mins
November 10, 2025
Time
A New Wave origin story, and an act of love
SOME DAYS IT SEEMS WE LIVE IN A HORRID WORLD where most humans couldn’t give a fig about art. How many people in that world are going to care about a 65-year-old black-and-white movie—one that, for anyone who doesn’t speak French, requires the reading of subtitles?
2 mins
November 10, 2025
Time
In the Loop
IN OCTOBER, HEART-WRENCHING photos of a 12-year-old girl driving her sick puppy to the vet went viral on social media. But upon closer examination, users noticed strange details: her steering wheel was on the right side of the car, which also lacked a dashboard.
2 mins
November 10, 2025
Time
A murder franchise finds its Monsters- and they're us
MIDWAY THROUGH MONSTER: THE ED GEIN STORY, the title character stares into the camera and warns: “You shouldn't be watching this.” He’s talking to two strangers who've interrupted him in the bloody aftermath of a murder. But the closeup makes it clear that Gein, played with eerie gentleness by Charlie Hunnam, is also addressing his audience of Netflix viewers. Then he revs his chainsaw and chases the men. Of course, we keep watching. In the next scene, Gein offers the spectacle of a dead, nude woman, strung up like a carcass in a slaughterhouse.
3 mins
November 10, 2025
Time
HOW THE DEAL GOT DONE
Inside Trump's unconventional Middle East diplomacy
15 mins
November 10, 2025
Time
Slow Horses gets an explosive sister show
In the premiere of Down Cemetery Road, a desperate woman walks into a private investigator's office. “Let me guess,” says the detective, Zoë Boehm (Emma Thompson). “You've got a husband. He's got a secretary. Am I warm?” She is not. Neither a film-noir femme fatale nor a jealous housewife, Sarah Trafford (Ruth Wilson) has come for help in solving a mystery that has little to do with her own life. Her initially inexplicable obsession sets the tone for Apple's unusually humane conspiracy thriller.
1 mins
November 10, 2025
Time
EDGE OF INVASION
Taiwan prepares as shadows of war creep closer to its shores
15 mins
November 10, 2025
Time
The Risk Report
WHEN FORMER PRIME MINISTER, champion of multiparty democracy, and longtime opposition leader Raila Odinga died on Oct. 15, Kenya lost the country's most consequential figure of the past generation.
3 mins
November 10, 2025
Listen
Translate
Change font size
