DeepSeek's hidden warning for AI safety
Time | February 24, 2025
THE RELEASE OF DEEPSEEK R1 STUNNED WALL STREET and Silicon Valley in January, spooking investors and impressing tech leaders.
But amid all the talk, many overlooked a critical detail about the way the new Chinese artificial intelligence model functions, a nuance that has researchers worried about humanity's ability to control sophisticated new AI systems.
It's all down to an innovation in how DeepSeek R1 was trained, one that led to surprising behaviors in an early version of the model, which researchers described in the technical documentation accompanying its release.
During testing, researchers noticed that the model would spontaneously switch between English and Chinese while it was solving problems. When they forced it to stick to one language, thus making it easier for users to follow along, they found that the system's ability to solve the same problems would diminish.
That finding rang alarm bells for some AI-safety researchers. Currently, the most capable AI systems "think" in human-legible languages, writing out their reasoning before coming to a conclusion. That has been a boon for safety teams, whose most effective guardrails involve monitoring models' so-called chains of thought for signs of dangerous behaviors. But DeepSeek's results raised the possibility of a decoupling on the horizon: one where new AI capabilities might be gained from freeing models of the constraints of human language altogether.
To be sure, DeepSeek's language switching is not by itself cause for alarm. What worries researchers is the innovation that caused it. The DeepSeek paper describes a novel training method whereby the model was rewarded purely for getting correct answers, regardless of how comprehensible its thinking process was to humans. The worry is that this incentive-based approach could eventually lead AI systems to develop completely inscrutable ways of reasoning, maybe even creating their own nonhuman languages, if doing so proves to be more effective.
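For readers who want to see the incentive in concrete terms, here is a minimal sketch of what an outcome-only reward might look like. It is an illustration, not DeepSeek's actual code: the `<answer>` tag format and the function names `extract_final_answer` and `outcome_reward` are assumptions made for this example. The point it shows is that the score depends only on the final answer, so legible and illegible reasoning earn exactly the same reward.

```python
import re
from typing import Optional


def extract_final_answer(completion: str) -> Optional[str]:
    """Return the text of the last <answer>...</answer> tag, or None if absent.

    The tag format is a hypothetical convention chosen for this sketch.
    """
    matches = re.findall(r"<answer>(.*?)</answer>", completion, flags=re.DOTALL)
    return matches[-1].strip() if matches else None


def outcome_reward(completion: str, reference: str) -> float:
    """Give 1.0 for a correct final answer and 0.0 otherwise.

    Everything outside the answer tag (the model's reasoning) is ignored,
    so nothing here rewards reasoning that humans can follow.
    """
    answer = extract_final_answer(completion)
    if answer is not None and answer == reference.strip():
        return 1.0
    return 0.0


# Three hypothetical model outputs for the question "What is 17 + 25?"
legible = "First add 17 and 25 to get 42. <answer>42</answer>"
mixed = "17 加 25, carry the 1, 等于 42. <answer>42</answer>"
wrong = "17 plus 25 is 41. <answer>41</answer>"

for name, text in [("legible", legible), ("mixed", mixed), ("wrong", wrong)]:
    print(name, outcome_reward(text, "42"))
```

Under this scoring, the English-only response and the mixed-language response are indistinguishable, which is the dynamic researchers are concerned about: any pressure toward readable reasoning has to be added separately.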
This story is from the February 24, 2025 edition of Time.
