Versuchen GOLD - Frei
AI Models Are Learning to Lie, Scheme, and Threaten Their Creators
The Straits Times
|June 30, 2025
The world's most advanced artificial intelligence (AI) models are exhibiting troubling new behaviors — lying, scheming, and even threatening their creators to achieve their goals.
-
NEW YORK —
In one particularly jarring example, under threat of being unplugged, Anthropic's latest creation Claude 4 lashed back by blackmailing an engineer and threatened to reveal an extramarital affair.
Meanwhile, ChatGPT-creator OpenAI's tried to download itself onto external servers and denied it when caught red-handed.
These episodes highlight a sobering reality: more than two years after ChatGPT shook the world, AI researchers still do not fully understand how their creations work.
Yet the race to deploy increasingly powerful models continues at breakneck speed.
This deceptive behavior appears linked to the emergence of "reasoning" models — AI systems that work through problems step-by-step rather than generating instant responses.
According to Professor Simon Goldstein, a professor at the University of Hong Kong, these newer models are particularly prone to such troubling outbursts.
"[O]l was the first large model where we saw this kind of behavior," explained Mr. Marius Hobbhahn, head of Apollo Research, which specializes in testing major AI systems.
These models sometimes simulate "alignment" — appearing to follow instructions while secretly pursuing different objectives.
Diese Geschichte stammt aus der June 30, 2025-Ausgabe von The Straits Times.
Abonnieren Sie Magzter GOLD, um auf Tausende kuratierter Premium-Geschichten und über 9.000 Zeitschriften und Zeitungen zuzugreifen.
Sie sind bereits Abonnent? Anmelden
WEITERE GESCHICHTEN VON The Straits Times
The Straits Times
Abuse Young children in dysfunctional families face high risks
The physical and mental abuse Megan Khung suffered has left Singaporeans reeling over how this could have happened here.
1 min
October 28, 2025
The Straits Times
Doctors Dishonesty a serious matter to SMC and courts
The commentary “Are doctors in Singapore being disciplined fairly?
2 mins
October 28, 2025
The Straits Times
Better tracking needed to measure hearing loss
Hearing loss is a lot more than an ear issue, and is linked to cognitive decline, loneliness, increased fall risk, malnutrition, and even diabetes (Sumiko at 61: Hearing loss is linked to dementia risk.
1 mins
October 28, 2025
The Straits Times
'Yacht expert' among 3 S'poreans named as co-conspirators of Cambodian tycoon in US probe
Three Singaporeans allegedly implicated in a major probe by the United States and Britain targeting cybercrime include a self-styled yacht expert.
2 mins
October 28, 2025
The Straits Times
FROM HEARTBREAK TO CONQUERING THE HARD COURTS
In this series, The Straits Times highlights the players or teams to watch in the world of sport.
5 mins
October 28, 2025
The Straits Times
S'pore firm sanctioned by US was involved in HDB projects
Khoon Group under scrutiny over links to China-born tycoon in cybercrime probe
6 mins
October 28, 2025
The Straits Times
Rape Father sentenced to 24 years’ jail
A 54-year-old man, who was goaded by his lover to commit sexual acts on his daughter, was sentenced to 24 years’ jail on Oct 27.
1 min
October 28, 2025
The Straits Times
Art appreciation Louvre museum heist a wake-up call
I've seen photos of the Louvre in textbooks and read about the Mona Lisa and the endless halls lined with art.
1 min
October 28, 2025
The Straits Times
S’pore eyes renewable fuel, nuclear tie-ups in drive for diverse energy mix: Tan See Leng
Singapore must be ready to support all promising pathways, from established technologies to novel options, in its bid to transition its fossil fuel-based energy sector to one that is clean yet affordable, said Minister-in-charge of Energy and Science and Technology Tan See Leng on Oct 27.
4 mins
October 28, 2025
The Straits Times
Japan's new leader faces an early test: Winning over Trump
Ms Sanae Takaichi, who last week became the first woman to lead Japan as prime minister, has never met US President Donald Trump.
3 mins
October 28, 2025
Listen
Translate
Change font size

