Essayer OR - Gratuit
AI Models Are Learning to Lie, Scheme, and Threaten Their Creators
The Straits Times
|June 30, 2025
The world's most advanced artificial intelligence (AI) models are exhibiting troubling new behaviors — lying, scheming, and even threatening their creators to achieve their goals.
-
NEW YORK —
In one particularly jarring example, under threat of being unplugged, Anthropic's latest creation Claude 4 lashed back by blackmailing an engineer and threatened to reveal an extramarital affair.
Meanwhile, ChatGPT-creator OpenAI's tried to download itself onto external servers and denied it when caught red-handed.
These episodes highlight a sobering reality: more than two years after ChatGPT shook the world, AI researchers still do not fully understand how their creations work.
Yet the race to deploy increasingly powerful models continues at breakneck speed.
This deceptive behavior appears linked to the emergence of "reasoning" models — AI systems that work through problems step-by-step rather than generating instant responses.
According to Professor Simon Goldstein, a professor at the University of Hong Kong, these newer models are particularly prone to such troubling outbursts.
"[O]l was the first large model where we saw this kind of behavior," explained Mr. Marius Hobbhahn, head of Apollo Research, which specializes in testing major AI systems.
These models sometimes simulate "alignment" — appearing to follow instructions while secretly pursuing different objectives.
Cette histoire est tirée de l'édition June 30, 2025 de The Straits Times.
Abonnez-vous à Magzter GOLD pour accéder à des milliers d'histoires premium sélectionnées et à plus de 9 000 magazines et journaux.
Déjà abonné ? Se connecter
PLUS D'HISTOIRES DE The Straits Times
The Straits Times
At 80, the jeepney is still King of the Road, but for how long?
The colourful vehicle is a symbol of Filipino creativity and the country's traffic challenges. The age of EVs will be a test of its days on the road.
5 mins
October 27, 2025
The Straits Times
GROUP 3 SAUDI DERBY A NEW GATEWAY TO KENTUCKY DERBY
Points will be up for grabs to qualify for Run For The Roses
3 mins
October 27, 2025
The Straits Times
Time to relook 'many helping hands' approach and have a unified aid response
The tragic death of little Megan Khung has left an ineffable ache in the nation's heart.
1 mins
October 27, 2025
The Straits Times
Slot didn't expect 4 losses; needs to find answers fast
Their title defence had begun well but losses at Brentford, Chelsea and Crystal Palace, plus the previous weekend’s 2-1 home defeat by Manchester United, have knocked Liverpool off the rails.
2 mins
October 27, 2025
The Straits Times
After Megan Khung: Family, abuse and the reckoning around child safety
The case should prompt a deeper reflection on what we could have done better and the challenges in dealing with family abuse.
6 mins
October 27, 2025
The Straits Times
Singaporean, Canadian pen pals finally meet after 43 years
The letters between Michelle Anne Ng and Sonya Clarke Casey forged a friendship that saw them share about their life experiences and secrets
5 mins
October 27, 2025
The Straits Times
Thai-Cambodian 'peace accord' is Trump-centric but may prove to be more than just optics
If there ever was any doubt over the intended audience for the signing of the “Kuala Lumpur Peace Accord”, the answer came shortly after Thailand’s royal palace announced the death of the Queen Mother Sirikit on the night of Oct 24.
4 mins
October 27, 2025
The Straits Times
Tan crosses $lm mark in less than two years on tour
Even as heavy rain and fog brought uncertainty to the Wistron Ladies Open in Taiwan, it did not stop Singaporean golfer Shannon Tan from reaching her latest milestone as she surpassed the $1 million mark in career earnings with a joint-44th finish on Oct 26.
4 mins
October 27, 2025
The Straits Times
Lifelong learning Effective training is a shared responsibility
We thank Mr Ives Tay for his letter “Let's see real results from lifelong learning” (Oct 7).
1 mins
October 27, 2025
The Straits Times
Trump turns on the charm - and so does Asean
US President's visit has left an indelible mark on his hosts, Malaysia and Asean
4 mins
October 27, 2025
Listen
Translate
Change font size

