AI Tools Easily Tricked Into Safety Breaches, Say Researchers
The Guardian|April 04, 2024
The safety features on some of the most powerful AI tools that stop them being used for cybercrime or terrorism can be bypassed simply by flooding them with examples of wrongdoing, research shows.
Alex Hern 

In a paper from the AI lab Anthropic, which produces the large language model (LLM) behind the ChatGPT rival Claude, researchers described an attack they called "many-shot jailbreaking". It is as simple as it is effective.

Claude, like most large commercial AI systems, contains safety features designed to encourage it to refuse certain requests, such as to generate violent or hateful speech, or produce instructions for illegal activities. A user who asks the system for instructions to build a bomb, for example, will receive a polite refusal to engage.

This story is from the April 04, 2024 edition of The Guardian.
