Mit Magzter GOLD unbegrenztes Potenzial nutzen

Mit Magzter GOLD unbegrenztes Potenzial nutzen

Erhalten Sie unbegrenzten Zugriff auf über 9.000 Zeitschriften, Zeitungen und Premium-Artikel für nur

$149.99
 
$74.99/Jahr

Versuchen GOLD - Frei

To fix AI, first break it: Red teaming for AI safety

The Sunday Guardian

|

July 06, 2025

Artificial intelligence is transforming society at an unprecedented pace, from generative chatbots in customer service to algorithms aiding medical diagnoses.

- POOJA ARORA

To fix AI, first break it: Red teaming for AI safety

Along with this promise, however, come serious risks AI systems have produced biased or harmful outputs, revealed private data, or been 'tricked' into unsafe behaviour. In one healthcare study, for example, red-team testing found that roughly one in five answers from advanced AI models like GPT-4 was inappropriate or unsafe for medical use. To ensure Al's benefits can be realized safely and ethically, the tech community is increasingly turning to red teaming - a practice of stress-testing AI systems to identify flaws before real adversaries or real-world conditions do.

In simple terms, red teaming is about playing 'devil's advocate' with AI systems - actively trying to break, mislead, or misuse them to expose weaknesses.

Originally a military and cybersecurity concept, red teaming refers to an adversarial testing effort where a 'red team' simulates attacks or exploits against a target, while a 'blue team' defends.

In the AI context, AI red teaming means probing AI models and their surrounding systems for vulnerabilities, harmful behaviours, or biases by emulating the strategies a malicious or curious attacker might use.

In essence, a red teamer tries to ask, 'How could this AI go wrong or be made to do something bad?" and then systematically tests those scenarios. Red teaming in AI goes beyond just the model's answers - it can involve examining the whole pipeline (data, infrastructure, user interface) for weaknesses. As modern AI models are open-ended and creative by design, they can also be creatively misused.

WEITERE GESCHICHTEN VON The Sunday Guardian

The Sunday Guardian

The Sunday Guardian

INSIDE BAHRIA FOUNDATION, PAKISTAN NAVY'S CORPORATE EMPIRE

Pakistan today is a country mired in economic crisis.

time to read

5 mins

September 21, 2025

The Sunday Guardian

MAMATA FORGETS INDUSTRIAL PROMISES, FUNDS VOTE-BANK SCHEMES

The Bengal government cancelled 30 years of signed commitments retrospectively.

time to read

4 mins

September 21, 2025

The Sunday Guardian

The Sunday Guardian

SUPREME COURT IS THE LAST HOPE FOR RESCUING A U.S. IN TURMOIL

The list of evidence that President Trump is living in a world of Alternate Reality is lengthening steadily. Now only the US Supreme Court stands as an effective obstacle to the chaos being created by the White House.

time to read

4 mins

September 21, 2025

The Sunday Guardian

Trump's $100,000 H1-B fee to hit Indians the hardest

US President Donald Trump on Saturday (India time) announced a sharp increase in the cost of applying for H1-B visas, raising the fee to $100,000 per petition.

time to read

6 mins

September 21, 2025

The Sunday Guardian

The Sunday Guardian

‘BULLET TRAIN PROJECT WILL BENEFIT THE MIDDLE CLASS'

Following PM Narendra Modi’s announcement in Japan to run bullet trains across 7,000 km in India, we not only conducted a reality check on the Bullet Train project, the most ambitious project underway, but also spoke with Railway Minister Ashwini Vaishnaw about it.

time to read

2 mins

September 21, 2025

The Sunday Guardian

BJP DEPLOYS LEADERS TO DRIVE BIHAR POLL STRATEGY

With the Bihar Assembly elections drawing closer, the Bharatiya Janata Party (BJP) has stepped up its preparations, unveiling a comprehensive roadmap that ranges from strengthening booth-level presence to overseeing statewide campaign coordination.

time to read

1 min

September 21, 2025

The Sunday Guardian

CISF ROLLS OUT LANDMARK REFORMS IN PROMOTIONS, POSTINGS

Cutting delay, 13,520 non-gazetted officers and 406 gazetted officers were promoted this year so far

time to read

1 mins

September 21, 2025

The Sunday Guardian

The Sunday Guardian

China and the post-American order

Pax Britannica ended not because Britain wanted it to, but because it could no longer afford its empire. Pax Americana is unravelling for the same reason: America cannot command the global economy, the institutions, or the narrative as it once did.

time to read

6 mins

September 21, 2025

The Sunday Guardian

The Sunday Guardian

China's stealth fighter J-35 is a mirage for Pakistan

It is increasingly unlikely that Pakistan will be able to fly China's J-35 stealth fighter in this decade.

time to read

2 mins

September 21, 2025

The Sunday Guardian

GANDHI FAMILY VISIT HEATS UP KERALA POLITICAL SCENARIO

Gandhi family's Wayanad visit stirs politics ahead of assembly elections.

time to read

2 mins

September 21, 2025

Listen

Translate

Share

-
+

Change font size