Vuélvete ilimitado con Magzter GOLD

Vuélvete ilimitado con Magzter GOLD

Obtenga acceso ilimitado a más de 9000 revistas, periódicos e historias Premium por solo

$149.99
 
$74.99/Año

Intentar ORO - Gratis

To fix AI, first break it: Red teaming for AI safety

The Sunday Guardian

|

July 06, 2025

Artificial intelligence is transforming society at an unprecedented pace, from generative chatbots in customer service to algorithms aiding medical diagnoses.

- POOJA ARORA

To fix AI, first break it: Red teaming for AI safety

Along with this promise, however, come serious risks AI systems have produced biased or harmful outputs, revealed private data, or been 'tricked' into unsafe behaviour. In one healthcare study, for example, red-team testing found that roughly one in five answers from advanced AI models like GPT-4 was inappropriate or unsafe for medical use. To ensure Al's benefits can be realized safely and ethically, the tech community is increasingly turning to red teaming - a practice of stress-testing AI systems to identify flaws before real adversaries or real-world conditions do.

In simple terms, red teaming is about playing 'devil's advocate' with AI systems - actively trying to break, mislead, or misuse them to expose weaknesses.

Originally a military and cybersecurity concept, red teaming refers to an adversarial testing effort where a 'red team' simulates attacks or exploits against a target, while a 'blue team' defends.

In the AI context, AI red teaming means probing AI models and their surrounding systems for vulnerabilities, harmful behaviours, or biases by emulating the strategies a malicious or curious attacker might use.

In essence, a red teamer tries to ask, 'How could this AI go wrong or be made to do something bad?" and then systematically tests those scenarios. Red teaming in AI goes beyond just the model's answers - it can involve examining the whole pipeline (data, infrastructure, user interface) for weaknesses. As modern AI models are open-ended and creative by design, they can also be creatively misused.

MÁS HISTORIAS DE The Sunday Guardian

The Sunday Guardian

The Sunday Guardian

Inside India's 2016 surgical strikes: Planning, precision, deterrence

The strikes came 11 days after the 18 September 2016 Uri attack, in which four militants stormed an Army base, killing 19 soldiers. The scale of the losses shocked the nation and demanded a forceful response.

time to read

3 mins

September 28, 2025

The Sunday Guardian

The Sunday Guardian

World Food India 2025 sees MoUs worth Rs 1 lakh crore in first two days

The second day of World Food India 2025, currently underway at Bharat Mandapam, marked major strides in India's vision to become the global food basket.

time to read

1 mins

September 28, 2025

The Sunday Guardian

CLOSE PARTNERSHIP WITH INDIA VITAL FOR U.S. GLOBAL SECURITY

Absence of a trade deal with India would seriously compromise the US in the ongoing hybrid confrontation with China. Whether a deal will come about or not depends in large part on the White House.

time to read

4 mins

September 28, 2025

The Sunday Guardian

The Sunday Guardian

THREATS TO RUIN FUTURE: EX-STUDENT REVEALS HOW DELHI GODMAN SUBJECTED FEMALE STUDENTS TO SEXUAL ABUSE

A red Volvo with a \"UN\" number plate, a BMW, a fake visiting card of \"permanent ambassador of UN Economic and Social Council (ECOSOC)\"-Swami Chaitanyananda Saraswati had built around himself a larger-than-life aura and knew how to show off in elite circles to project himself as an \"internationally acclaimed writer.\" But none of this corresponded with the reality: he is a serial sexual offender, according to students who have passed from Sri Sharada Institute of Indian Management and Research (SRISIIM), located in Delhi's Vasant Kunj

time to read

7 mins

September 28, 2025

The Sunday Guardian

The Sunday Guardian

VISA WARS AND THE GREAT BRAIN DRAIN: MAKE INDIA GREAT AGAIN

America's dramatic hike in the H1B visa fee is a watershed moment for global talent mobility, forcing India to confront both risks and opportunities. This is more than a cautionary tale; it is a chance for India to assert itself in the geopolitics of human capital.

time to read

5 mins

September 28, 2025

The Sunday Guardian

The Sunday Guardian

CHINA-RUSSIA-NORTH KOREA TRILATERAL ALIGNMENT CHALLENGES LEE JAE-MYUNG

Emerging trilateral ties complicate South Korea's efforts to engage North Korea diplomatically.

time to read

5 mins

September 28, 2025

The Sunday Guardian

The Sunday Guardian

Farewell to MiG-21, India's first supersonic fighter

On 26 September 2025, the skies over Chandigarh fell silent to a sound that had defined Indian air power for more than six decades.

time to read

5 mins

September 28, 2025

The Sunday Guardian

The Sunday Guardian

'Surat-Bilimora section of bullet train project to become operational in 2027'

Union Railway Minister Ashwini Vaishnaw has said that the Surat to Bilimora is the first section of the Bullet Train project that will become operational and several new technologies have been introduced into the work on the tracks.

time to read

3 mins

September 28, 2025

The Sunday Guardian

The Sunday Guardian

POLISH DIPLOMAT DEEPENS INDO-POLISH CULTURAL TIES THROUGH ARTISTIC EXCHANGES

Polish Institute New Delhi director champions cinema, music, literature, and heritage collaborations.

time to read

4 mins

September 28, 2025

The Sunday Guardian

LOC issued against Pune gangster Nilesh Ghaywal

A Look Out Circular has been issued against notorious Pune gangster Nilesh Ghaywal, who is suspected to have left the country despite facing fresh criminal charges.

time to read

1 mins

September 28, 2025

Listen

Translate

Share

-
+

Change font size