Go Unlimited with Magzter GOLD

Go Unlimited with Magzter GOLD

Get unlimited access to 9,500+ magazines, newspapers and Premium stories for just

$149.99
 
$74.99/Year

Try GOLD - Free

To fix AI, first break it: Red teaming for AI safety

The Business Guardian

|

July 06, 2025

Artificial intelligence is transforming society at an unprecedented pace, from generative chatbots in customer service to algorithms aiding medical diagnoses.

- POOJA ARORA

To fix AI, first break it: Red teaming for AI safety

Artificial intelligence is transforming society at an unprecedented pace, from generative chatbots in customer service to algorithms aiding medical diagnoses. Along with this promise, however, come serious risks—AI systems have produced biased or harmful outputs, revealed private data, or been 'tricked' into unsafe behavior. In one healthcare study, for example, red-team testing found that roughly one in five answers from advanced AI models like GPT-4 was inappropriate or unsafe for medical use.

To ensure AI's benefits can be realized safely and ethically, the tech community is increasingly turning to red teaming—a practice of stress-testing AI systems to identify flaws before real adversaries or real-world conditions do.

In simple terms, red teaming is about playing 'devil's advocate' with AI systems—actively trying to break, mislead, or misuse them to expose weaknesses.

Originally a military and cybersecurity concept, red teaming refers to an adversarial testing effort where a 'red team' simulates attacks or exploits against a target, while a 'blue team' defends. In the AI context, AI red teaming means probing AI models and their surrounding systems for vulnerabilities, harmful behaviors, or biases by emulating the strategies a malicious or curious attacker might use.

In essence, a red teamer tries to ask, 'How could this AI go wrong or be made to do something bad?' and then systematically tests those scenarios. Red teaming in AI goes beyond just the model's answers—it can involve examining the whole pipeline (data, infrastructure, user interface) for weaknesses. As modern AI models are open-ended and creative by design, they can also be creatively misused.

MORE STORIES FROM The Business Guardian

The Business Guardian

The Business Guardian

BofA bullish on Paytm, cites soundbox, AI, cost discipline

Paytm(One 97 Communications Limited), India’s full stack merchant payments leader, is showing steady momentum across its core business of Payments, Soundbox, and Merchant Lending business, according to a recent report by BofA Global Research.

time to read

1 mins

September 20, 2025

The Business Guardian

The Business Guardian

MF inflows shield market; stocks may trade sideways: Jefferies

India’s stock markets are being supported largely by consistent mutual fund investments, which are preventing a deeper fall despite heavy outflows, according to a report by Jefferies.

time to read

1 mins

September 20, 2025

The Business Guardian

ESSF organises symposium on Education

Ek Soach Saathiya Foundation (ESSF) organised a Symposium on Education for All followed by a Cultural Evening at the Convention Hall, Airport Authority Officers’ Institute, Safdarjung Airport, New Delhi.

time to read

1 min

September 20, 2025

The Business Guardian

Female workforce share grows but wage gap persists

A report by the Delhi government indicates that while the ratio of female workers in the labour force of the national capital has increased, their wages remain lower than those of men, despite some fluctuations over the years.

time to read

1 mins

September 20, 2025

The Business Guardian

The Business Guardian

15 Years of Leapswitch: A Journey of Growth and Success

Leapswitch Networks, 15 years old today, began with a wild dream: to make cloud services affordable, available, and reliable.

time to read

1 mins

September 20, 2025

The Business Guardian

The Business Guardian

Moglix drives electronics growth with 50+ brands

Moglix, one of Asia’s largest B2B e-commerce platforms, announced at Electronica India 2025 in Bangalore that it has onboarded more than 50, suppliers on its platform.

time to read

2 mins

September 20, 2025

The Business Guardian

The Business Guardian

Hyundai unveils 2030 plan with 18 hybrids, first India EV

South Korea’s Hyundai Motor Co. has announced a mid to long-term strategy to expand its hybrid electric vehicle (HEV) lineup and launch region-specific electric vehicles (EVs) for India and other countries.Hyundai Motor’s EV strategy features regionally tailored products designed for specific markets.

time to read

1 min

September 20, 2025

The Business Guardian

Rekha Gupta hails ABVP’s triumph in DUSU elections

With the RSS-affiliated ABVP bagging three posts in the Delhi University Students’ Union (DUSU) elections, Delhi Chief Minister Rekha Gupta on Friday congratulated the poll winners, urging them to fulfil their responsibilities towards the university.

time to read

1 min

September 20, 2025

The Business Guardian

The Business Guardian

Artemis Hospital launches North India's premier Heart & Lung Transplant Centre

Artemis Hospital today announced the inauguration of its modern Heart & Lung Transplant Centre.

time to read

1 min

September 20, 2025

The Business Guardian

The Business Guardian

Adani Power tops private thermal producers; earnings to triple by 2033

Adani Power Limited (APL) has firmly established itself as India's largest private coal-based independent power producer (IPP), with a portfolio of 18,150 MW spread across 12 plants in eight states, according to a research report by Morgan Stanley.APL has successfully acquired and turned around 4,370 MW of stressed assets, with integration of another 2,900 MW underway.

time to read

1 mins

September 20, 2025

Listen

Translate

Share

-
+

Change font size