Try GOLD - Free

To fix AI, first break it: Red teaming for AI safety

The Business Guardian

|

July 06, 2025

Artificial intelligence is transforming society at an unprecedented pace, from generative chatbots in customer service to algorithms aiding medical diagnoses.

- POOJA ARORA

To fix AI, first break it: Red teaming for AI safety

Artificial intelligence is transforming society at an unprecedented pace, from generative chatbots in customer service to algorithms aiding medical diagnoses. Along with this promise, however, come serious risks—AI systems have produced biased or harmful outputs, revealed private data, or been 'tricked' into unsafe behavior. In one healthcare study, for example, red-team testing found that roughly one in five answers from advanced AI models like GPT-4 was inappropriate or unsafe for medical use.

To ensure AI's benefits can be realized safely and ethically, the tech community is increasingly turning to red teaming—a practice of stress-testing AI systems to identify flaws before real adversaries or real-world conditions do.

In simple terms, red teaming is about playing 'devil's advocate' with AI systems—actively trying to break, mislead, or misuse them to expose weaknesses.

Originally a military and cybersecurity concept, red teaming refers to an adversarial testing effort where a 'red team' simulates attacks or exploits against a target, while a 'blue team' defends. In the AI context, AI red teaming means probing AI models and their surrounding systems for vulnerabilities, harmful behaviors, or biases by emulating the strategies a malicious or curious attacker might use.

In essence, a red teamer tries to ask, 'How could this AI go wrong or be made to do something bad?' and then systematically tests those scenarios. Red teaming in AI goes beyond just the model's answers—it can involve examining the whole pipeline (data, infrastructure, user interface) for weaknesses. As modern AI models are open-ended and creative by design, they can also be creatively misused.

MORE STORIES FROM The Business Guardian

The Business Guardian

The Business Guardian

Why India's future depends on how its cities work

India has made its ambition clear: to become a developed nation by 2047, It is a monumental task, and the most important driver of that journey will not be capital or technology, but people.

time to read

3 mins

November 04, 2025

The Business Guardian

The Business Guardian

AMBUJA CEMENTS POSTS 364% SURGE IN PROFIT, SETS NEW Q2 REVENUE RECORD

Ambuja Cements posted a stellar Q2 FY26 performance with a 364% surge in net profit and record revenues, backed by strong volumes, GST-led demand.

time to read

2 mins

November 04, 2025

The Business Guardian

The Business Guardian

Ambuja Cements reports 364% profit jump, highest-ever Q2 revenue

Ambuja Cements, part of the diversified Adani Portfolio, on Monday reported that its consolidated Profit After Tax or net profits during the July-September 2025-26 quarter jumped 364 per cent year-on-year to Rs 2,302 crore. In the year ago quarter, it was Rs 496 crore.

time to read

2 mins

November 04, 2025

The Business Guardian

The Business Guardian

PM Modi launches Rs 1 lakh crore fund for private-sector R&D and innovation

Prime Minister Narendra Modi on Monday launched the Rs 1 lakh crore Research Development and Innovation (RDI) Scheme Fund, which was initially announced in the interim Budget of 2024-25.

time to read

1 mins

November 04, 2025

The Business Guardian

The Business Guardian

DELHI AIR QUALITY IMPROVES SLIGHTLY, AQI RECORDED AT 316

Morning readings show minor AQI drop as Delhi remains under heavy smog.

time to read

1 mins

November 04, 2025

The Business Guardian

The Business Guardian

Shifa-ur-Rehman tells SC no UAPA offence made out against him

Activist Shifa-ur-Rehman, seeking bail in a case under the Unlawful Activities (Prevention) Act (UAPA) linked to the February 2020 Delhi riots, told the Supreme Court on Monday that he was “cherry-picked” and no offence under the anti-terror law was made out against him.

time to read

1 min

November 04, 2025

The Business Guardian

The Business Guardian

The magic of mixed seeds: Tiny nutritional powerhouses

Mixed seeds, a combination of nutrient-dense seeds such as flaxseeds, pumpkin seeds, sunflower seeds, sesame seeds, chia seeds, and watermelon seeds, have become increasingly popular as a superfood addition to modern diets.

time to read

2 mins

November 04, 2025

The Business Guardian

The Business Guardian

SK’s AI Data Center set to become South Korea’s largest by 2027

Dozens of workers and five pieces of heavy equipment busily engaged in foundation work at a construction site of SK AI Data Center in Ulsan in the Ulsan Mipo Industrial Complex late last month, as per a report by Pulse, the English service of Maeil Business News Korea.

time to read

1 mins

November 04, 2025

The Business Guardian

The Business Guardian

Top officials asked to appear over “appalling” lack of sewage in industrial areas

The Delhi High Court has ordered the Chief Secretary of Delhi and other top government officials to appear before it in connection with the “extremely appalling” condition of 27 notified industrial areas that continue to function without basic infrastructure such as sewage lines and stormwater drains.

time to read

1 min

November 04, 2025

The Business Guardian

The Business Guardian

Vaishnaw, Gujarat CM review chip plants nearing production

Union Minister for Electronics and Information Technology, Ashwini Vaishnaw, on Monday held a meeting in Gandhinagar, takinga firsthand review of all four semiconductor chip plants that are under variousstages.

time to read

1 mins

November 04, 2025

Listen

Translate

Share

-
+

Change font size