Prøve GULL - Gratis

To fix AI, first break it: Red teaming for AI safety

The Business Guardian

|

July 06, 2025

Artificial intelligence is transforming society at an unprecedented pace, from generative chatbots in customer service to algorithms aiding medical diagnoses.

- POOJA ARORA

To fix AI, first break it: Red teaming for AI safety

Artificial intelligence is transforming society at an unprecedented pace, from generative chatbots in customer service to algorithms aiding medical diagnoses. Along with this promise, however, come serious risks—AI systems have produced biased or harmful outputs, revealed private data, or been 'tricked' into unsafe behavior. In one healthcare study, for example, red-team testing found that roughly one in five answers from advanced AI models like GPT-4 was inappropriate or unsafe for medical use.

To ensure AI's benefits can be realized safely and ethically, the tech community is increasingly turning to red teaming—a practice of stress-testing AI systems to identify flaws before real adversaries or real-world conditions do.

In simple terms, red teaming is about playing 'devil's advocate' with AI systems—actively trying to break, mislead, or misuse them to expose weaknesses.

Originally a military and cybersecurity concept, red teaming refers to an adversarial testing effort where a 'red team' simulates attacks or exploits against a target, while a 'blue team' defends. In the AI context, AI red teaming means probing AI models and their surrounding systems for vulnerabilities, harmful behaviors, or biases by emulating the strategies a malicious or curious attacker might use.

In essence, a red teamer tries to ask, 'How could this AI go wrong or be made to do something bad?' and then systematically tests those scenarios. Red teaming in AI goes beyond just the model's answers—it can involve examining the whole pipeline (data, infrastructure, user interface) for weaknesses. As modern AI models are open-ended and creative by design, they can also be creatively misused.

FLERE HISTORIER FRA The Business Guardian

The Business Guardian

Nitish Katara murder case: Delhi HC issues notice on Vikas Yadav's plea seeking 21

The Delhi High Court has issued a notice on a plea filed by Vikas Yadav, a convict in the Nitish Katara murder case, challenging the Delhi government's rejection of his application for 21 days’ furlough.

time to read

1 min

November 07, 2025

The Business Guardian

The Business Guardian

Piyush Goyal, Air New Zealand CEO discuss aviation opportunities

India and New Zealand are advancing discussions ona bilateral trade agreement aimed at building a sector-specific trade deal that strengthens economic ties without compromising on sensitive issues according to Union Minister of Commerce and Industry, Piyush Goyal.Goyal shared insights into the growing opportunities in India’s aviation sector during his meeting with Nikhil Ravishankar, CEO of Air New Zealand.

time to read

2 mins

November 07, 2025

The Business Guardian

The Business Guardian

Remembering Gopichand Hinduja: The wealth of giving

As the world mourns a billionaire, a glimpse into the man who valued people over profits

time to read

4 mins

November 07, 2025

The Business Guardian

STUBBLE BURNING TO DRIVE DELHI AIR POLLUTION SPIKE

Stubble burning and transport emissions will sharply increase Delhi’s PM2.5 levels.

time to read

1 min

November 07, 2025

The Business Guardian

The Business Guardian

Infosys unveils AI agent for energy sector operations

Infosys (NSE: INFY) (BSE: INFY) (NYSE: INFY), a global leader in next-generation digital services and consulting, has developed an AI Agent designed to digitally transform operations in the energy sector.

time to read

1 min

November 07, 2025

The Business Guardian

The Business Guardian

From leftovers: The magic hidden in a bowl of rice

There's something comforting about a bowl of rice — soft, versatile, and deeply rooted in Indian kitchens.

time to read

2 mins

November 07, 2025

The Business Guardian

The Business Guardian

MOOD FOOD: CAN WHAT YOU EAT REALLY MAKE YOU HAPPIER?

Ever noticed how a bowl of hot khichdi feels like a hug on a bad day or how dark chocolate seems to melt away stress instantly? Food has a fascinating connection with our mood — it's not just about taste, but chemistry, psychology, and emotion.

time to read

2 mins

November 07, 2025

The Business Guardian

The Business Guardian

RGCIRC Hosts 12th Annual International Nursing Conference - NURSICON 2025

NURSICON 2025

time to read

1 min

November 07, 2025

The Business Guardian

The Business Guardian

VinFast builds armored EV as new national symbol

Developed by VinFast, the country’s young automaker, in partnership with INKAS of Canada, the Lac Hong 900 LX is both a symbol of industrial maturity and a technological milestone.

time to read

1 mins

November 07, 2025

The Business Guardian

The Business Guardian

Air India launches flexible contract model for pilots

Air India rolls out a “Flexi Contract for Pilots,” a new work model that lets flight crew choose shorter duty patterns while keeping operations running smoothly.

time to read

2 mins

November 07, 2025

Listen

Translate

Share

-
+

Change font size