Prøve GULL - Gratis
To fix AI, first break it: Red teaming for AI safety
The Business Guardian
|July 06, 2025
Artificial intelligence is transforming society at an unprecedented pace, from generative chatbots in customer service to algorithms aiding medical diagnoses.
Artificial intelligence is transforming society at an unprecedented pace, from generative chatbots in customer service to algorithms aiding medical diagnoses. Along with this promise, however, come serious risks—AI systems have produced biased or harmful outputs, revealed private data, or been 'tricked' into unsafe behavior. In one healthcare study, for example, red-team testing found that roughly one in five answers from advanced AI models like GPT-4 was inappropriate or unsafe for medical use.
To ensure AI's benefits can be realized safely and ethically, the tech community is increasingly turning to red teaming—a practice of stress-testing AI systems to identify flaws before real adversaries or real-world conditions do.
In simple terms, red teaming is about playing 'devil's advocate' with AI systems—actively trying to break, mislead, or misuse them to expose weaknesses.
Originally a military and cybersecurity concept, red teaming refers to an adversarial testing effort where a 'red team' simulates attacks or exploits against a target, while a 'blue team' defends. In the AI context, AI red teaming means probing AI models and their surrounding systems for vulnerabilities, harmful behaviors, or biases by emulating the strategies a malicious or curious attacker might use.
In essence, a red teamer tries to ask, 'How could this AI go wrong or be made to do something bad?' and then systematically tests those scenarios. Red teaming in AI goes beyond just the model's answers—it can involve examining the whole pipeline (data, infrastructure, user interface) for weaknesses. As modern AI models are open-ended and creative by design, they can also be creatively misused.
Denne historien er fra July 06, 2025-utgaven av The Business Guardian.
Abonner på Magzter GOLD for å få tilgang til tusenvis av kuraterte premiumhistorier og over 9000 magasiner og aviser.
Allerede abonnent? Logg på
FLERE HISTORIER FRA The Business Guardian
The Business Guardian
Nitish Katara murder case: Delhi HC issues notice on Vikas Yadav's plea seeking 21
The Delhi High Court has issued a notice on a plea filed by Vikas Yadav, a convict in the Nitish Katara murder case, challenging the Delhi government's rejection of his application for 21 days’ furlough.
1 min
November 07, 2025
The Business Guardian
Piyush Goyal, Air New Zealand CEO discuss aviation opportunities
India and New Zealand are advancing discussions ona bilateral trade agreement aimed at building a sector-specific trade deal that strengthens economic ties without compromising on sensitive issues according to Union Minister of Commerce and Industry, Piyush Goyal.Goyal shared insights into the growing opportunities in India’s aviation sector during his meeting with Nikhil Ravishankar, CEO of Air New Zealand.
2 mins
November 07, 2025
The Business Guardian
Remembering Gopichand Hinduja: The wealth of giving
As the world mourns a billionaire, a glimpse into the man who valued people over profits
4 mins
November 07, 2025
The Business Guardian
STUBBLE BURNING TO DRIVE DELHI AIR POLLUTION SPIKE
Stubble burning and transport emissions will sharply increase Delhi’s PM2.5 levels.
1 min
November 07, 2025
The Business Guardian
Infosys unveils AI agent for energy sector operations
Infosys (NSE: INFY) (BSE: INFY) (NYSE: INFY), a global leader in next-generation digital services and consulting, has developed an AI Agent designed to digitally transform operations in the energy sector.
1 min
November 07, 2025
The Business Guardian
From leftovers: The magic hidden in a bowl of rice
There's something comforting about a bowl of rice — soft, versatile, and deeply rooted in Indian kitchens.
2 mins
November 07, 2025
The Business Guardian
MOOD FOOD: CAN WHAT YOU EAT REALLY MAKE YOU HAPPIER?
Ever noticed how a bowl of hot khichdi feels like a hug on a bad day or how dark chocolate seems to melt away stress instantly? Food has a fascinating connection with our mood — it's not just about taste, but chemistry, psychology, and emotion.
2 mins
November 07, 2025
The Business Guardian
RGCIRC Hosts 12th Annual International Nursing Conference - NURSICON 2025
NURSICON 2025
1 min
November 07, 2025
The Business Guardian
VinFast builds armored EV as new national symbol
Developed by VinFast, the country’s young automaker, in partnership with INKAS of Canada, the Lac Hong 900 LX is both a symbol of industrial maturity and a technological milestone.
1 mins
November 07, 2025
The Business Guardian
Air India launches flexible contract model for pilots
Air India rolls out a “Flexi Contract for Pilots,” a new work model that lets flight crew choose shorter duty patterns while keeping operations running smoothly.
2 mins
November 07, 2025
Listen
Translate
Change font size
