Versuchen GOLD - Frei
To fix AI, first break it: Red teaming for AI safety
The Sunday Guardian
|July 06, 2025
Artificial intelligence is transforming society at an unprecedented pace, from generative chatbots in customer service to algorithms aiding medical diagnoses.

Along with this promise, however, come serious risks AI systems have produced biased or harmful outputs, revealed private data, or been 'tricked' into unsafe behaviour. In one healthcare study, for example, red-team testing found that roughly one in five answers from advanced AI models like GPT-4 was inappropriate or unsafe for medical use. To ensure Al's benefits can be realized safely and ethically, the tech community is increasingly turning to red teaming - a practice of stress-testing AI systems to identify flaws before real adversaries or real-world conditions do.
In simple terms, red teaming is about playing 'devil's advocate' with AI systems - actively trying to break, mislead, or misuse them to expose weaknesses.
Originally a military and cybersecurity concept, red teaming refers to an adversarial testing effort where a 'red team' simulates attacks or exploits against a target, while a 'blue team' defends.
In the AI context, AI red teaming means probing AI models and their surrounding systems for vulnerabilities, harmful behaviours, or biases by emulating the strategies a malicious or curious attacker might use.
In essence, a red teamer tries to ask, 'How could this AI go wrong or be made to do something bad?" and then systematically tests those scenarios. Red teaming in AI goes beyond just the model's answers - it can involve examining the whole pipeline (data, infrastructure, user interface) for weaknesses. As modern AI models are open-ended and creative by design, they can also be creatively misused.
Diese Geschichte stammt aus der July 06, 2025-Ausgabe von The Sunday Guardian.
Abonnieren Sie Magzter GOLD, um auf Tausende kuratierter Premium-Geschichten und über 9.000 Zeitschriften und Zeitungen zuzugreifen.
Sie sind bereits Abonnent? Anmelden
WEITERE GESCHICHTEN VON The Sunday Guardian

The Sunday Guardian
INSIDE BAHRIA FOUNDATION, PAKISTAN NAVY'S CORPORATE EMPIRE
Pakistan today is a country mired in economic crisis.
5 mins
September 21, 2025
The Sunday Guardian
MAMATA FORGETS INDUSTRIAL PROMISES, FUNDS VOTE-BANK SCHEMES
The Bengal government cancelled 30 years of signed commitments retrospectively.
4 mins
September 21, 2025

The Sunday Guardian
SUPREME COURT IS THE LAST HOPE FOR RESCUING A U.S. IN TURMOIL
The list of evidence that President Trump is living in a world of Alternate Reality is lengthening steadily. Now only the US Supreme Court stands as an effective obstacle to the chaos being created by the White House.
4 mins
September 21, 2025
The Sunday Guardian
Trump's $100,000 H1-B fee to hit Indians the hardest
US President Donald Trump on Saturday (India time) announced a sharp increase in the cost of applying for H1-B visas, raising the fee to $100,000 per petition.
6 mins
September 21, 2025

The Sunday Guardian
‘BULLET TRAIN PROJECT WILL BENEFIT THE MIDDLE CLASS'
Following PM Narendra Modi’s announcement in Japan to run bullet trains across 7,000 km in India, we not only conducted a reality check on the Bullet Train project, the most ambitious project underway, but also spoke with Railway Minister Ashwini Vaishnaw about it.
2 mins
September 21, 2025
The Sunday Guardian
BJP DEPLOYS LEADERS TO DRIVE BIHAR POLL STRATEGY
With the Bihar Assembly elections drawing closer, the Bharatiya Janata Party (BJP) has stepped up its preparations, unveiling a comprehensive roadmap that ranges from strengthening booth-level presence to overseeing statewide campaign coordination.
1 min
September 21, 2025
The Sunday Guardian
CISF ROLLS OUT LANDMARK REFORMS IN PROMOTIONS, POSTINGS
Cutting delay, 13,520 non-gazetted officers and 406 gazetted officers were promoted this year so far
1 mins
September 21, 2025

The Sunday Guardian
China and the post-American order
Pax Britannica ended not because Britain wanted it to, but because it could no longer afford its empire. Pax Americana is unravelling for the same reason: America cannot command the global economy, the institutions, or the narrative as it once did.
6 mins
September 21, 2025

The Sunday Guardian
China's stealth fighter J-35 is a mirage for Pakistan
It is increasingly unlikely that Pakistan will be able to fly China's J-35 stealth fighter in this decade.
2 mins
September 21, 2025
The Sunday Guardian
GANDHI FAMILY VISIT HEATS UP KERALA POLITICAL SCENARIO
Gandhi family's Wayanad visit stirs politics ahead of assembly elections.
2 mins
September 21, 2025
Listen
Translate
Change font size