कोशिश गोल्ड - मुक्त

To fix AI, first break it: Red teaming for AI safety

The Business Guardian

|

July 06, 2025

Artificial intelligence is transforming society at an unprecedented pace, from generative chatbots in customer service to algorithms aiding medical diagnoses.

- POOJA ARORA

To fix AI, first break it: Red teaming for AI safety

Artificial intelligence is transforming society at an unprecedented pace, from generative chatbots in customer service to algorithms aiding medical diagnoses. Along with this promise, however, come serious risks—AI systems have produced biased or harmful outputs, revealed private data, or been 'tricked' into unsafe behavior. In one healthcare study, for example, red-team testing found that roughly one in five answers from advanced AI models like GPT-4 was inappropriate or unsafe for medical use.

To ensure AI's benefits can be realized safely and ethically, the tech community is increasingly turning to red teaming—a practice of stress-testing AI systems to identify flaws before real adversaries or real-world conditions do.

In simple terms, red teaming is about playing 'devil's advocate' with AI systems—actively trying to break, mislead, or misuse them to expose weaknesses.

Originally a military and cybersecurity concept, red teaming refers to an adversarial testing effort where a 'red team' simulates attacks or exploits against a target, while a 'blue team' defends. In the AI context, AI red teaming means probing AI models and their surrounding systems for vulnerabilities, harmful behaviors, or biases by emulating the strategies a malicious or curious attacker might use.

In essence, a red teamer tries to ask, 'How could this AI go wrong or be made to do something bad?' and then systematically tests those scenarios. Red teaming in AI goes beyond just the model's answers—it can involve examining the whole pipeline (data, infrastructure, user interface) for weaknesses. As modern AI models are open-ended and creative by design, they can also be creatively misused.

The Business Guardian से और कहानियाँ

The Business Guardian

The Business Guardian

DELHI GOVT LAUNCHES ‘PINK SAHELI SMART CARD’

The Delhi government introduced the ‘Pink Saheli Smart Card’ to enable free travel for women and transgender passengers across the capital.

time to read

2 mins

November 03, 2025

The Business Guardian

Employees’ Enrolment Scheme-2025 launched to widen social security coverage

Employees’ Enrolment Scheme - 2025 was launched by Union Minister for Labour & Employment and Youth Affairs & Sports at the 73rd Foundation Day of the Employees’ Provident Fund Organisation (EPFO) in New Delhi, aiming to boost voluntary compliance and expand social security coverage for eligible employees across India. According to a release by the Ministry, the Employees’ Enrolment Scheme - 2025

time to read

1 min

November 03, 2025

The Business Guardian

The Business Guardian

Liquor vend staff caught refilling costly bottles with cheap alcohol in Delhi mall

In a raid conducted by the Excise Department of the Delhi government, employees of a liquor store were caught refilling and mixing cheap alcohol and water in the bottles of costly brands, at a mall in Narela, officials said on Sunday.

time to read

1 min

November 03, 2025

The Business Guardian

The Business Guardian

India's forex reserves dip $6.9 bn, remain near record level

India’s foreign exchange reserves declined by USD 6.925 billion in the week that ended October 24 to USD 695.355 billion, driven by a slump in both foreign currency assets and gold reserves, the Reserve Bank of India’s latest ‘Weekly Statistical Supplement’ data showed.

time to read

1 min

November 03, 2025

The Business Guardian

The Business Guardian

Scindia: Northeast now a land-linked powerhouse

The decade that just, past has marked a period of transformation for the northeastern states, turning it from a “landlocked” region into a “land-linked powerhouse under Prime Minister Narendra Modi, Union Minister Jyotiraditya Scindia said.

time to read

2 mins

November 03, 2025

The Business Guardian

The Business Guardian

SBI Research expects FY26 GST collections to surpass budget projections

The Goods and Services (GST) revenue for the Financial Year 2026 (FY26) will still be higher than budgeted collections, according to SBI Research.

time to read

1 mins

November 03, 2025

The Business Guardian

The Business Guardian

FPIs return as net buyers in India after three months of outflows

After three consecutive months of persistent selling, foreign portfolio investors (FPIs) again turned net buyers in the Indian stock markets in October.

time to read

2 mins

November 03, 2025

The Business Guardian

The Business Guardian

VOLVO FEELS AT HOME IN INDIA: MD BALI

Volvo Group India now sees India as a home market and has partnered with Tata Motors to boost sustainable transport.

time to read

1 mins

November 03, 2025

The Business Guardian

The Business Guardian

ISHA CHHABRA SHINES IN A.R. RAHMAN’S GULISTAN CHALE

Emerging filmmaker Isha Chhabra has made a powerful debut with her evocative music video Gulistan Chale, created in collaboration with music legend A.R. Rahman.

time to read

2 mins

November 03, 2025

The Business Guardian

Springer Nature honours Indian editors at 2025 symposium

‘The day long Journal Development Symposium 2025, organised by Springer Nature, brought together over 100 journal editors, society representatives, and publishing professionals from across India to discuss the evolving dynamics of research publishing in the age of Artificial Intelligence (AI) and Open Access‘The symposium served as a collaborative platform for dialogue around ethical publishing practices, the responsible integration of AI tools in scholarly workflows, and the broader implications of India’s transformative One Nation One Subscription (ONOS) initiative.

time to read

1 min

November 03, 2025

Listen

Translate

Share

-
+

Change font size