कोशिश गोल्ड - मुक्त
To fix AI, first break it: Red teaming for AI safety
The Business Guardian
|July 06, 2025
Artificial intelligence is transforming society at an unprecedented pace, from generative chatbots in customer service to algorithms aiding medical diagnoses.
Artificial intelligence is transforming society at an unprecedented pace, from generative chatbots in customer service to algorithms aiding medical diagnoses. Along with this promise, however, come serious risks—AI systems have produced biased or harmful outputs, revealed private data, or been 'tricked' into unsafe behavior. In one healthcare study, for example, red-team testing found that roughly one in five answers from advanced AI models like GPT-4 was inappropriate or unsafe for medical use.
To ensure AI's benefits can be realized safely and ethically, the tech community is increasingly turning to red teaming—a practice of stress-testing AI systems to identify flaws before real adversaries or real-world conditions do.
In simple terms, red teaming is about playing 'devil's advocate' with AI systems—actively trying to break, mislead, or misuse them to expose weaknesses.
Originally a military and cybersecurity concept, red teaming refers to an adversarial testing effort where a 'red team' simulates attacks or exploits against a target, while a 'blue team' defends. In the AI context, AI red teaming means probing AI models and their surrounding systems for vulnerabilities, harmful behaviors, or biases by emulating the strategies a malicious or curious attacker might use.
In essence, a red teamer tries to ask, 'How could this AI go wrong or be made to do something bad?' and then systematically tests those scenarios. Red teaming in AI goes beyond just the model's answers—it can involve examining the whole pipeline (data, infrastructure, user interface) for weaknesses. As modern AI models are open-ended and creative by design, they can also be creatively misused.
यह कहानी The Business Guardian के July 06, 2025 संस्करण से ली गई है।
हजारों चुनिंदा प्रीमियम कहानियों और 10,000 से अधिक पत्रिकाओं और समाचार पत्रों तक पहुंचने के लिए मैगज़्टर गोल्ड की सदस्यता लें।
क्या आप पहले से ही ग्राहक हैं? साइन इन करें
The Business Guardian से और कहानियाँ
The Business Guardian
DELHI GOVT LAUNCHES ‘PINK SAHELI SMART CARD’
The Delhi government introduced the ‘Pink Saheli Smart Card’ to enable free travel for women and transgender passengers across the capital.
2 mins
November 03, 2025
The Business Guardian
Employees’ Enrolment Scheme-2025 launched to widen social security coverage
Employees’ Enrolment Scheme - 2025 was launched by Union Minister for Labour & Employment and Youth Affairs & Sports at the 73rd Foundation Day of the Employees’ Provident Fund Organisation (EPFO) in New Delhi, aiming to boost voluntary compliance and expand social security coverage for eligible employees across India. According to a release by the Ministry, the Employees’ Enrolment Scheme - 2025
1 min
November 03, 2025
The Business Guardian
Liquor vend staff caught refilling costly bottles with cheap alcohol in Delhi mall
In a raid conducted by the Excise Department of the Delhi government, employees of a liquor store were caught refilling and mixing cheap alcohol and water in the bottles of costly brands, at a mall in Narela, officials said on Sunday.
1 min
November 03, 2025
The Business Guardian
India's forex reserves dip $6.9 bn, remain near record level
India’s foreign exchange reserves declined by USD 6.925 billion in the week that ended October 24 to USD 695.355 billion, driven by a slump in both foreign currency assets and gold reserves, the Reserve Bank of India’s latest ‘Weekly Statistical Supplement’ data showed.
1 min
November 03, 2025
The Business Guardian
Scindia: Northeast now a land-linked powerhouse
The decade that just, past has marked a period of transformation for the northeastern states, turning it from a “landlocked” region into a “land-linked powerhouse under Prime Minister Narendra Modi, Union Minister Jyotiraditya Scindia said.
2 mins
November 03, 2025
The Business Guardian
SBI Research expects FY26 GST collections to surpass budget projections
The Goods and Services (GST) revenue for the Financial Year 2026 (FY26) will still be higher than budgeted collections, according to SBI Research.
1 mins
November 03, 2025
The Business Guardian
FPIs return as net buyers in India after three months of outflows
After three consecutive months of persistent selling, foreign portfolio investors (FPIs) again turned net buyers in the Indian stock markets in October.
2 mins
November 03, 2025
The Business Guardian
VOLVO FEELS AT HOME IN INDIA: MD BALI
Volvo Group India now sees India as a home market and has partnered with Tata Motors to boost sustainable transport.
1 mins
November 03, 2025
The Business Guardian
ISHA CHHABRA SHINES IN A.R. RAHMAN’S GULISTAN CHALE
Emerging filmmaker Isha Chhabra has made a powerful debut with her evocative music video Gulistan Chale, created in collaboration with music legend A.R. Rahman.
2 mins
November 03, 2025
The Business Guardian
Springer Nature honours Indian editors at 2025 symposium
‘The day long Journal Development Symposium 2025, organised by Springer Nature, brought together over 100 journal editors, society representatives, and publishing professionals from across India to discuss the evolving dynamics of research publishing in the age of Artificial Intelligence (AI) and Open Access‘The symposium served as a collaborative platform for dialogue around ethical publishing practices, the responsible integration of AI tools in scholarly workflows, and the broader implications of India’s transformative One Nation One Subscription (ONOS) initiative.
1 min
November 03, 2025
Listen
Translate
Change font size
