Magzter GOLDで無制限に

Magzter GOLDで無制限に

10,000以上の雑誌、新聞、プレミアム記事に無制限にアクセスできます。

$149.99
 
$74.99/年

試す - 無料

To fix AI, first break it: Red teaming for AI safety

The Business Guardian

|

July 06, 2025

Artificial intelligence is transforming society at an unprecedented pace, from generative chatbots in customer service to algorithms aiding medical diagnoses.

- POOJA ARORA

To fix AI, first break it: Red teaming for AI safety

Artificial intelligence is transforming society at an unprecedented pace, from generative chatbots in customer service to algorithms aiding medical diagnoses. Along with this promise, however, come serious risks—AI systems have produced biased or harmful outputs, revealed private data, or been 'tricked' into unsafe behavior. In one healthcare study, for example, red-team testing found that roughly one in five answers from advanced AI models like GPT-4 was inappropriate or unsafe for medical use.

To ensure AI's benefits can be realized safely and ethically, the tech community is increasingly turning to red teaming—a practice of stress-testing AI systems to identify flaws before real adversaries or real-world conditions do.

In simple terms, red teaming is about playing 'devil's advocate' with AI systems—actively trying to break, mislead, or misuse them to expose weaknesses.

Originally a military and cybersecurity concept, red teaming refers to an adversarial testing effort where a 'red team' simulates attacks or exploits against a target, while a 'blue team' defends. In the AI context, AI red teaming means probing AI models and their surrounding systems for vulnerabilities, harmful behaviors, or biases by emulating the strategies a malicious or curious attacker might use.

In essence, a red teamer tries to ask, 'How could this AI go wrong or be made to do something bad?' and then systematically tests those scenarios. Red teaming in AI goes beyond just the model's answers—it can involve examining the whole pipeline (data, infrastructure, user interface) for weaknesses. As modern AI models are open-ended and creative by design, they can also be creatively misused.

The Business Guardian からのその他のストーリー

The Business Guardian

The Business Guardian

Miranda House unveils digital museum honouring women Constitution makers

Marking Samvidhan Divas, Delhi University’s Miranda House unveiled a digital and interactive museum dedicated to the 15 women members of the Constituent Assembly who played a vital role in shaping the Indian Constitution.

time to read

1 min

November 27, 2025

The Business Guardian

The Business Guardian

UIDAI begins clean-up, disables 2 crore Aadhaar IDs of deceased persons

The Unique Identification Authority of India (UIDAI) has deactivated more than two crore Aadhaar numbers belonging to deceased individuals, marking one of the largest cleanup exercises of the national identity database.

time to read

1 min

November 27, 2025

The Business Guardian

Safran vows to raise sourcing from India by five times

Safran CEO Olivier Andriès also vowed that it will triple its revenue in India to exceed 3 billion euros by 2030, of which half will be generated by its sites in India.

time to read

2 mins

November 27, 2025

The Business Guardian

Delhi crowd pays tribute to Guru Tegh Bahadur after blast

Delhi Home Minister Ashish Sood on Wednesday said that the gathering of lakhs of people at the Red Fort to commemorate the 350th martyrdom anniversary of Guru Tegh Bahadur — the same site where a bomb blast took place recently — reflected the capital's collective spirit and served as a powerful response to terrorists.

time to read

1 min

November 27, 2025

The Business Guardian

Will always be remembered as historical festival: CM

Delhi Chief Minister Rekha Gupta described the programme commemorating the 350th martyrdom anniversary of Sri Guru Tegh Bahadur Ji as a “historical festival” for the city, praising the massive participation of devotees from across the country.

time to read

1 min

November 27, 2025

The Business Guardian

The Business Guardian

SAFRAN MRO IN HYDERABAD TO ATTRACT MORE GLOBAL INVESTORS

The MRO facility will be a huge step towards the goal of Aatmanirbharta in the aviation sector. Developing indigenous capabilities in MRO will reduce foreign exchange outflows

time to read

2 mins

November 27, 2025

The Business Guardian

The Business Guardian

Safran MRO facility in Hyderabad will act as gateway for more global players: Aviation Minister

French defence and aviation major Safran's just-inaugurated MRO facility in Hyderabad could be a step-up towards India's ambition to become an aviation hub.

time to read

1 mins

November 27, 2025

The Business Guardian

Rapido eyes IPO after racing ahead of Uber

Rapido, which is focused on the mass market and deeper penetration rather than premium customers, will think of a public listing when it has captured 70-75% of the ride-hailing market, the company’s chief financial officer Vivek Krishna has said in an interaction with The New Indian Express.

time to read

1 min

November 27, 2025

The Business Guardian

The Business Guardian

Kharif food grain production estimated at 173.33 million tonnes for 2025-26: Shivraj Singh Chouhan

Union Agriculture Minister Shivraj Singh Chouhan on Tuesday released the first advanced estimates of production of main Kharif crops, according to.

time to read

1 mins

November 27, 2025

The Business Guardian

The Business Guardian

SENSEX UP 1,000 POINTS AMID GLOBAL CUES, STRONG FUNDAMENTALS

All sectoral indices soared today, with metal, consumer durables, oil and gas leading the pack, NSE data showed.

time to read

2 mins

November 27, 2025

Listen

Translate

Share

-
+

Change font size