Ga onbeperkt met Magzter GOLD

Ga onbeperkt met Magzter GOLD

Krijg onbeperkte toegang tot meer dan 9000 tijdschriften, kranten en Premium-verhalen voor slechts

$149.99
 
$74.99/Jaar

Poging GOUD - Vrij

To fix AI, first break it: Red teaming for AI safety

The Business Guardian

|

July 06, 2025

Artificial intelligence is transforming society at an unprecedented pace, from generative chatbots in customer service to algorithms aiding medical diagnoses.

- POOJA ARORA

To fix AI, first break it: Red teaming for AI safety

Artificial intelligence is transforming society at an unprecedented pace, from generative chatbots in customer service to algorithms aiding medical diagnoses. Along with this promise, however, come serious risks—AI systems have produced biased or harmful outputs, revealed private data, or been 'tricked' into unsafe behavior. In one healthcare study, for example, red-team testing found that roughly one in five answers from advanced AI models like GPT-4 was inappropriate or unsafe for medical use.

To ensure AI's benefits can be realized safely and ethically, the tech community is increasingly turning to red teaming—a practice of stress-testing AI systems to identify flaws before real adversaries or real-world conditions do.

In simple terms, red teaming is about playing 'devil's advocate' with AI systems—actively trying to break, mislead, or misuse them to expose weaknesses.

Originally a military and cybersecurity concept, red teaming refers to an adversarial testing effort where a 'red team' simulates attacks or exploits against a target, while a 'blue team' defends. In the AI context, AI red teaming means probing AI models and their surrounding systems for vulnerabilities, harmful behaviors, or biases by emulating the strategies a malicious or curious attacker might use.

In essence, a red teamer tries to ask, 'How could this AI go wrong or be made to do something bad?' and then systematically tests those scenarios. Red teaming in AI goes beyond just the model's answers—it can involve examining the whole pipeline (data, infrastructure, user interface) for weaknesses. As modern AI models are open-ended and creative by design, they can also be creatively misused.

MEER VERHALEN VAN The Business Guardian

The Business Guardian

The Business Guardian

India must boost demand and ease rules to strengthen manufacturing: Godrej

India needs stronger domestic demand and simpler rules for small businesses to push its manufacturing growth, Jamshyd Godrej, Managing Director of Godrej & Boyce, said on Wednesday.

time to read

1 mins

November 27, 2025

The Business Guardian

Cabinet clears two rail multitracking projects adding 224 km across Maha, Gujarat

The Cabinet Committee on Economic Affairs (CCEA), chaired by Prime Minister Narendra Modi, has approved two multitracking projects of the Ministry of Railways covering four districts across Maharashtra and Gujarat.

time to read

1 mins

November 27, 2025

The Business Guardian

The Business Guardian

Cabinet approves Rs 7280 crore Rare Earth Permanent Magnets scheme

In a significant initiative aimed at enhancing self-reliance and positioning India as a key player in the global REPM market, the Union Cabinet, chaired by Prime Minister Narendra Modi, on Wednesday approved the 'Scheme to Promote Manufacturing of Sintered Rare Earth Permanent Magnets' with a financial outlay of Rs 7,280 crore.

time to read

1 mins

November 27, 2025

The Business Guardian

The Business Guardian

Nvidia defends AI lead after $250 billion jolt on 'Google deal' buzz

Nvidia, the AI chip giant, publicly defended its market dominance after reports surfaced of Meta Platforms considering billions in spending on Google’s competing AI chips.

time to read

2 mins

November 27, 2025

The Business Guardian

The Business Guardian

Miranda House unveils digital museum honouring women Constitution makers

Marking Samvidhan Divas, Delhi University’s Miranda House unveiled a digital and interactive museum dedicated to the 15 women members of the Constituent Assembly who played a vital role in shaping the Indian Constitution.

time to read

1 min

November 27, 2025

The Business Guardian

The Business Guardian

UIDAI begins clean-up, disables 2 crore Aadhaar IDs of deceased persons

The Unique Identification Authority of India (UIDAI) has deactivated more than two crore Aadhaar numbers belonging to deceased individuals, marking one of the largest cleanup exercises of the national identity database.

time to read

1 min

November 27, 2025

The Business Guardian

Safran vows to raise sourcing from India by five times

Safran CEO Olivier Andriès also vowed that it will triple its revenue in India to exceed 3 billion euros by 2030, of which half will be generated by its sites in India.

time to read

2 mins

November 27, 2025

The Business Guardian

Delhi crowd pays tribute to Guru Tegh Bahadur after blast

Delhi Home Minister Ashish Sood on Wednesday said that the gathering of lakhs of people at the Red Fort to commemorate the 350th martyrdom anniversary of Guru Tegh Bahadur — the same site where a bomb blast took place recently — reflected the capital's collective spirit and served as a powerful response to terrorists.

time to read

1 min

November 27, 2025

The Business Guardian

Will always be remembered as historical festival: CM

Delhi Chief Minister Rekha Gupta described the programme commemorating the 350th martyrdom anniversary of Sri Guru Tegh Bahadur Ji as a “historical festival” for the city, praising the massive participation of devotees from across the country.

time to read

1 min

November 27, 2025

The Business Guardian

The Business Guardian

SAFRAN MRO IN HYDERABAD TO ATTRACT MORE GLOBAL INVESTORS

The MRO facility will be a huge step towards the goal of Aatmanirbharta in the aviation sector. Developing indigenous capabilities in MRO will reduce foreign exchange outflows

time to read

2 mins

November 27, 2025

Listen

Translate

Share

-
+

Change font size