Magzter GOLD ile Sınırsız Olun

Magzter GOLD ile Sınırsız Olun

Sadece 9.000'den fazla dergi, gazete ve Premium hikayeye sınırsız erişim elde edin

$149.99
 
$74.99/Yıl

Denemek ALTIN - Özgür

To fix AI, first break it: Red teaming for AI safety

The Sunday Guardian

|

July 06, 2025

Artificial intelligence is transforming society at an unprecedented pace, from generative chatbots in customer service to algorithms aiding medical diagnoses.

- POOJA ARORA

To fix AI, first break it: Red teaming for AI safety

Along with this promise, however, come serious risks AI systems have produced biased or harmful outputs, revealed private data, or been 'tricked' into unsafe behaviour. In one healthcare study, for example, red-team testing found that roughly one in five answers from advanced AI models like GPT-4 was inappropriate or unsafe for medical use. To ensure Al's benefits can be realized safely and ethically, the tech community is increasingly turning to red teaming - a practice of stress-testing AI systems to identify flaws before real adversaries or real-world conditions do.

In simple terms, red teaming is about playing 'devil's advocate' with AI systems - actively trying to break, mislead, or misuse them to expose weaknesses.

Originally a military and cybersecurity concept, red teaming refers to an adversarial testing effort where a 'red team' simulates attacks or exploits against a target, while a 'blue team' defends.

In the AI context, AI red teaming means probing AI models and their surrounding systems for vulnerabilities, harmful behaviours, or biases by emulating the strategies a malicious or curious attacker might use.

In essence, a red teamer tries to ask, 'How could this AI go wrong or be made to do something bad?" and then systematically tests those scenarios. Red teaming in AI goes beyond just the model's answers - it can involve examining the whole pipeline (data, infrastructure, user interface) for weaknesses. As modern AI models are open-ended and creative by design, they can also be creatively misused.

The Sunday Guardian'den DAHA FAZLA HİKAYE

The Sunday Guardian

The Sunday Guardian

ELECTORAL ROLL: SC seeks ECI’s response to pleas against SIR in Kerala, UP

The Supreme Court has sought the Election Commission of India’s (ECD) response to a batch of pleas filed by various petitioners including the Kerala government challenging the ECT's decision to carry out Special Intensive Revision (SIR) exercise of the voter rollin Kerala.

time to read

1 min

November 23, 2025

The Sunday Guardian

The Sunday Guardian

FRANCE TO INVESTIGATE MUSK'S GROK CHATBOT

France's government is taking action against billionaire Elon Musk 's artificial intelligence chatbot Grok after it generated French-language posts that questioned the use of gas chambers at Auschwitz, officials said.

time to read

1 mins

November 23, 2025

The Sunday Guardian

The Sunday Guardian

Piyush Goyal's maiden Israel visit strengthens ties in tech, trade, agri

Commerce and Industry Minister Piyush Goyal held a series of wide-ranging engagements during his official visit to Israel, further strengthening bilateral cooperation across agriculture, technology, innovation and trade.

time to read

2 mins

November 23, 2025

The Sunday Guardian

The Sunday Guardian

Using welfare for political gain is inappropriate

Despite foreign criticism, India’s welfare policies remain essential and socially responsible.

time to read

2 mins

November 23, 2025

The Sunday Guardian

PM MODI PROPOSES THREE NEW G20 INITIATIVES AT AFRICA SUMMIT

PM also calls for development approaches rooted in sustainability, inclusivity and cultural wisdom.

time to read

2 mins

November 23, 2025

The Sunday Guardian

Unknown lockers found in GMCs across Kashmir

Surprise inspections follow terror-linked findings in doctors’ lockers at Kashmir hospitals.

time to read

1 mins

November 23, 2025

The Sunday Guardian

Delhi Police uncover ISI-backed gun running operation

Drones were used to airdrop Turkish pistols and Chinese weapons.

time to read

3 mins

November 23, 2025

The Sunday Guardian

The blasts in Delhi and Islamabad: Why India may have to resort to pre-emptive actions

While India would not want a war, the Pakistani army would not mind another exchange, if only to re-establish its relevance again. So, though war avoidance is desirable, it cannot bea strategy.

time to read

5 mins

November 23, 2025

The Sunday Guardian

The Sunday Guardian

Siddu vs D.K. once more

The power tussle in Karnataka between the supporters of Chief Minister Siddaramaiah and his deputy and Pradesh Congress Committee (PCC) chief D.K. Shivakumar appears to be unending. The latest round is currently on and i coincides with Siddu completing two and a half years in office.

time to read

3 mins

November 23, 2025

The Sunday Guardian

Reverse migration of Bangladeshis may impact TMC in polls

Since the rollout of the Election Commission's Special Intensive Revision (SIR) in West Bengal on November 4, border posts like Hakimpur in North 24 Parganas district have witnessed a marked increase in Bangladeshi nationals returning home, with district authorities and the Border Security Force noting that more than 1,600 Bangladeshi migrants had crossed back in just days. Many of these individuals had lived in India for over a decade, enrolling in voter lists and welfare

time to read

4 mins

November 23, 2025

Listen

Translate

Share

-
+

Change font size