HOW TO TEACH AI RIGHT FROM WRONG

BBC Science Focus | February 2026

If we want to get good responses from AI, we may need to see what it does when we ask it to be evil

- MICHAEL WOOLDRIDGE

Today’s AI tools are strange beasts.

On the one hand, they have truly remarkable capabilities. You can ask Large Language Models (LLMs) like ChatGPT or Google’s Gemini about quantum mechanics or the collapse of the Roman Empire and they’ll respond fluently and confidently.

But LLMs can also seem wilfully stupid. For one thing, they get a lot wrong. Ask for a list of key references on quantum mechanics and it’s quite possible that some of the references they produce will be entirely fictitious – ‘hallucinations’ invented by the AI.

Hallucinations are the most prominent problem with current AI models, but they’re not the only one. Just as concerning is that LLMs can easily be steered – deliberately or by accident – into generating wildly inappropriate responses. One notorious incident proved deeply embarrassing for Microsoft: in 2016, its AI chatbot ‘Tay’ had to be taken offline within 24 hours after being coaxed into producing racist, sexist and antisemitic tweets.

TOO EAGER TO BE HELPFUL

Tay was much simpler than current AI models, but the problem remains – with the right sort of prompt, it’s possible to get an offensive, or even potentially harmful, response from an AI.

The problem comes about firstly because these AIs are designed to be helpful. When you present them with a ‘prompt’, they compute the outcome that seems like the best possible response. For the most part, this is exactly what we want. But the neural networks that underpin LLMs are designed to be helpful in response to

