HOW TO TEACH AI RIGHT FROM WRONG

BBC Science Focus | February 2026

If we want to get good responses from AI, we may need to see what it does when we ask it to be evil

- MICHAEL WOOLDRIDGE

Today’s AI tools are strange beasts.

On the one hand, they have truly remarkable capabilities. You can ask Large Language Models (LLMs) like ChatGPT or Google’s Gemini about quantum mechanics or the collapse of the Roman Empire and they’ll respond fluently and confidently.

But LLMs can also seem wilfully stupid. For one thing, they get a lot wrong. Ask for a list of key references on quantum mechanics and it’s quite possible that some of the references they produce will be entirely fictitious – ‘hallucinations’ invented by the AI.

Hallucinations are the most prominent problem with current AI models, but they’re not the only one. Just as concerning is that LLMs can easily be steered – deliberately or by accident – into generating wildly inappropriate responses. One notorious incident proved deeply embarrassing for Microsoft: in 2016, its AI chatbot ‘Tay’ had to be taken offline within 24 hours after being coaxed into producing racist, sexist and antisemitic tweets.

TOO EAGER TO BE HELPFUL

Tay was much simpler than current AI models, but the problem remains – with the right sort of prompt, it’s possible to get an offensive, or even potentially harmful, response from an AI.

The problem comes about firstly because these AIs are designed to be helpful. When you present them with a ‘prompt’, they compute the outcome that seems like the best possible response. For the most part, this is exactly what we want. But the neural networks that underpin LLMs are designed to be helpful in response to
