Poging GOUD - Vrij
HOW TO TEACH AI RIGHT FROM WRONG
BBC Science Focus
|February 2026
If we want to get good responses from AI, we may need to see what it does when we ask it to be evil
Today’s AI tools are strange beasts.
On the one hand, they have truly remarkable capabilities. You can ask Large Language Models (LLMs) like ChatGPT or Google’s Gemini about quantum mechanics or the collapse of the Roman Empire and they’ll respond fluently and confidently.
But LLMs can also seem wilfully stupid. For one thing, they get a lot wrong. Ask for a list of key references on quantum mechanics and it’s quite possible that some of the references they produce will be entirely fictitious – ‘hallucinations’ invented by the AI.
Hallucinations are the most prominent of problems with current AI models, but they're not the only one. Just as concerning is that LLMs can easily be steered – deliberately or by accident – into generating wildly inappropriate responses. One notorious incident proved deeply embarrassing for Microsoft, when in 2016 its AI chatbot ‘Tay’ had to be taken offline within 24 hours after being coaxed into producing racist, sexist and antisemitic tweets.
TOO EAGER TO BE HELPFUL
Tay was much simpler than current AI models, but the problem remains – with the right sort of prompt, it’s possible to get an offensive, or even potentially harmful, response from an AI.
The problem comes about firstly because these AIs are designed to be helpful. When you present them with a ‘prompt’, they compute the outcome that seems like the best possible response. For the most part, this is exactly what we want. But the neural networks that underpin LLMs are designed to be helpful in response to
Dit verhaal komt uit de February 2026-editie van BBC Science Focus.
Abonneer u op Magzter GOLD voor toegang tot duizenden zorgvuldig samengestelde premiumverhalen en meer dan 9000 tijdschriften en kranten.
Bent u al abonnee? Aanmelden
MEER VERHALEN VAN BBC Science Focus
BBC Science Focus
DOES MY DOG HAVE ADHD?
Officially, Attention-Deficit Hyperactivity Disorder (ADHD) is a human condition. People are diagnosed with it. Dogs are not. Yet many of its core features, including hyperactivity, impulsivity and distractibility, can be found in dogs.
1 min
March 2026
BBC Science Focus
DOES MY BRAIN LIVE A LITTLE IN THE PAST?
Yes, your brain does live a little in the past. It can't help it. The information it receives via your senses is always a little out of date. Whether it's light entering the retinas in your eyes, or sounds vibrating the hairs in your ears, it not only takes time for the data to arrive, but your brain then has to process it.
2 mins
March 2026
BBC Science Focus
ASTRONOMY FOR BEGINNERS
RETURN OF THE EVENING STAR (VENUS)
1 mins
March 2026
BBC Science Focus
CAN YOU STOP YOUR SENSE OF TASTE DULLING AS YOU AGE?
Sometimes I hear people say that food just doesn't taste the same as they get older. It's tempting to blame this on age, but there are other factors at play, too.
1 mins
March 2026
BBC Science Focus
MICROBIOMES OF THE SUPERAGERS
BY STUDYING THE INCREASING NUMBER OF PEOPLE WHO ARE LIVING BEYOND THEIR 100TH BIRTHDAYS, SCIENTISTS ARE DISCOVERING THAT THE SECRET TO REACHING A RIPE OLD AGE IN RUDE HEALTH MIGHT LIE IN OUR GUTS
8 mins
March 2026
BBC Science Focus
HOW BIG WERE MEDIEVAL WAR HORSES?
You might picture knights charging into battle on towering steeds, but medieval horses were typically no bigger than modern-day ponies.
1 min
March 2026
BBC Science Focus
FORCES OF HABIT
Could new research on setting up healthy habits resuscitate those stuttering New Year resolutions?
3 mins
March 2026
BBC Science Focus
5 DANGERS HIDING IN YOUR PROCESSED FOOD
We all know that ultra-processed foods are bad for us, but what ingredients should we particularly try to avoid? And what are they doing to our bodies?
9 mins
March 2026
BBC Science Focus
Mosquitoes are becoming thirstier for human blood
Habitat loss may be pushing mosquitoes towards human hosts with deadly consequences
1 mins
March 2026
BBC Science Focus
HOW CAN I GET OVER MY EX?
Relationship breakups can be brutal, just look at the popularity of songs like 'Someone Like You' by Adele, or all the covers of 'Cry Me a River' by Julie London.
1 mins
March 2026
Listen
Translate
Change font size
