Who owns the data used to train AI?
PC Pro | September 2023
Elon Musk says he owns it. Twitter's Ts & Cs suggest otherwise. James O'Malley investigates who really owns the data being used to train AI
For decades, rocket science and brain surgery have been cited as fields of endeavour that present almost unimaginable levels of complexity. Now we might want to add another tricky job to the list: managing Twitter.
Since Elon Musk dropped $44 billion and took control of Twitter at the end of last year, it hasn't gone well. The CEO, who, let's not forget, is heavily invested in both rocket and neural science, has seen the value of the social network plummet. One study found that more than half of Twitter's top 1,000 advertisers have given up on the platform since his takeover.
The stress is starting to show. When Microsoft announced that it would be pulling advertising from the platform, reportedly because it refused to pay hiked API-access fees, Musk responded with a tweeted threat: "They trained illegally using Twitter data. Lawsuit time."
His argument is that AI models such as the ones created by Microsoft and its partner OpenAI, the firm behind ChatGPT, were getting a free ride on Twitter's data. Large language models (LLMs) that power AI tools such as ChatGPT have been "trained" on text taken from across the internet. This could conceivably have included data from Twitter.
Now Musk wants his pound of flesh. But who really owns data once it's out on the internet? Does Musk have any right to lay claim to it? The answer, you'll be shocked to hear, is complicated.
Scrapes of wrath
"There are so many variables that help to answer whether a specific scraping act is legal or illegal," said Denas Grybauskas, head of legal at web intelligence collection firm Oxylabs.
His company specialises in writing scrapers - software and tools that automate the work of downloading the contents of a website or individual web page, then extracting and organising the data. It's the equivalent of saving a web page on your computer, but automated and performed at mass scale.
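To make the idea concrete, here is a minimal sketch of the "extracting and organising" half of a scraper, written in Python using only the standard library. It is an illustration, not a description of Oxylabs' tools: the sample page, the choice of `<h2>` headings as the target, and the names `HeadlineScraper` and `extract_headlines` are all assumptions made for this example. A real scraper would first download the page (for instance with `urllib.request.urlopen`) and would need to respect the target site's terms, which is exactly where the legal questions begin.

```python
from html.parser import HTMLParser


class HeadlineScraper(HTMLParser):
    """Collects the text inside every <h2> element of a page.

    Hypothetical example class: the <h2> target is an assumption.
    """

    def __init__(self):
        super().__init__()
        self.in_headline = False  # True while we are inside an <h2>
        self.headlines = []       # one string per <h2> found

    def handle_starttag(self, tag, attrs):
        if tag == "h2":
            self.in_headline = True
            self.headlines.append("")

    def handle_endtag(self, tag):
        if tag == "h2":
            self.in_headline = False

    def handle_data(self, data):
        # Text nodes are only kept while inside an <h2>.
        if self.in_headline:
            self.headlines[-1] += data


def extract_headlines(html):
    """Return the stripped text of each <h2> in the given HTML string."""
    parser = HeadlineScraper()
    parser.feed(html)
    parser.close()
    return [h.strip() for h in parser.headlines]


# Stand-in for a downloaded page, so the sketch runs without network access.
SAMPLE_PAGE = """
<html><body>
  <h1>Example news site</h1>
  <h2>First story</h2><p>Body text.</p>
  <h2>Second <em>story</em></h2><p>More body text.</p>
</body></html>
"""

headlines = extract_headlines(SAMPLE_PAGE)
print(headlines)
```

Run over thousands of pages in a loop, the same pattern turns a website into a structured dataset, which is what makes scraping so useful for assembling AI training data.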
This story is from the September 2023 edition of PC Pro.
