Who owns the data used to train AI?
PC Pro|September 2023
Elon Musk says he owns it. Twitter's Ts & Cs suggest otherwise. James O'Malley investigates who really owns the data being used to train AI
-  James O'Malley
Who owns the data used to train AI?

For decades, the fields of rocket science and brain surgery have been cited as fields of endeavour that present almost unimaginable levels of complexity. Now we might want to add another tricky job to the list: managing Twitter.

Since Elon Musk dropped $44 billion and took control of Twitter at the end of last year, it hasn't gone well. The CEO who, let's not forget, is heavily invested in both rocket and neural science - has seen the value of the social network plummet. One study found that more than half of Twitter's top 1,000 advertisers have given up on the platform since his takeover.

The stress is starting to show. When Microsoft announced that it would be pulling advertising from the platform, reportedly because it refused to pay hiked API-access fees, Musk responded with a tweeted threat: "They trained illegally using Twitter data. Lawsuit time."

His argument is that Al models such as the ones created by Microsoft and its partner OpenAI, the firm behind ChatGPT, were getting a free ride on Twitter's data. Large language models (LLMs) that power AI tools such as ChatGPT have been "trained" on text taken from across the internet. This could conceivably have included data from Twitter.

Now Musk wants his pound of flesh. But who really owns data once it's out on the internet? Does Musk have any right to lay claim to it? The answer, you'll be shocked to hear, is complicated.

Scrapes of wrath

"There are so many variables that help to answer whether a specific scraping act is legal or illegal," said Denas Grybauskas, head of legal at web intelligence collection firm Oxylabs.

His company specialises in writing scrapers - software and tools that automate the work of downloading the contents of a website or individual web page, then extracting and organising the data. It's the equivalent of saving a web page on your computer, but automated and performed at mass scale.

この蚘事は PC Pro の September 2023 版に掲茉されおいたす。

7 日間の Magzter GOLD 無料トラむアルを開始しお、䜕千もの厳遞されたプレミアム ストヌリヌ、8,500 以䞊の雑誌や新聞にアクセスしおください。

この蚘事は PC Pro の September 2023 版に掲茉されおいたす。

7 日間の Magzter GOLD 無料トラむアルを開始しお、䜕千もの厳遞されたプレミアム ストヌリヌ、8,500 以䞊の雑誌や新聞にアクセスしおください。

PC PROのその他の蚘事すべお衚瀺
Robobutlers may never happen, but robot care workers are on their way
PC Pro

Robobutlers may never happen, but robot care workers are on their way

Do you hate loading the dishwasher enough to pay someone to do it remotely? Nicole Kobie wonders about the weird future of home robots

time-read
9 分  |
Summer 2023
Technical debt
PC Pro

Technical debt

Cutting corners now means more work down the road - but Steve Cassidy asks whether that's always a bad thing

time-read
3 分  |
Summer 2023
Zyxel ZyWALL ATP500
PC Pro

Zyxel ZyWALL ATP500

Zyxel delivers tough gateway security and advanced threat protection at a very appealing price

time-read
3 分  |
Summer 2023
CREATIVE WORKSTATIONS
PC Pro

CREATIVE WORKSTATIONS

Intel and AMD both offer compelling CPU choices for workstations, giving us ten machines with the widest variety of specifications we've seen for years

time-read
3 分  |
Summer 2023
ANDROID PHONES FROM £219
PC Pro

ANDROID PHONES FROM £219

As this roundup of four affordable contenders shows, there's no need to spend a fortune on a phone

time-read
4 分  |
Summer 2023
Amazon Echo Pop
PC Pro

Amazon Echo Pop

If you want a compact Alexa smart speaker, the Pop is now the cheapest choice - but what does it really add?

time-read
2 分  |
Summer 2023
Getac X600
PC Pro

Getac X600

A powerful alternative to the Panasonic Toughbook 40, with the bonus of optional Nvidia graphics

time-read
3 分  |
Summer 2023
Amazon Fire Max 11
PC Pro

Amazon Fire Max 11

With its 2K screen and sleek design, this is Amazon's best tablet yet-but FireOS remains a hindrance

time-read
3 分  |
Summer 2023
Google Pixel Fold
PC Pro

Google Pixel Fold

The Pixel Fold delivers with a thin and durable design, a wide front display, smart software and great cameras

time-read
7 分  |
Summer 2023
Welcome to the Fediverse
PC Pro

Welcome to the Fediverse

Have commercial social networks had their day? Darien Graham-Smith looks at the free, community-run apps that could usurp Twitter, Reddit and the Meta empire

time-read
9 分  |
Summer 2023