WSJ Tech News Briefing - The New AI Data Trade, Part 1: Cashing In on AI

Generative AI models such as OpenAI’s ChatGPT and Google’s Gemini need data, and the content creators supplying that data want to get paid. This is the first episode of “The New AI Data Trade,” a special two-part series on how data makes its way from a publisher or creator into an AI model, and the conflicts that have arisen along the way. In this first episode, we explore how publishers have grown concerned over web scraping. That concern has led to lawsuits, with publishers such as Reddit, the New York Times and News Corp’s Dow Jones suing to protect their data. Meanwhile, companies like Cloudflare are making it harder for AI companies to access data from publishers for free. This has opened the door for data-usage deals through startups such as Troveo. Coleman Standifer hosts.




Further Reading


Reddit Sues Anthropic, Alleges Unauthorized Use of Site’s Data 

The AI Scraping Fight That Could Change the Future of the Web 

Amazon to Pay New York Times at Least $20 Million a Year in AI Deal 
