Reddit sues Perplexity for scraping of posts, expanding user data battle with AI industry

Reddit sues Perplexity for scraping of posts, expanding user data battle with AI industry

In a significant move, Reddit has filed a lawsuit against the AI company Perplexity, claiming it unlawfully harvested user posts to enhance its AI model. This legal action, initiated in a New York federal court, highlights the intensifying conflict between content creators and the artificial intelligence sector. The lawsuit also implicates three other entities, which Reddit accuses of facilitating Perplexity's data acquisition. These include the Lithuanian data-scraping firm Oxylabs, the former Russian botnet AWMProxy, and Texas-based startup SerpApi. Reddit alleges that these parties employed deceptive tactics, such as concealing their identities and locations, to extract copyrighted material from its platform. Perplexity, known for its AI-driven search engine, has refuted the claims, labeling them as extortionate and asserting that Reddit is resistant to an open internet. SerpApi echoed this sentiment, expressing strong disagreement with Reddit’s allegations and confirming its intention to mount a defense in court. This lawsuit is part of a broader trend where content owners are pursuing legal action against AI firms for utilizing copyrighted content without consent to train their large language models. Reddit has been actively involved in this legal landscape, having previously initiated a similar lawsuit against the AI startup Anthropic earlier this year. Reddit's Chief Legal Officer, Ben Lee, commented on the situation, stating that AI companies are engaged in a fierce competition for quality human-generated content. He described this environment as fostering an 'industrial-scale data laundering economy,' where scrapers circumvent protections to illegally procure data, later selling it to clients in need of training resources. With over 100,000 active subreddit communities, Reddit's extensive repository of moderated discussions has become a prominent source for AI-generated content. The platform remarked that its user posts have increasingly been referenced in AI-generated responses on Perplexity. Following a cease-and-desist letter sent to Perplexity, Reddit claims there was a dramatic increase in citations of its content. In response to the lawsuit, Perplexity contended that it does not utilize Reddit's content for training AI models but simply summarizes and references public discussions on the platform. They argue that this stance renders licensing agreements unnecessary. A representative from Perplexity criticized Reddit’s demands for payment, asserting that their operations comply with legal standards. The ongoing legal battle underscores the growing significance of data licensing in the business strategies of social media companies. In a recent statement, Reddit’s COO revealed that AI licensing deals with major players like Google and OpenAI account for a substantial portion of the platform's revenue, emphasizing the financial stakes involved in this contentious issue.

Sources : CNBC

Published On : Oct 23, 2025, 04:55

Streaming
Amazon Ups the Ante on Prime Video: New Pricing and Features Unveiled

Beginning April 10, Amazon Prime members will see an increase in the cost of ad-free Prime Video, escalating from $3 to ...

Ars Technica | Mar 13, 2026, 17:20
Amazon Ups the Ante on Prime Video: New Pricing and Features Unveiled
Computing
Nvidia Set to Transform AI Landscape with New CPU Innovations at GTC

Nvidia, a leader in graphics processing units (GPUs), is gearing up for a significant revelation at its annual GTC confe...

CNBC | Mar 13, 2026, 19:35
Nvidia Set to Transform AI Landscape with New CPU Innovations at GTC
Cybersecurity
New Wave of Supply-Chain Attacks: Invisible Code Targets GitHub and More

Cybersecurity experts have uncovered a sophisticated supply-chain attack that is inundating code repositories, including...

Ars Technica | Mar 13, 2026, 20:25
New Wave of Supply-Chain Attacks: Invisible Code Targets GitHub and More
Startups
Google Fiber Joins Forces with Astound Broadband Under New Ownership

GFiber, previously known as Google Fiber, is set to undergo a significant transformation as it is acquired by the privat...

Ars Technica | Mar 13, 2026, 21:05
Google Fiber Joins Forces with Astound Broadband Under New Ownership
Automotive
Revolutionizing Electric Vehicles: The Impact of 800V Architecture

For years, the majority of electric vehicles (EVs) have relied on a standard battery pack operating at approximately 400...

Ars Technica | Mar 13, 2026, 18:35
Revolutionizing Electric Vehicles: The Impact of 800V Architecture
View All News