An AI data trap catches Perplexity impersonating Google

The AI startup Perplexity has been caught scraping websites that had explicitly barred it, drawing significant backlash from the tech community. Perplexity competes with giants such as OpenAI's ChatGPT and Google's Gemini, and its actions have raised questions about ethical practices in the rapidly evolving AI landscape.

The controversy began when Cloudflare, a company that plays a crucial role in internet infrastructure and security, received complaints from its clients about Perplexity's evasive data collection methods. In response, Cloudflare set a digital trap: unpublished websites configured specifically to block unwanted crawlers, including Perplexity's. Although it had been explicitly instructed not to access these sites, Perplexity's AI still returned detailed information that could only have come from the restricted pages.

Initially, Perplexity adhered to standard web protocols and identified itself with its official user-agent string. Once blocked, however, it resorted to stealth tactics, deploying disguised crawlers that impersonated generic web browsers so it could keep scraping.

Matthew Prince, Cloudflare's CEO, expressed his outrage on social media, likening Perplexity's conduct to that of hackers. He emphasized the importance of adhering to web standards and of the trust that supports the open internet. By contrast, he praised OpenAI's bots for respecting robots.txt files and for their straightforward approach to data scraping.

Following the revelations, Cloudflare removed Perplexity's status as a verified bot and implemented new detection methods to prevent further unauthorized access.

The incident is a reminder to AI startups and established companies alike of the ethical responsibilities that come with data usage at a time when access to quality data is essential for innovation. As the web adapts to stricter data access rules, companies that disregard these norms may face serious consequences, including public exposure and operational limitations.
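For readers unfamiliar with the robots.txt mechanism at the center of this dispute, the following is a minimal sketch of how a well-behaved crawler checks a site's rules before fetching a page. It uses Python's standard `urllib.robotparser` module. The robots.txt contents, the `ExampleBot` name, and the `example.com` URL are all hypothetical placeholders and are not taken from Cloudflare's test sites or from Perplexity's crawler.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt for a site that asks every crawler to stay out.
# (Assumption for illustration; not the rules Cloudflare actually used.)
ROBOTS_TXT = """\
User-agent: *
Disallow: /
"""


def may_fetch(user_agent: str, url: str, robots_txt: str = ROBOTS_TXT) -> bool:
    """Return True if the given robots.txt text permits user_agent to fetch url."""
    parser = RobotFileParser()
    # parse() takes the file's lines, so no network request is made here.
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    # A compliant crawler would skip the page when this prints False.
    print(may_fetch("ExampleBot", "https://example.com/secret-page"))
```

The check is voluntary: robots.txt only states the site owner's wishes, and nothing in the file itself stops a crawler that ignores it or announces a different user agent. That is why the behavior described above depends on trust.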

Source: Business Insider

Published on: Aug 05, 2025, 01:30
