    An AI Data Trap Catches Perplexity Impersonating Google

    By Press Room, August 5, 2025

    If you want to succeed in AI, a good hack would be to impersonate Google. You just can’t get caught.

    This is what just happened to Perplexity, a startup that competes with ChatGPT, Google’s Gemini, and other generative AI services.

    Quality data is crucial for success in AI, but tech companies don’t want to pay for it, so they crawl the web and scrape information for free, often without permission. This has sparked a backlash from some content creators and others interested in preserving the incentives that built the web.

    Cloudflare and its CEO, Matthew Prince, have stormed into this battle with new features that help websites block unwanted AI bot crawlers. Cloudflare is an infrastructure, security, and software company that helps run about 20% of the internet. It thrives when the web does well, hence its interest in helping sites get paid for content.

    Some Cloudflare customers recently complained to the company that Perplexity was evading these blocks and continued to scrape and collect data without permission.

    So Cloudflare set a digital trap and caught the startup red-handed, according to a blog post published Monday describing the escapade.

    “Some supposedly ‘reputable’ AI companies act more like North Korean hackers,” Prince wrote on X on Monday. “Time to name, shame, and hard block them.”

    Perplexity didn’t respond to a request for comment. 

    The bait: Honeytrap domains and locked doors

    Cloudflare created entirely new, unpublished websites and configured them with robots.txt files that explicitly blocked all crawlers — including Perplexity’s declared bots, PerplexityBot and Perplexity-User. These test sites had no public links, search engine entries, or metadata that would normally make them discoverable.
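    To make the "locked door" concrete, here is a minimal sketch of how a crawler that honors robots.txt would read such a file. The file contents, the `trap.example` domain, and the `is_allowed` helper are illustrative assumptions, not Cloudflare's actual configuration; only the two bot names come from the reporting above.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt for a trap site: the two declared Perplexity
# bots are named explicitly, and a wildcard rule blocks everyone else.
ROBOTS_TXT = """\
User-agent: PerplexityBot
Disallow: /

User-agent: Perplexity-User
Disallow: /

User-agent: *
Disallow: /
"""


def is_allowed(robots_txt: str, user_agent: str, url: str) -> bool:
    """Return True if robots_txt permits user_agent to fetch url."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    # "ExampleBot" is a made-up crawler name that falls under the wildcard rule.
    for agent in ("PerplexityBot", "Perplexity-User", "ExampleBot"):
        print(agent, is_allowed(ROBOTS_TXT, agent, "https://trap.example/page"))
```

    Under these assumed rules every check comes back False. Note that robots.txt is only a request: nothing in the file itself stops a crawler that chooses to ignore it.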

    Yet, when Cloudflare queried Perplexity’s AI with questions about these specific sites, the startup’s service responded with detailed information that could only have come from those restricted pages. The conclusion? Perplexity had accessed the content despite being clearly told not to.

    The cloak: How Perplexity masked its crawl

    Perplexity initially crawled these sites using its official user-agent string, complying with standard protocols. However, Cloudflare said it discovered that once blocked, Perplexity resorted to stealth tactics.

    Cloudflare found that Perplexity began deploying undeclared crawlers disguised as normal web browsers, sending requests from unknown or rotating IP addresses and unofficial ASNs. An ASN, or autonomous system number, identifies a network that routes internet traffic.

    When its official crawlers were blocked, Perplexity also used a generic user agent designed to impersonate Google’s Chrome browser on Apple Mac computers. (Business Insider asked Google whether it has told Perplexity to stop impersonating Chrome. Google did not respond.)
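    A rough sketch shows why this kind of disguise is hard to catch with the two signals a declared bot normally provides, its user-agent name and its source IP range. Everything here is assumed for illustration: the IP range is a reserved documentation block, the Chrome string is a generic example rather than the one Cloudflare observed, and `classify` is a hypothetical helper, not Cloudflare's detection method.

```python
import ipaddress
from dataclasses import dataclass

# Bot names taken from the article; the network below is a placeholder
# (an RFC 5737 documentation range), not a real published range.
DECLARED_AGENTS = ("PerplexityBot", "Perplexity-User")
DECLARED_NETWORKS = [ipaddress.ip_network("192.0.2.0/24")]

# Illustrative Chrome-on-macOS user-agent string (assumed, generic).
CHROME_MAC_UA = (
    "Mozilla/5.0 (Macintosh; Intel Mac OS X 10_15_7) AppleWebKit/537.36 "
    "(KHTML, like Gecko) Chrome/124.0.0.0 Safari/537.36"
)


@dataclass
class Request:
    user_agent: str
    source_ip: str


def classify(req: Request) -> str:
    """Label a request using only its declared name and source IP."""
    ua = req.user_agent.lower()
    declares_bot = any(name.lower() in ua for name in DECLARED_AGENTS)
    ip = ipaddress.ip_address(req.source_ip)
    from_declared_range = any(ip in net for net in DECLARED_NETWORKS)

    if declares_bot and from_declared_range:
        return "declared"
    if declares_bot:
        # Claims the bot's name but arrives from outside the known range.
        return "spoofed-declaration"
    if from_declared_range:
        return "undeclared-from-known-range"
    # Browser-like user agent from an unknown address: these two signals
    # alone cannot attribute it to any crawler.
    return "unattributed"


if __name__ == "__main__":
    print(classify(Request("Mozilla/5.0 (compatible; PerplexityBot/1.0)", "192.0.2.10")))
    print(classify(Request(CHROME_MAC_UA, "198.51.100.7")))
```

    The second request, a Chrome-style user agent from an unlisted address, lands in the "unattributed" bucket. That is the gap a stealth crawler exploits, and closing it takes signals beyond the name and address a client chooses to present.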

    According to Cloudflare, Perplexity has been making millions of such “stealth” requests daily across tens of thousands of web domains.

    This behavior not only violates web standards but also betrays the fundamental trust that underpins the functioning of the open web, Cloudflare explained.

    The comparison: How OpenAI gets it right

    To emphasize what good bot behavior looks like, Cloudflare compared Perplexity’s conduct to that of OpenAI’s crawlers, which scrape data for developing ChatGPT and giant AI models such as the upcoming GPT-5.

    When OpenAI’s bots encountered a robots.txt file or a similar block, they simply backed off. No circumvention. No masking. No backdoor crawling, according to Cloudflare tests.

    The fallout: De-verification and blocking

    As a result of these findings, Cloudflare has de-listed Perplexity as a verified bot and rolled out new detection and blocking techniques across its network.

    Cloudflare’s takedown serves as a cautionary tale in the AI arms race. While the web shifts toward stronger control over data access and usage, actors who flout these evolving norms may find themselves not just blocked, but publicly called out.

    In an era where AI systems are hungry for training data, Cloudflare’s sting operation is a signal to startups and established players alike: Respect the rules of the web, or risk being exposed.

    Sign up for BI’s Tech Memo newsletter here. Reach out to me via email at abarr@businessinsider.com.
