By using this site, you agree to the Privacy Policy and Terms of Use.
Accept

GadgetBond

  • Latest
  • How-to
  • Tech
    • AI
    • Amazon
    • Apple
    • CES
    • Computing
    • Creators
    • Google
    • Meta
    • Microsoft
    • Mobile
    • Samsung
    • Security
    • Xbox
  • Transportation
    • Audi
    • BMW
    • Cadillac
    • E-Bike
    • Ferrari
    • Ford
    • Honda Prelude
    • Lamborghini
    • McLaren W1
    • Mercedes
    • Porsche
    • Rivian
    • Tesla
  • Culture
    • Apple TV
    • Disney
    • Gaming
    • Hulu
    • Marvel
    • HBO Max
    • Netflix
    • Paramount
    • SHOWTIME
    • Star Wars
    • Streaming
Add GadgetBond as a preferred source to see more of our stories on Google.
Font ResizerAa
GadgetBondGadgetBond
  • Latest
  • Tech
  • AI
  • Deals
  • How-to
  • Apps
  • Mobile
  • Gaming
  • Streaming
  • Transportation
Search
  • Latest
  • Deals
  • How-to
  • Tech
    • Amazon
    • Apple
    • CES
    • Computing
    • Creators
    • Google
    • Meta
    • Microsoft
    • Mobile
    • Samsung
    • Security
    • Xbox
  • AI
    • Anthropic
    • ChatGPT
    • ChatGPT Atlas
    • Gemini AI (formerly Bard)
    • Google DeepMind
    • Grok AI
    • Meta AI
    • Microsoft Copilot
    • OpenAI
    • Perplexity
    • xAI
  • Transportation
    • Audi
    • BMW
    • Cadillac
    • E-Bike
    • Ferrari
    • Ford
    • Honda Prelude
    • Lamborghini
    • McLaren W1
    • Mercedes
    • Porsche
    • Rivian
    • Tesla
  • Culture
    • Apple TV
    • Disney
    • Gaming
    • Hulu
    • Marvel
    • HBO Max
    • Netflix
    • Paramount
    • SHOWTIME
    • Star Wars
    • Streaming
Follow US
AIAmazonPerplexitySecurityTech

Perplexity AI claims to follow robots.txt, but Amazon investigates

Perplexity AI faces AWS investigation over web scraping allegations. Company claims compliance while admitting to URL-specific protocol exceptions.

By
Shubham Sawarkar
Shubham Sawarkar's avatar
ByShubham Sawarkar
Editor-in-Chief
I’m a tech enthusiast who loves exploring gadgets, trends, and innovations. With certifications in CISCO Routing & Switching and Windows Server Administration, I bring a sharp...
Follow:
- Editor-in-Chief
Jun 28, 2024, 12:45 PM EDT
Share
We may get a commission from retail offers. Learn more
Close up of AWS sign at their offices in SOMA district; Amazon Web Services (AWS) is a subsidiary of Amazon.
Photo: Alamy
SHARE

Tech giant Amazon is looking into its cloud division, Amazon Web Services (AWS), after accusations surfaced that a customer, Perplexity AI, might be scraping content from websites without their permission. This investigation centers around a specific practice: ignoring a common web standard known as the Robots Exclusion Protocol (robots.txt).

What is robots.txt and why does it matter?

Imagine your website as your house. Robots.txt acts like a sign on your door. It tells automated programs, or “bots,” which areas of your website they are allowed to visit and which ones are off-limits. While respecting robots.txt isn’t mandatory, it’s generally been a well-understood courtesy since the 1990s.

Related /

  • Perplexity, OpenAI, and Anthropic under fire for ignoring robots.txt

Wired discovers a suspicious crawler

Tech publication Wired reported uncovering a virtual machine, essentially a powerful computer program, that was bypassing a website’s robots.txt instructions. This machine, hosted on an AWS server with an IP address (44.221.181.252) linked to Perplexity AI, reportedly visited several prominent news websites hundreds of times in the last three months.

How did they know it was Perplexity AI?

Wired conducted a test. They entered headlines or short descriptions from the websites in question into Perplexity’s AI chatbot. The chatbot then responded with information that closely resembled the articles, with little to no attribution given to the original source. This suggested Perplexity might be using the scraped content to power its AI.

Is Perplexity the only culprit?

While Amazon’s investigation focuses on Perplexity AI, a recent Reuters report suggests this practice of ignoring robots.txt might be more widespread among AI companies looking to train their large language models.

What does Amazon say?

Amazon is clear: its customers must comply with robots.txt instructions. Their terms of service strictly prohibit illegal activity, and that includes respecting website owners’ wishes regarding how their content is accessed.

Perplexity AI denies wrongdoing, with a caveat

Perplexity maintains they follow robots.txt guidelines. Their spokesperson claims their chatbot respects the protocol, and their services comply with Amazon’s terms of service. However, they admit to an exception: if a user specifically includes a URL in their chatbot query, the robots.txt instructions might be bypassed in that instance.

Perplexity CEO previously denied accusations

Aravind Srinivas, CEO of Perplexity AI, has previously refuted claims that his company disregards robots.txt and then tries to cover it up. He acknowledges using third-party web crawlers alongside their own, and admits the bot identified by Wired belonged to one of these external services.

The investigation by Amazon is ongoing. Whether Perplexity AI will face any consequences for its alleged actions remains to be seen.


Discover more from GadgetBond

Subscribe to get the latest posts sent to your email.

Topic:AWS
Most Popular

The $19 Apple polishing cloth supports iPhone 17, Air, Pro, and 17e

Apple MacBook Neo: big power, surprising price, one clear target — Windows

Everything Nothing announced on March 5: Headphone (a), Phone (4a), and Phone (4a) Pro

MacBook Neo and external monitors: it’s complicated

OpenAI’s GPT-5.4 is coming — and it’s sooner than you think

Also Read
A simple illustration shows a large black computer mouse cursor pointing toward a white central hub with five connected nodes on an orange background.

Claude Marketplace lets you use one AI commitment across multiple tools

Perplexity Computer promotional banner featuring a glowing glass orb with a laptop icon floating above a field of wildflowers against a gray background, with the text "perplexity computer works" in the center and a vertical list of action words — sends, creates, schedules, researches, orchestrates, remembers, deploys, connects — displayed in fading gray text on the right side.

Perplexity Computer is the AI that actually does your work

99ONE Rogue 102321

99ONE Rogue wants to kill the ugly helmet comms box forever

TACT Dial 01 tactile desk instrument

TACT Dial 01: turn it, press it, focus — that’s literally it

Close-up of a person holding the Google Pixel 10 Pro Fold in Moonstone gray with both hands, rear-facing triple camera array and Google "G" logo prominently visible, worn against a silver knit top and blue jacket with a poolside background.

Pixel Care+ makes owning a Pixel a lot less scary — here’s why

Woman with blonde curly hair sitting outside in a lush park, holding a blue Google Pixel 10 and smiling at the screen.

Pixel 10a, Pixel 10, Pixel 10 Pro: one winner for every buyer

Google Search AI Mode showing Canvas in action, with a split-screen view of a conversational AI chat on the left and an "EE Opportunity Tracker" scholarship and grant tracking dashboard on the right, displaying a total funding secured amount of $5,000, scholarship cards with deadlines, and status labels including "To Apply" and "Awarded."

Google’s Canvas AI Mode rolls out to everyone in the U.S.

Google NotebookLM app listing on the Apple App Store displayed on an iPhone screen, showing the app icon, tagline "Understand anything," a Get button with In-App Purchases noted, 1.9K ratings, age rating 4+, and a chart ranking of No. 36 in Productivity.

NotebookLM Cinematic Video Overviews are live — here’s what’s new

Company Info
  • Homepage
  • Support my work
  • Latest stories
  • Company updates
  • GDB Recommends
  • Daily newsletters
  • About us
  • Contact us
  • Write for us
  • Editorial guidelines
Legal
  • Privacy Policy
  • Cookies Policy
  • Terms & Conditions
  • DMCA
  • Disclaimer
  • Accessibility Policy
  • Security Policy
  • Do Not Sell or Share My Personal Information
Socials
Follow US

Disclosure: We love the products we feature and hope you’ll love them too. If you purchase through a link on our site, we may receive compensation at no additional cost to you. Read our ethics statement. Please note that pricing and availability are subject to change.

Copyright © 2026 GadgetBond. All Rights Reserved. Use of this site constitutes acceptance of our Terms of Use and Privacy Policy | Do Not Sell/Share My Personal Information.