GadgetBond


Perplexity, OpenAI, and Anthropic under fire for ignoring robots.txt

Is robots.txt enough?

By Shubham Sawarkar, Editor-in-Chief
Jun 23, 2024, 8:31 AM EDT
A futuristic artificial intelligence (AI) robot illustration in a cosmic setting, representing the fusion of technology and space, emphasizing cybersecurity and innovation.
Illustration by Moty Weiss / Dribbble

The world of artificial intelligence (AI) is booming, with new companies and applications emerging at an astonishing rate. But behind the scenes of this exciting progress, a troubling trend is taking root. Several AI companies, including Perplexity, have been accused of scraping content from websites – essentially copying and pasting information – even when those websites explicitly tell them not to.

This blatant disregard for boundaries is raising concerns about ethics and ownership in the digital age.

The crux of the issue lies in a protocol called robots.txt. Introduced in 1994, robots.txt is a plain-text file of instructions for web crawlers, the automated programs that search engines and other services use to fetch and index web content. Websites can use robots.txt to tell crawlers which pages are off-limits. While compliance with robots.txt is voluntary, it’s a well-respected norm within the web development community.
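
To make the mechanism concrete, here is a minimal sketch of how a well-behaved crawler could consult robots.txt before fetching a page, using Python's standard `urllib.robotparser` module. The rules, the `ExampleAIBot` user agent, and the `example.com` URLs are hypothetical and chosen only for illustration; they are not taken from any site or company mentioned in this article.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt content, for illustration only.
EXAMPLE_ROBOTS_TXT = """\
User-agent: ExampleAIBot
Disallow: /

User-agent: *
Disallow: /private/
"""


def build_parser(robots_txt: str) -> RobotFileParser:
    """Parse robots.txt text into a rule set (no network access needed)."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser


def is_allowed(parser: RobotFileParser, user_agent: str, url: str) -> bool:
    """Return True if the rules permit this user agent to fetch the URL."""
    return parser.can_fetch(user_agent, url)


parser = build_parser(EXAMPLE_ROBOTS_TXT)

# ExampleAIBot (a made-up crawler name) is blocked from the whole site.
print(is_allowed(parser, "ExampleAIBot", "https://example.com/articles/1"))

# Any other crawler falls under the "*" rules: articles are allowed,
# but everything under /private/ is off-limits.
print(is_allowed(parser, "OtherBot", "https://example.com/articles/1"))
print(is_allowed(parser, "OtherBot", "https://example.com/private/page"))
```

Nothing in this check is enforced by the website. It only has an effect if the crawler chooses to run it and honor the result, which is why compliance is described as voluntary.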

Here’s where things get messy. Perplexity, a company offering a free AI search engine, has been accused of scraping content from Forbes, Wired, and The Shortcut, even though these websites clearly marked “no scraping” zones in their robots.txt files. This raises a big question: why would Perplexity, or any AI company for that matter, risk damaging its reputation by blatantly ignoring these protocols?

The answer lies in the data itself. Websites are treasure troves of information, and for AI companies, this information is the fuel that drives their technology. Text and data scraped from websites are used to train AI models, making them better at tasks like generating text, translating languages, or answering questions.

However, scraping copyrighted content without permission is not only unethical, but it can also have legal ramifications. In the case of Perplexity, its AI tool was caught generating content that closely resembled scraped articles, with minimal attribution and sometimes even factual inaccuracies. This raises serious concerns about the quality and reliability of AI-generated information.

The plot thickens further with a revelation from TollBit, a startup that connects publishers with AI firms. According to TollBit, big names in the AI industry, like OpenAI (creator of ChatGPT) and Anthropic (creator of Claude), have also been bypassing robots.txt restrictions. These companies previously claimed to respect “do not crawl” instructions, making their actions even more hypocritical.

Perplexity’s CEO, Aravind Srinivas, attempted to defend his company’s actions by downplaying the importance of robots.txt. He argued that it’s not a legal framework and suggested a need for a “new kind of relationship” between publishers and AI companies. This line of reasoning is concerning, as it suggests a disregard for established norms and a desire to operate in a grey area.

The larger concern here is the potential erosion of trust between content creators and AI companies. If AI companies can’t be held accountable for respecting basic boundaries like robots.txt, it creates an environment where content creators are constantly at risk of having their work stolen and repurposed without proper credit or compensation.

The future of AI is undoubtedly bright, but it needs to be built on a foundation of ethical practices and respect for intellectual property. As AI technology continues to evolve, it’s crucial to establish clear guidelines and regulations that ensure responsible data collection and utilization. This will not only protect the rights of content creators but also foster a more sustainable and trustworthy environment for AI development.
