Close Menu
  • Home
  • Aerospace & Defense
    • Automation & Process Control
      • Automotive & Transportation
  • Banking & Finance
    • Chemicals & Materials
    • Consumer Goods & Services
  • Economy
    • Electronics & Semiconductor
  • Energy & Resources
    • Food & Beverage
    • Hospitality & Tourism
    • Information Technology
  • Agriculture
What's Hot

Baidu in China deploys Robotaxis on Lideshare app Lyft

Yum Brands (Yum) Q2 2025 Revenue

Milei vetoes pension, disability spending increases as Argentina feels cuts | Business and Economy News

Facebook X (Twitter) Instagram
USA Business Watch – Insightful News on Economy, Finance, Politics & Industry
  • Home
  • Aerospace & Defense
    • Automation & Process Control
      • Automotive & Transportation
  • Banking & Finance
    • Chemicals & Materials
    • Consumer Goods & Services
  • Economy
    • Electronics & Semiconductor
  • Energy & Resources
    • Food & Beverage
    • Hospitality & Tourism
    • Information Technology
  • Agriculture
  • Home
  • About Us
  • Advertise With Us
  • Contact us
  • DMCA
  • Privacy Policy
  • Terms & Conditions
USA Business Watch – Insightful News on Economy, Finance, Politics & Industry
Home » Confusing accused of scraping websites that explicitly blocked AI scraping
Information Technology

Confusing accused of scraping websites that explicitly blocked AI scraping

ThefuturedatainsightsBy ThefuturedatainsightsAugust 5, 2025No Comments3 Mins Read
Share Facebook Twitter Pinterest Copy Link Telegram LinkedIn Tumblr Email
Share
Facebook Twitter LinkedIn Pinterest Email


According to the Internet Infrastructure Provider CloudFlare, AI startups’ stumps are raw scraping content from websites that explicitly indicate they don’t want to be scraped away.

On Monday, CloudFlare published a survey that found that AI startups ignored blocks and observing their raw or scraping activities. The Network Infrastructure giant has accused them of obscuring their identity when trying to scrape web pages “to avoid website preferences,” CloudFlare researchers wrote.

AI products like Prplexity offer rely on gobbling large amounts of data from the Internet, and AI startups have repeatedly scraped text, images and videos from the Internet without permission to make the product work. Recently, the website has tried to fight back using the Web Standard Robots.txt file. It tries to tell search engines and AI companies whether they can index their efforts that they have seen a wide range of results.

According to CloudFlare, it appears they are willing to bypass these blocks by changing the “user agent” of the bot.

“This activity was observed across tens of thousands of domains and millions of requests per day. We were able to fingerprint this crawler using a combination of machine learning and network signals,” read CloudFlare’s post.

Perplexity spokesman Jesse Dwyer dismissed the CloudFlare blog post as “sales pitch” and added an email to TechCrunch that said it “indicates that the content was not accessed.” In a follow-up email, Dwyer insisted on the CloudFlare blog a bot named “Not us.”

CloudFlare said the action was first noticed after customers complained that they were baffled and raw and rubbed the site in distress, especially to block known bots in Prplexity. CloudFlare then ran tests to check and confirmed that the confusion was avoiding these blocks.

TechCrunch Events

San Francisco
|
October 27th-29th, 2025

“Perplexity observed that it uses not only declared user agents, but also a common browser that impersonates Google Chrome on MacOS when declared crawlers are blocked,” CloudFlare said.

The company also said it has created Perplexity bots from its verified list and added new techniques to block them.

CloudFlare has recently taken a public stance against AI Crawlers. Last month, CloudFlare announced the launch of a market that will allow website owners and publishers to claim AI scrapers to visit their sites. CloudFlare CEO Matthew Prince sounded the alarm at the time, saying that AI was breaking the internet, particularly the publisher’s business model. Last year, CloudFlare launched a free tool to prevent bots from shaking websites to train AI.

This is not the first time that confusion has been accused of rubbing without permission.

Last year, news outlets such as wired claim that confusion was plagiarizing their content. A few weeks later, Perplexity CEO Aravind Srinivas was unable to answer immediately when asked to provide a definition of plagiarism in an interview with Devin Coldewey of The TechCrunch at The Disrupt 2024 Conference.



Source link

Follow on Google News Follow on Flipboard
Share. Facebook Twitter Pinterest LinkedIn Tumblr Email Copy Link
Previous ArticleJeh Aerospace nets $11 million to expand the supply chain of commercial aircraft in India
Next Article Hybrid chips allow bidirectional conversion between Terra Hearts and optical signals for ultra-high speed communication
Thefuturedatainsights
  • Website

Related Posts

AI-powered Fintech Alaan raised $48 million, making it one of MENA’s biggest Series A rounds

August 5, 2025

Ju’s Rules Meta violates California’s privacy laws by quietly collecting flow users’ menstrual health data.

August 5, 2025

Elon Musk says he’s reclaiming Vine’s archives

August 5, 2025
Leave A Reply Cancel Reply

Latest Posts

“Heroic” farmers and workers praised the battle of Storm Floris

Defra strengthens rules to tackle farm pollution after legal pressure

New Fungicide Data Drive Set to Protect UK Crops

Emergency warnings for potato growers on nematic residues

Latest Posts

Alaska Airlines launches flights in London and Iceland, debuting new colouring

August 5, 2025

Boeing defense workers take a strike after rejecting contract

August 4, 2025

Palantir lands $10 billion in Army software and data contracts

August 1, 2025

Subscribe to News

Subscribe to our newsletter and never miss our latest news

Subscribe my Newsletter for New Posts & tips Let's stay updated!

Recent Posts

  • Baidu in China deploys Robotaxis on Lideshare app Lyft
  • Yum Brands (Yum) Q2 2025 Revenue
  • Milei vetoes pension, disability spending increases as Argentina feels cuts | Business and Economy News
  • Preola appointed Anise Hanna as president
  • Improved sliding US rig count efficiency, threat of land oil output – Energy news, top headlines, comments, features, events

Recent Comments

No comments to show.

Welcome to USA Business Watch – your trusted source for real-time insights, in-depth analysis, and industry trends across the American and global business landscape.

At USABusinessWatch.com, we aim to inform decision-makers, professionals, entrepreneurs, and curious minds with credible news and expert commentary across key sectors that shape the economy and society.

Facebook X (Twitter) Instagram Pinterest YouTube

Subscribe to Updates

Subscribe to our newsletter and never miss our latest news

Subscribe my Newsletter for New Posts & tips Let's stay updated!

Archives

  • August 2025
  • July 2025
  • June 2025
  • March 2022
  • January 2021

Categories

  • Aerospace & Defense
  • Agriculture
  • Automation & Process Control
  • Automotive & Transportation
  • Banking & Finance
  • Chemicals & Materials
  • Consumer Goods & Services
  • Economy
  • Economy
  • Electronics & Semiconductor
  • Energy & Resources
  • Food & Beverage
  • Hospitality & Tourism
  • Information Technology
  • Political
Facebook X (Twitter) Instagram Pinterest
  • Home
  • About Us
  • Advertise With Us
  • Contact us
  • DMCA
  • Privacy Policy
  • Terms & Conditions
© 2025 usabusinesswatch. Designed by usabusinesswatch.

Type above and press Enter to search. Press Esc to cancel.