A recent survey has found that hundreds of websites trying to block content scraping by AI company Anthropic are inadvertently blocking the wrong bots because of outdated directives. The finding highlights how difficult it is for website owners to keep pace with a constantly evolving ecosystem of AI web crawlers.
According to the anonymous operator of the crawler-tracking site Dark Visitors, many websites are still blocking two agents that Anthropic no longer uses, "anthropic-ai" and "claude-web," while unknowingly letting the company's actual current crawler, "ClaudeBot," through. The problem arises largely because website owners copy and paste outdated directives into their robots.txt files while AI companies keep introducing crawlers under new names.
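For illustration, here is a minimal robots.txt sketch of the mismatch described above. The user-agent strings are the ones named in the report; the file layout itself is hypothetical. The first two rules target agents Anthropic has retired, so on their own they accomplish nothing, while the final rule is the one that actually opts out of the current crawler.

```
# Obsolete agents copied from older block lists; Anthropic no longer crawls under these names
User-agent: anthropic-ai
Disallow: /

User-agent: claude-web
Disallow: /

# The rule many sites are missing: Anthropic's current crawler
User-agent: ClaudeBot
Disallow: /
```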
The confusion is not limited to Anthropic. The operator of Dark Visitors notes that tech giants such as Apple and Meta have recently added new crawler agents of their own, making it nearly impossible for website owners to keep up manually. More concerning, some AI companies have been found scraping sites they should not, or ignoring robots.txt directives outright.
This situation creates problems in both directions. Some websites respond by blocking all crawlers, or allowing only a handful of specific ones, which can interfere with search engine indexing, internet archiving, and academic research. Others face technical and financial strain from heavy AI crawler traffic: the repair guide site iFixit reported that Anthropic's crawler hit its site nearly a million times in a single day, and another service provider, Read the Docs, said a crawler pulled 10 TB of files in a single day, driving up its bandwidth costs.
A study by the Data Provenance Initiative further documents the widespread confusion content creators and website owners face when trying to keep their work out of AI training data. It notes that the burden of blocking AI scrapers falls entirely on website owners, and that the growing, frequently changing roster of crawlers makes the task extremely difficult.
Faced with this complex situation, experts advise website administrators to preemptively block suspicious AI crawlers, even at the risk of listing agents that do not exist. Some also predict that more creators will move their content behind paywalls to prevent unrestricted scraping.