Do AI bots ignore robots.txt?

Some do. Tollbit data shows 13.26% of AI bot requests ignored robots.txt directives in Q2 2025, up from 3.3% in Q4 2024. Major AI companies publicly commit to respecting robots.txt, but not all AI crawlers follow the rules.

What can I do if AI bots ignore my robots.txt?

Use server-level blocking via .htaccess or firewall rules. Implement rate limiting, use CDN-level bot management, and add HTTP headers like X-Robots-Tag for additional control.

Is robots.txt legally enforceable?

The EU Copyright Directive recognizes robots.txt as a valid machine-readable opt-out. Ignoring a robots.txt block could strengthen a copyright infringement claim, though enforcement varies by jurisdiction.

13% of AI Bots Now Ignore robots.txt: What Business Owners Should Know | Blog

Robots.txt has been the internet voluntary content access agreement since 1994. For 30 years, it has worked because crawlers respected the rules. Now, Tollbit data shows that 13.26% of AI bot requests ignore robots.txt directives - up from 3.3% in Q4 2024.

The Numbers

13.26% of AI bot requests ignored robots.txt in Q2 2025
3.3% ignored robots.txt in Q4 2024
That is a 4x increase in non-compliance in just two quarters

Why This Is Happening

New AI browsers: Perplexity Comet, Firecrawl, and Browserless are "indistinguishable from humans in site logs" according to Tollbit
Training demand: The demand for content creates incentives to access content regardless of restrictions
Bot fragmentation: New AI companies may not implement robots.txt compliance from the start

What Still Works

The major AI companies (OpenAI, Anthropic, Google) publicly commit to respecting robots.txt. For businesses that need stronger protection:

Server-level blocking (.htaccess): Block specific user agents at the web server level
Firewall/CDN blocking: Cloudflare, Sucuri, Wordfence can block at the infrastructure level
Rate limiting: Throttle suspicious crawl patterns
HTTP headers: X-Robots-Tag: noai for additional signals

Legal Developments

The EU Copyright Directive now recognizes robots.txt as a valid machine-readable opt-out mechanism. This legal framework is still evolving, but the trend is toward stronger enforcement.

AppWT Approach

We implement multi-layered AI bot management: strategic robots.txt, server-level .htaccess blocking, Imunify360 security, and regular monitoring. The goal is maximum AI visibility for business discovery while protecting against unwanted scraping. Learn about our cybersecurity services.

Tony Paris

Founder and Tech Wizard at AppWT Web & AI Solutions. With over 29 years of experience in web development, Tony helps businesses succeed online through custom websites, SEO, and AI integration.

Learn more about Tony

Enjoyed this article?

Share it with your network

13% of AI Bots Now Ignore robots.txt: What Business Owners Should Know

The Numbers

Why This Is Happening

What Still Works

Legal Developments

AppWT Approach

Tags

Tony Paris

Enjoyed this article?

Ready to Get Started?

Accessibility

13% of AI Bots Now Ignore robots.txt: What Business Owners Should Know

Schedule a FREE Consultation

The Numbers

Why This Is Happening

What Still Works

Legal Developments

AppWT Approach

Tags

Tony Paris

Enjoyed this article?

Ready to Get Started?

Share This Article

Accessibility

Accessibility Statement

Our Commitment to Accessibility

Conformance Status

Accessibility Features

Technical Specifications

Feedback

Assessment Approach

Date

Start a Project Inquiry

Inquiry Submitted!

Free Online Consultation

Your Information

Tell Us About Your Project

Select Your Preferred Time

Available Times for

Request Submitted!