Amazon Made 600M Products Invisible to ChatGPT. Your Cloudflare Might Be Doing the Same Thing.
Since July 1, 2025, Cloudflare blocks AI crawlers by default on every site. If you haven't touched your bot settings since then, here's the blunt version: ChatGPT can't see your products. Neither can Perplexity. Neither can Claude. You did nothing wrong and you're still invisible.
And you're not alone - sites blocking AI crawlers now outnumber those blocking Googlebot by 7 to 1 (Cloudflare). Most of them have no idea it's happening to them.
Stay in the loop
Get news and updates about GEO, AI search and new features. Unsubscribe anytime.
Check yours right now - takes 30 seconds
- Cloudflare Dashboard → select your domain
- Security → Bots
- Look for the "AI Crawlers" toggle
- If it says Block - that's the default, and your store is invisible to AI search right now
Checked it? Good. Now let me explain why this matters and exactly what to do about it.
The numbers that should worry you
| Crawler | Purpose | Crawls per referral click |
| Googlebot | Google Search | 14:1 |
| PerplexityBot | Perplexity Search | 88:1 (electronics) |
| OAI-SearchBot | ChatGPT Search | 401:1 (electronics) |
| GPTBot | AI model training | 1,700:1 |
| ClaudeBot | AI model training | 73,000:1 |
Source: Cloudflare crawl-to-referral data and industry breakdown
Look at the split. PerplexityBot at 88:1 and OAI-SearchBot at 401:1 are search crawlers - they send real customers to your store. GPTBot at 1,700:1 and ClaudeBot at 73,000:1 are training crawlers - they take your content and give you almost nothing back.
The default Cloudflare block kills both without asking. That's the mistake. You want to block one and allow the other, deliberately.
The setup I'd run for e-commerce (copy-paste this)
Allow (these send you customers):
OAI-SearchBot- ChatGPT Search & ShoppingChatGPT-User- when users ask ChatGPT to browse your sitePerplexityBot- Perplexity citationsClaudeBot- Claude web search (now on all plans)Google-Extended- Google AI Overviews & Gemini
Block (these take your content for training):
GPTBot- OpenAI model trainingCCBot- Common Crawlanthropic-ai- Anthropic model trainingBytespider- ByteDance
That keeps you visible in ChatGPT Shopping (50M queries/day), Perplexity, Claude, and AI Overviews - channels that convert 1.3x to 6x higher than non-branded organic search - while still blocking the 80% of AI crawl traffic that's pure training and returns zero referral value. You give up nothing that pays you and keep everything that does.
"But what about bandwidth costs?"
Fair question, so here's the honest answer. When Read the Docs blocked all AI crawlers, their bandwidth dropped 75% - 800 GB/day down to 200 GB/day - saving $1,500/month (Search Engine Journal).
But Read the Docs is a documentation site. They don't sell products through AI search. You do. The selective setup above blocks the bandwidth-heavy training crawlers and keeps the search crawlers that actually drive revenue - you get most of the savings without amputating the channel.
One more thing: Perplexity cheats
Block PerplexityBot in robots.txt and Perplexity may still read your content anyway. Cloudflare caught them running stealth, undeclared crawlers that rotate IPs and spoof real-browser user agents (Cloudflare investigation), and delisted them as a verified bot over it.
So if you genuinely need Perplexity out, robots.txt won't get you there. Use Cloudflare's WAF rules or AI Crawl Control - anything less is theater.
Do this right now
Step 1: Open Cloudflare → Security → Bots. Screenshot your current settings before you change anything.
Step 2: Switch from blanket block to selective - allow the search crawlers, block the training ones.
Step 3: Check back in 2 weeks. If you're on GEOlikeaPro, run an SOV audit before and after - you'll see the move land in your AI visibility scores.
See where you stand - free to run.
FAQ
Does Cloudflare block AI crawlers by default?
Yes, since July 1, 2025. All new and existing zones have AI bot blocking enabled unless you explicitly turn it off. Check: Cloudflare Dashboard → Security → Bots. <a href="https://blog.cloudflare.com/declaring-your-aindependence-block-ai-bots-scrapers-and-crawlers-with-a-single-click/" target="_blank" rel="noopener">Source</a>
Will this affect my Google rankings?
No. Googlebot is separate from AI crawlers. Blocking GPTBot or ClaudeBot has zero impact on Google organic rankings. But blocking Google-Extended may remove you from Google AI Overviews — which now appear in 25% of searches.
Can I allow AI search but block AI training?
Yes. OpenAI uses OAI-SearchBot for search and GPTBot for training. Block GPTBot, allow OAI-SearchBot. Same logic applies to Anthropic (allow ClaudeBot search, block anthropic-ai training). Configure via Cloudflare AI Crawl Control or robots.txt.
Does robots.txt actually stop AI crawlers?
Not always. Cloudflare documented Perplexity using stealth crawlers that ignore robots.txt entirely. For reliable enforcement, use WAF rules or Cloudflare's AI Crawl Control in addition to robots.txt. <a href="https://blog.cloudflare.com/perplexity-is-using-stealth-undeclared-crawlers-to-evade-website-no-crawl-directives/" target="_blank" rel="noopener">Source</a>
How much bandwidth do AI crawlers use?
50 billion+ requests/day across Cloudflare's network (~1% of all web traffic). Read the Docs saved $1,500/month by blocking them. Your impact depends on site size, but the selective approach (allow search, block training) cuts most bandwidth waste while keeping visibility.