OAI-SearchBot vs GPTBot vs ChatGPT-User — What Each OpenAI Crawler Does

April 23, 2026

OpenAI runs three separate crawlers: GPTBot, OAI-SearchBot, and ChatGPT-User. Practitioners configuring robots.txt often block all three — and unknowingly lock themselves out of ChatGPT Shopping. This guide breaks down what each crawler does, how they’re verified, and the right config for e-commerce.

The three crawlers at a glance

CrawlerPurposeFrequencyDrives traffic?
GPTBotTraining foundation modelsAsync, continuousNo
OAI-SearchBotIndex for ChatGPT search & citationsAsync, regularYes — citations in ChatGPT search
ChatGPT-UserLive user-triggered fetchesReal-time per queryYes — live answers

Source: OpenAI Crawlers documentation.

GPTBot — the training crawler

GPTBot is OpenAI’s crawler for model training. It collects publicly available data to improve future versions of GPT models. User agent: GPTBot/1.1.

Key characteristics:

  • Operates offline and asynchronously — not tied to live user activity
  • Does not drive referral traffic to your site
  • Content scraped by GPTBot may appear in future model responses without attribution
  • Respects robots.txt — blocking it prevents training use
E-commerce decision

GPTBot has the highest crawl-to-referral ratio of all AI crawlers (effectively infinite — it never sends referrals). Many publishers block it to preserve bandwidth and content rights. For e-commerce, the decision is simpler: blocking GPTBot doesn’t hurt sales because it doesn’t send traffic.

OAI-SearchBot — the search indexer

OAI-SearchBot powers ChatGPT’s live search capabilities — inline citations, product recommendations, and real-time answers. User agent: OAI-SearchBot/1.0.

Key characteristics:

  • Asynchronous crawler that builds the ChatGPT search index
  • Augments data from Bing and other sources
  • Blocking it removes your pages from ChatGPT search results
  • Respects robots.txt
This is the critical one

When a ChatGPT user asks “best running shoes under $150,” OAI-SearchBot’s index determines which products are candidates. If you block it, you’re invisible to ChatGPT Shopping entirely.

ChatGPT-User — the live fetcher

ChatGPT-User is triggered when a user asks ChatGPT (or a Custom GPT) to fetch a specific URL or browse a website in real time. User agent: ChatGPT-User/1.0.

Key characteristics:

  • Not a traditional crawler — acts more like a browser agent
  • Triggered per user query, not on a schedule
  • Used when the user explicitly requests web content
  • Respects robots.txt directives targeting ChatGPT-User

Blocking ChatGPT-User means users who ask ChatGPT to “check availability on yourstore.com” will get an error instead of live data. For stores that want to support agentic shopping workflows, keep this allowed.

robots.txt configuration for e-commerce

The recommended setup: block training, allow search and user fetches.

# Block model training
User-agent: GPTBot
Disallow: /

# Allow search indexing (drives ChatGPT Shopping citations)
User-agent: OAI-SearchBot
Allow: /

# Allow live user fetches (supports agentic shopping)
User-agent: ChatGPT-User
Allow: /

This configuration gives you ChatGPT Shopping visibility and supports agent-driven queries while preventing your content from training future models without attribution.

If you want maximum visibility (allow all, including training):

User-agent: GPTBot
Allow: /

User-agent: OAI-SearchBot
Allow: /

User-agent: ChatGPT-User
Allow: /

Verification via IP ranges

Each crawler publishes its IP ranges as JSON. Use these to verify legitimate OpenAI traffic in your server logs and configure WAF allow-lists:

These endpoints are updated regularly. Automate periodic fetches to keep your WAF rules current.

Common configuration mistakes

  • Blocking all three via User-agent: * Disallow — kills ChatGPT Shopping visibility. Use explicit user-agent directives.
  • Cloudflare default blocking (July 2025+) — new Cloudflare domains block AI crawlers by default. Check your Bots dashboard and explicitly allow OAI-SearchBot.
  • Confusing ChatGPT-User with ChatGPT traffic — ChatGPT-User is only used for explicit web fetches. Most ChatGPT citations come from OAI-SearchBot’s pre-built index.
  • WAF blocking the user agent string — robots.txt allows the crawler, but your WAF returns 403. Check both layers.

How to check if you’re blocked

Ask ChatGPT: “Does [yourstore.com] sell [specific product]?” If ChatGPT says “I can’t access that site” or returns generic information without visiting, you’re likely blocked at one of these layers:

  1. robots.txt disallowing OAI-SearchBot or ChatGPT-User
  2. Cloudflare Bot Fight Mode or AI Crawl Control blocking the request
  3. WAF rule returning 403 to the OpenAI user agent
  4. Server-side rendering failing for the crawler’s request headers

GEOlikeaPro’s Crawler View shows exactly what each OpenAI crawler sees when it visits your pages — including whether you’re blocked at the robots.txt, WAF, or rendering layer. Sign up free to audit your crawler configuration.

FAQ

What's the difference between GPTBot and OAI-SearchBot?

GPTBot trains foundation models — it doesn't send you traffic. OAI-SearchBot indexes content for ChatGPT's live search and citations — this is what makes your products show up in ChatGPT Shopping. You want OAI-SearchBot allowed at minimum.

Does blocking GPTBot hurt my ChatGPT visibility?

No. GPTBot is for training future models, not for search. Blocking GPTBot keeps your content out of training data but does NOT affect ChatGPT Shopping or citations — those come from OAI-SearchBot, which is a separate crawler.

What is ChatGPT-User and when does it fetch my site?

ChatGPT-User fetches pages in real time when a user explicitly asks ChatGPT to check a URL or when a Custom GPT uses browsing. It's not a crawler — it acts like a browser agent per user query. Blocking it prevents live user lookups.

Should I allow or block OpenAI crawlers for e-commerce?

For e-commerce: block GPTBot (training, no traffic), allow OAI-SearchBot (ChatGPT Shopping citations), allow ChatGPT-User (live user fetches). This gives maximum visibility for products without feeding training data.

How do I verify OpenAI crawler traffic in my logs?

OpenAI publishes IP ranges for each crawler at openai.com/gptbot.json, openai.com/searchbot.json, and openai.com/chatgpt-user.json. Cross-reference the source IP with these lists to verify legitimate OpenAI traffic. Spoofed user agents with non-matching IPs are impersonators.

Why isn't my store showing in ChatGPT search results?

Most common causes: (1) robots.txt blocks OAI-SearchBot, (2) Cloudflare default blocking is enabled (introduced July 2025 for new domains), (3) WAF returns 403 to the OpenAI user agent, (4) the page requires JavaScript rendering that the crawler can't execute. Check all four layers.

See how AI search engines rank your store

Run a free AI visibility audit — find out where ChatGPT, Perplexity and Google AI rank your products.

Try free audit →

Free tier · No credit card required