GPTBot

Training crawlerby OpenAI

OpenAI's primary web crawler. It collects publicly available pages that may be used to train future versions of GPT and ChatGPT.

Share of AI crawl visits

57.2%

Rank #1 of 12 crawlers

Visits observed

329K

identified requests

Sites reached

70%

59 of 84

Pages per visit

60.5

5.4K sessions

Crawl velocity

61/min

peak pages per minute

What GPTBot is for

Collects pages to train future models. No direct traffic back to you.

Blocking GPTBot keeps your content out of OpenAI training data, but does not affect whether ChatGPT can cite you in answers - that is OAI-SearchBot.

Crawler guide

Evergreen identity, source, and robots.txt handling guidance.

Read the full bot guide

How GPTBot reads a site

Its real crawl signature, from the pages Trakkr observes it fetch.

Click-depth of pages fetched

67% at depth 3+

D0 3%D1 10%D2 20%D3 52%D4 12%D5+ 4%

First hit is the homepage2.8%

Visits per site5.6K

Weekend vs weekday1.29×29%

When GPTBot crawls

Activity over a day in UTC - it peaks around 14:00 and is quietest near 02:00.

0006121823

Allow or block GPTBot

Honours robots.txt. Add one of these to your robots.txt.

Block it

User-agent: GPTBot
Disallow: /

Allow it explicitly

User-agent: GPTBot
Allow: /

User-agent

Mozilla/5.0 AppleWebKit/537.36 (KHTML, like Gecko); compatible; GPTBot/1.2; +https://openai.com/gptbot

OpenAI's official bot docs

Other AI crawlers

Every bot Trakkr tracks is a doorway.

OAI-SearchBotAI search crawler · OpenAI

ChatGPT-UserLive fetcher · OpenAI

ClaudeBotTraining crawler · Anthropic

Claude-SearchBotAI search crawler · Anthropic

PerplexityBotAI search crawler · Perplexity

Perplexity-UserLive fetcher · Perplexity

BytespiderTraining crawler · ByteDance

Meta-ExternalFetcherLive fetcher · Meta

AmazonbotOther · Amazon

Common questions

What is GPTBot?

GPTBot is an AI web crawler operated by OpenAI. OpenAI's primary web crawler. It collects publicly available pages that may be used to train future versions of GPT and ChatGPT.

How do I block GPTBot?

Add a directive to your robots.txt: "User-agent: GPTBot" followed by "Disallow: /". Honours robots.txt.

Does blocking GPTBot hide me from AI?

Blocking GPTBot keeps your content out of training data but does not remove you from AI search answers - those are served by the search crawlers. Blocking GPTBot keeps your content out of OpenAI training data, but does not affect whether ChatGPT can cite you in answers - that is OAI-SearchBot.

Methodology

Identity and robots.txt facts come from OpenAI's published bot documentation. Behaviour is measured from identified GPTBot requests in the server logs of the brands Trakkr tracks - 576K crawler visits across 84 sites, recounted as new data arrives.

Open data

Open API GitHub repository

Trakkr DataAll crawlers Citations·Data as of 132d agoCC BY 4.0

Telemetry updated Feb 1, 2026.