Trakkr Data

GPTBot

Training crawlerby OpenAI

OpenAI's primary web crawler. It collects publicly available pages that may be used to train future versions of GPT and ChatGPT.

Share of AI crawl visits
57.2%
Rank #1 of 12 crawlers
Visits observed
329K
identified requests
Sites reached
70%
59 of 84
Pages per visit
60.5
5.4K sessions
Crawl velocity
61/min
peak pages per minute

What GPTBot is for

Collects pages to train future models. No direct traffic back to you.

Blocking GPTBot keeps your content out of OpenAI training data, but does not affect whether ChatGPT can cite you in answers - that is OAI-SearchBot.

Crawler guide

Evergreen identity, source, and robots.txt handling guidance.

Read the full bot guide

How GPTBot reads a site

Its real crawl signature, from the pages Trakkr observes it fetch.

Click-depth of pages fetched
67% at depth 3+
D0 3%D1 10%D2 20%D3 52%D4 12%D5+ 4%
First hit is the homepage2.8%
Visits per site5.6K
Weekend vs weekday1.29×29%

When GPTBot crawls

Activity over a day in UTC - it peaks around 14:00 and is quietest near 02:00.

0006121823

Allow or block GPTBot

Honours robots.txt. Add one of these to your robots.txt.

Block it
User-agent: GPTBot
Disallow: /
Allow it explicitly
User-agent: GPTBot
Allow: /
User-agent
Mozilla/5.0 AppleWebKit/537.36 (KHTML, like Gecko); compatible; GPTBot/1.2; +https://openai.com/gptbot
OpenAI's official bot docs

Other AI crawlers

Every bot Trakkr tracks is a doorway.

Common questions

What is GPTBot?

GPTBot is an AI web crawler operated by OpenAI. OpenAI's primary web crawler. It collects publicly available pages that may be used to train future versions of GPT and ChatGPT.

How do I block GPTBot?

Add a directive to your robots.txt: "User-agent: GPTBot" followed by "Disallow: /". Honours robots.txt.

Does blocking GPTBot hide me from AI?

Blocking GPTBot keeps your content out of training data but does not remove you from AI search answers - those are served by the search crawlers. Blocking GPTBot keeps your content out of OpenAI training data, but does not affect whether ChatGPT can cite you in answers - that is OAI-SearchBot.

Methodology

Identity and robots.txt facts come from OpenAI's published bot documentation. Behaviour is measured from identified GPTBot requests in the server logs of the brands Trakkr tracks - 576K crawler visits across 84 sites, recounted as new data arrives.

Trakkr DataAll crawlersCitations·Data as of 132d agoCC BY 4.0

Telemetry updated Feb 1, 2026.