GPTBot
OpenAI's primary web crawler. It collects publicly available pages that may be used to train future versions of GPT and ChatGPT.
What GPTBot is for
Collects pages to train future models. No direct traffic back to you.
Blocking GPTBot keeps your content out of OpenAI training data, but does not affect whether ChatGPT can cite you in answers - that is OAI-SearchBot.
Crawler guide
Evergreen identity, source, and robots.txt handling guidance.
How GPTBot reads a site
Its real crawl signature, from the pages Trakkr observes it fetch.
When GPTBot crawls
Activity over a day in UTC - it peaks around 14:00 and is quietest near 02:00.
Allow or block GPTBot
Honours robots.txt. Add one of these to your robots.txt.
User-agent: GPTBot Disallow: /
User-agent: GPTBot Allow: /
Other AI crawlers
Every bot Trakkr tracks is a doorway.
Common questions
What is GPTBot?
GPTBot is an AI web crawler operated by OpenAI. OpenAI's primary web crawler. It collects publicly available pages that may be used to train future versions of GPT and ChatGPT.
How do I block GPTBot?
Add a directive to your robots.txt: "User-agent: GPTBot" followed by "Disallow: /". Honours robots.txt.
Does blocking GPTBot hide me from AI?
Blocking GPTBot keeps your content out of training data but does not remove you from AI search answers - those are served by the search crawlers. Blocking GPTBot keeps your content out of OpenAI training data, but does not affect whether ChatGPT can cite you in answers - that is OAI-SearchBot.
Identity and robots.txt facts come from OpenAI's published bot documentation. Behaviour is measured from identified GPTBot requests in the server logs of the brands Trakkr tracks - 576K crawler visits across 84 sites, recounted as new data arrives.
Telemetry updated Feb 1, 2026.