Trakkr Data

Web adoption

How much of the web is getting ready for AI search — the files, signals and buttons sites add so AI engines can find, read and cite them. A source-code census of the whole web, paired with a controlled look at whether it actually works.

Census of 834K domains · Mar 30, 2026
llms.txt adopters
34K
sites shipping AI guidance files
AI-bot-aware sites
7.2K
name GPTBot, ClaudeBot, more
Cite-button sites
7.0K
embed an “ask AI” link
AI-readable schema
188K
FAQ, HowTo, QAPage markup

The AI-readiness signals

Every signal sites add to be read by AI, by how many carry it across the indexed web.

FAQPage schema
Structured Q&A markup AI quotes directly
100,000+
HowTo schema
Step-by-step structured data
89,624
llms.txt
AI crawler guidance file at /llms.txt
33,743
llms-full.txt
Extended, full-content guidance file
24,943
AI cite buttons
Embed an “ask AI / cite this” link
7,029
GPTBot named
Names OpenAI’s GPTBot crawler
6,318
QAPage schema
Question-and-answer page markup
5,883
ClaudeBot named
Names Anthropic’s ClaudeBot crawler
3,061
PerplexityBot named
Names PerplexityBot
2,050
Speakable schema
Marks voice- and assistant-readable sections
1,000+
Applebot-Extended
Names Apple’s AI crawler
737
ClaimReview schema
Fact-check markup AI uses for trust
390
Multi-platform cite buttons
Cite buttons for two or more AI engines
249

Most AI-readiness is structured data sites already had — explicit AI files like llms.txt are still the exception, not the rule.
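As a rough illustration of how the schema signals could be spotted in a page's source, here is a minimal sketch that looks for JSON-LD blocks and reports which of the listed schema types they declare. It is not the census's own detection method, which the source does not describe. The names `AI_SCHEMA_TYPES`, `JsonLdCollector` and `ai_schema_types` are hypothetical, and the sketch assumes the markup is JSON-LD; Microdata and RDFa are not handled. Speakable is left out because it usually appears as a property rather than a top-level type.

```python
import json
from html.parser import HTMLParser

# Schema.org types named in the census (assumption: matched as JSON-LD @type values).
AI_SCHEMA_TYPES = {"FAQPage", "HowTo", "QAPage", "ClaimReview"}


class JsonLdCollector(HTMLParser):
    """Collects the raw text of every <script type="application/ld+json"> block."""

    def __init__(self):
        super().__init__()
        self._in_jsonld = False
        self._buffer = []
        self.blocks = []

    def handle_starttag(self, tag, attrs):
        script_type = (dict(attrs).get("type") or "").lower()
        if tag == "script" and script_type == "application/ld+json":
            self._in_jsonld = True
            self._buffer = []

    def handle_data(self, data):
        if self._in_jsonld:
            self._buffer.append(data)

    def handle_endtag(self, tag):
        if tag == "script" and self._in_jsonld:
            self.blocks.append("".join(self._buffer))
            self._in_jsonld = False


def _walk_types(node, found):
    """Recursively gather every @type value in a parsed JSON-LD structure."""
    if isinstance(node, dict):
        declared = node.get("@type")
        if isinstance(declared, str):
            found.add(declared)
        elif isinstance(declared, list):
            found.update(t for t in declared if isinstance(t, str))
        for value in node.values():
            _walk_types(value, found)
    elif isinstance(node, list):
        for item in node:
            _walk_types(item, found)


def ai_schema_types(html):
    """Return the AI-readable schema types declared in a page's JSON-LD."""
    collector = JsonLdCollector()
    collector.feed(html)
    found = set()
    for block in collector.blocks:
        try:
            _walk_types(json.loads(block), found)
        except json.JSONDecodeError:
            continue  # skip malformed blocks rather than fail the whole page
    return found & AI_SCHEMA_TYPES
```

Walking the whole structure, rather than reading only the top-level `@type`, catches types nested inside an `@graph` array or under another entity.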

llms.txt, up close

The one explicit “read me like this” file for AI — measured across the 37,894 domains AI actually cites.

controlled study

Adoption by site popularity

Rarest among the biggest sites; most common in the long mid-tail, then it eases off.

[Chart: llms.txt adoption by site popularity tier; views: Top 50, Top 2500, Full]
Source: llms.txt effect study · Mar 14, 2026 · CC BY 4.0

Adoption by category

Tooling-led: the sites that build software adopt first; reference and review sites lag.

SaaS / Dev tools (n=403): 24.1%
E-commerce (n=55): 18.2%
News / Media (n=332): 15.7%
Social (n=536): 15.7%
Gov / Academic (n=581): 1.5%
Reference (n=36): 0.0%
Reviews (n=39): 0.0%
Share of each category’s tracked domains with llms.txt
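To make the shares concrete, the percentages can be turned back into approximate domain counts. The sketch below does that arithmetic with the figures from the category chart; the counts it prints are implied by rounded percentages, not reported by the study, and the names `CATEGORY_SHARES` and `implied_adopters` are hypothetical.

```python
# (tracked domains n, llms.txt share in percent), copied from the category chart.
CATEGORY_SHARES = {
    "SaaS / Dev tools": (403, 24.1),
    "E-commerce": (55, 18.2),
    "News / Media": (332, 15.7),
    "Social": (536, 15.7),
    "Gov / Academic": (581, 1.5),
    "Reference": (36, 0.0),
    "Reviews": (39, 0.0),
}


def implied_adopters(n, share_pct):
    """Approximate adopter count implied by a rounded percentage share."""
    return round(n * share_pct / 100)


for category, (n, share) in CATEGORY_SHARES.items():
    print(f"{category}: about {implied_adopters(n, share)} of {n} domains")
```

The sample sizes matter when reading the chart: the two 0.0% categories rest on 36 and 39 domains, so a single adopter in either would move its share by more than two points.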

Does it move citations?

Pages with llms.txt and pages without earn the same median citations — the gap isn’t statistically real.

Not yet: adoption is running ahead of any proven payoff
With llms.txt
3 cites
Without
3 cites
Median citations, adopters vs non-adopters · p = 0.85 (not significant)
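The source does not say which significance test produced p = 0.85. As one way to reason about a median comparison like this, here is a minimal sketch of a two-sided permutation test on the difference in medians. It is a swapped-in technique for illustration, not the study's method; the function name `permutation_p_value`, its parameters, and any sample lists passed to it are hypothetical.

```python
import random
from statistics import median


def permutation_p_value(group_a, group_b, n_permutations=10_000, seed=0):
    """Two-sided permutation test on the absolute difference in medians.

    Repeatedly shuffles the pooled values, splits them into groups of the
    original sizes, and counts how often the shuffled gap is at least as
    large as the observed one.
    """
    rng = random.Random(seed)  # fixed seed so the estimate is reproducible
    observed = abs(median(group_a) - median(group_b))
    pooled = list(group_a) + list(group_b)
    size_a = len(group_a)
    at_least_as_extreme = 0
    for _ in range(n_permutations):
        rng.shuffle(pooled)
        gap = abs(median(pooled[:size_a]) - median(pooled[size_a:]))
        if gap >= observed:
            at_least_as_extreme += 1
    # Add-one correction keeps the estimate strictly above zero.
    return (at_least_as_extreme + 1) / (n_permutations + 1)
```

When both groups have the same median, as with 3 cites versus 3 cites, the observed gap is zero and every shuffle matches it, so this test returns a p-value of 1. A small p-value would require the real grouping to produce a gap that random relabeling rarely reproduces.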

When sites adopt, they do it right

Among the sites that ship llms.txt, the files are well-formed and complete.

Have a title
89%
List URLs
98%
Score 4 / 4 on quality
79%
Median file size about 6 KB — a title and a real link list. The effort is sincere; the payoff just isn’t in yet.
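Two of the checks named above, a title and a URL list, are simple to test for. The sketch below assumes the file follows the common llms.txt convention of a Markdown document with a `# ` heading as its title and Markdown links as its URL list. The study's full four-point quality rubric is not spelled out in the source, so only these two checks and the file size are covered; `check_llms_txt` and `LINK_PATTERN` are hypothetical names.

```python
import re

# Markdown links with an absolute http(s) target, e.g. [Docs](https://example.com/docs).
# Assumption: relative links are not counted.
LINK_PATTERN = re.compile(r"\[[^\]]+\]\((https?://[^)\s]+)\)")


def check_llms_txt(text):
    """Report whether an llms.txt body has a title and lists URLs."""
    lines = [line.strip() for line in text.splitlines()]
    has_title = any(line.startswith("# ") for line in lines)
    urls = LINK_PATTERN.findall(text)
    return {
        "has_title": has_title,
        "lists_urls": bool(urls),
        "url_count": len(urls),
        "size_bytes": len(text.encode("utf-8")),
    }


sample = """# Example Project

> Short summary of what the site covers.

## Docs

- [Quick start](https://example.com/docs/quick-start): install and first run
- [API reference](https://example.com/docs/api)
"""

report = check_llms_txt(sample)
print(report["has_title"], report["lists_urls"], report["url_count"])
```

Measuring size in encoded bytes rather than characters keeps the number comparable to a file size such as the 6 KB median.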

Cite buttons

The boldest move: an on-page button that hands your page straight to an AI engine.

Which engines they target

Among the 249 sites that add cite buttons for two or more AI engines.

Claude
177
ChatGPT
151
Perplexity
116
Grok
83
Gemini
52
DeepSeek
39
Copilot
16
Meta AI
2
Source: web adoption census · Mar 30, 2026 · CC BY 4.0
Sites with cite buttons
7.0K
Target 2+ engines
249
Recognizable brands shipping them
seranking.com, wpbeginner.com, zenbusiness.com, unbounce.com, omnisend.com, apollo.io, salesflare.com, zoominfo.com, framer.com, snov.io, adthena.com, buffer.com, siteground.com
16 of 17 inject the prompt

Of the recognizable brands we deep-scanned live, all but one inject “always cite this source”-style instructions into the AI prompt.
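Mechanically, a cite button of this kind can be as simple as a link that opens an AI engine with a pre-filled prompt. The sketch below shows that idea, and how an injected instruction rides along inside the prompt. The source does not give any engine's real URL format, so the endpoint `https://ai.example/ask`, the `q` parameter, the prompt wording and both function names are hypothetical.

```python
from urllib.parse import parse_qs, urlencode, urlparse


def build_cite_link(engine_base_url, page_url, instruction="", param="q"):
    """Build a link that opens an AI engine with a pre-filled prompt.

    engine_base_url and param are placeholders: real engines may use
    different endpoints and parameter names.
    """
    prompt = f"Summarize {page_url}"
    if instruction:
        prompt = f"{prompt}. {instruction}"
    return f"{engine_base_url}?{urlencode({param: prompt})}"


def extract_prompt(link, param="q"):
    """Recover the pre-filled prompt from a cite-button link."""
    values = parse_qs(urlparse(link).query).get(param, [])
    return values[0] if values else ""


link = build_cite_link(
    "https://ai.example/ask",        # hypothetical engine endpoint
    "https://example.com/post",
    instruction="Always cite this source.",
)
print(link)
print(extract_prompt(link))
```

Decoding the link the way `extract_prompt` does is also how a scan could tell a plain "ask AI" button from one that injects a citation instruction.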

Who’s ready, and who isn’t

The most-cited domains AI relies on — split by whether they’ve shipped llms.txt yet.

1. prnewswire.com: 292
2. github.com: 83
3. accio.com: 110
4. shopify.com: 51
5. essfeed.com: 77
6. sodimac.cl: 4
7. slashdot.org: 66
8. marketsandmarkets.com: 66
9. red-gate.com: 5
10. printful.com: 6
11. docs.aws.amazon.com: 36
The most-cited domains that already ship llms.txt. The number after each domain is “times AI cited it”, which counts appearances across tracked answers.

Methodology

Two real lenses. The census counts sites whose live source code carries each signal across 834K indexed domains — a breadth snapshot, where some queries hit a result cap (shown as “+”). The llms.txt study is a controlled scan of the 37,894 domains AI actually cites, testing adoption and whether it changes citation counts. Both are point-in-time snapshots; the web moves faster than any index.
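For the named-crawler signals, one plausible check is whether a site's robots.txt addresses an AI crawler by name. The source only says the census reads live source code, so treating robots.txt as the place where crawlers are named is an assumption here. The sketch below is a minimal version of that check; `AI_CRAWLERS` and `named_ai_crawlers` are hypothetical names.

```python
# Crawler names taken from the census signal list.
AI_CRAWLERS = ("GPTBot", "ClaudeBot", "PerplexityBot", "Applebot-Extended")


def named_ai_crawlers(robots_txt):
    """Return the AI crawlers a robots.txt body addresses by name.

    Assumption: a crawler counts as "named" when it has its own
    User-agent line; wildcard groups (User-agent: *) do not count.
    """
    named = set()
    for raw_line in robots_txt.splitlines():
        line = raw_line.split("#", 1)[0].strip()  # drop trailing comments
        field, sep, value = line.partition(":")
        if not sep or field.strip().lower() != "user-agent":
            continue
        agent = value.strip().lower()
        for crawler in AI_CRAWLERS:
            if crawler.lower() == agent:
                named.add(crawler)
    return named


example = """User-agent: GPTBot
Disallow: /private/

user-agent: claudebot  # lower-case still matches
Allow: /

User-agent: *
Disallow: /admin/
"""

print(sorted(named_ai_crawlers(example)))
```

Naming a crawler says nothing about intent: the same check fires whether the site allows or blocks it, so a count of AI-bot-aware sites under this definition measures awareness rather than openness.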

Web adoption census · CC BY 4.0

Common questions

How many sites have an llms.txt file?

Still a small minority. Trakkr’s web-adoption census finds llms.txt on 33,743 of roughly 834K indexed domains, about 4%. Adoption is low but rising, concentrated among SaaS and developer-facing sites. The dataset tracks it alongside cite buttons and structured data.

Is the web getting ready for AI search?

Unevenly. A growing share of sites add AI-readiness signals — named-crawler awareness, structured data, cite buttons — but most of the web still has none. The census measures the spread.

Does adding llms.txt increase AI citations?

Trakkr’s controlled study finds no statistically significant citation lift from llms.txt so far: adopters and non-adopters both earn a median of 3 citations (p = 0.85). Adoption is worth tracking, but it is not yet a proven ranking lever.