Study 005

The llms.txt Effect

We HTTP-scanned 37,894 AI-cited domains from a corpus of 102,857. 5,035 have llms.txt. The citation advantage? Statistically zero.

12.6%
adoption rate
p=0.80
not significant
36K
domains scanned
320K+
citations analyzed
Last updated · Mar 14, 2026
[01]

The Landscape

13.3%of AI-cited domains have llms.txt
Adoption by Citation Tier
Top 506.0%
Top 1007.0%
Top 25013.6%
Top 50014.4%
Top 100015.3%
Top 250015.9%
Top 500016.1%
Top 1000015.7%
Top 2500013.7%
Full (37,894)13.3%

Based on 37,894 most-cited domains in AI responses

The Inverse Pattern

The most-cited domains in AI don't have llms.txt. Among the top 50 most-cited sites, only 6% have adopted the standard. As you move down the citation rankings, adoption actually increases - suggesting that llms.txt is being adopted by sites hoping to improve their visibility, not by sites that already have it.

The Key Question
If the most-cited domains don't use llms.txt, what actually drives AI citations? Our data suggests it's domain authority, content depth, and training data exposure - not technical signals like llms.txt.
[02]

The Verdict

Average Citations per Domain
With llms.txt
6.8
avg citations
Without llms.txt
6.7
avg citations
Mann-Whitney U:p=0.85Not Significant
Median (with)
3.0
Median (without)
3.0

Medians are identical - both exactly 3.0

Nearly Identical Averages

Across 37,894 domains, sites with llms.txt average 6.8 citations while sites without average 6.7 - a difference so small it's indistinguishable from noise. The Mann-Whitney U test gives p=0.85, about as far from statistical significance as you can get.

The medians confirm the story: both groups land at exactly 3.0 citations. At the full 38K scale the test becomes technically significant (p<0.001) due to sheer sample size, but the effect size is r=-0.065 - well below the 0.1 threshold for even a "small" effect. Statistical significance without practical significance.

The Bottom Line
Having llms.txt provides zero measurable advantage in AI citation frequency. The data is unambiguous: whatever drives AI citations, it isn't llms.txt.
[03]

Who's Adopting

24.1%adoption in saas / developer tools
Adoption Rate by Domain Category
SaaS / Developer Tools
97/40324.1%
E-commerce
10/5518.2%
News / Media
52/33215.7%
Social Platforms
84/53615.7%
Government / Academic
9/5811.5%
Reference / Wiki
0/360.0%
Review Sites
0/390.0%

The Tech Echo Chamber

llms.txt adoption is led by SaaS and developer tools at 24% - the exact community that proposed the standard. Government and academic sites sit at just 1.5%, while review sites and reference wikis are at 0%.

This creates a selection bias problem: the sites most likely to adopt llms.txt are already technically sophisticated, well-structured, and API-friendly - qualities that independently correlate with AI visibility.

Why This Matters
The categories with the highest domain authority (reference, review, academic) have the lowest llms.txt adoption. The sites that dominate AI citations don't need llms.txt - they're cited because of brand authority and content quality.
[04]

The Leaderboard

With llms.txt

Most-cited domains that have adopted llms.txt

Domain
CitationsBrands
prnewswire.com
1,070292 brands
github.com
44983 brands
chainalysis.com
29110 brands
accio.com
236110 brands
shopify.com
20251 brands
essfeed.com
20077 brands
sodimac.cl
1604 brands
slashdot.org
14366 brands
marketsandmarkets.com
13766 brands
trmlabs.com
1347 brands
Without llms.txt

Most-cited domains that have not adopted llms.txt

Domain
CitationsBrands
reddit.com
2,769460 brands
techradar.com
2,499377 brands
reuters.com
1,915309 brands
linkedin.com
1,579418 brands
forbes.com
1,479355 brands
youtube.com
1,344222 brands
wired.com
1,244362 brands
axios.com
1,015294 brands
ft.com
945283 brands
theverge.com
943259 brands
Authority Wins
The non-adopter column reads like a who's who of the internet. Reddit, Reuters, Forbes, LinkedIn - these sites dominate AI citations without any llms.txt optimization.
[05]

Brand Visibility

Trakkr Visibility Score Comparison
Median Visibility
With llms.txt
23.1
Without
23.6

Composite AI visibility score (0-100)

Average Visibility
With llms.txt
27.8
Without
26.3

Mean across all brands analyzed

Based on 205 brands with both audit data and visibility reports

Same Scores, Different File

Using Trakkr's multi-dimensional visibility scoring - which combines presence, rank, mentions, and sentiment across multiple AI models - brands with llms.txt score 23.15 median visibility versus 23.55 without.

This 0.4-point difference is well within noise. Whether you look at raw citation counts or composite visibility metrics, the result is the same: llms.txt is not currently a factor in AI recommendation engines.

Why This Matters
This analysis cross-references 205 brands that have both website audit data (where we detect llms.txt) and active visibility monitoring. It's the most comprehensive llms.txt-to-visibility comparison available.
[06]

What This Means

What This Actually Means

The data tells a clear story. Here are the four takeaways that matter for anyone building an AI visibility strategy.

01

llms.txt is a signal, not a lever

Having llms.txt tells AI systems "we care about being understood by LLMs." But current AI models don't actually read or prioritize llms.txt when generating citations. The standard is early - adoption is a bet on the future, not a present-day advantage.

02

AI citations are driven by training data

AI models cite sources they encountered during training: authoritative domains, frequently linked content, structured data, and topically relevant pages. A text file at /llms.txt doesn't retroactively change what the model learned.

03

Don't skip it - just don't expect miracles

llms.txt is low-cost to implement and good practice for structured content. As AI models evolve and potentially start using it during retrieval-augmented generation, early adopters may benefit. The cost of adoption is near zero; the potential future upside is real.

04

Focus on what actually works

The domains that dominate AI citations share common traits: deep, authoritative content, strong backlink profiles, structured data, consistent publishing, and topical expertise. These fundamentals drive AI visibility today - not technical signals.

Our Recommendation
Adopt llms.txt if you haven't - it's cheap insurance and good practice. But don't expect it to move the needle on AI citations today. Instead, invest in the fundamentals: content depth, topical authority, structured data, and consistent quality. Those are the levers that actually work.
[07]

Methodology

Data Pipeline
01
Citation Corpus

Aggregated citation data from 882 brand snapshots containing 337K+ citations across 102K+ unique domains

02
Domain Ranking

Ranked all domains by total AI citation appearances and selected the 37,894 with 2+ appearances for analysis

03
llms.txt Detection

Async HTTP checks against /llms.txt with content validation to reject HTML error pages and soft 404s

04
Statistical Testing

Mann-Whitney U test comparing citation distributions between adopters and non-adopters (non-parametric, suitable for skewed data)

05
Cross-Validation

Cross-referenced with Trakkr visibility reports (205 brands) and website audit data to check for confounding factors

Key Numbers
Domains Scanned
37,894
Brand Snapshots
882
Total Citations
337K+
Statistical Test
p=0.85
Methodology Notes

Non-parametric test: We used Mann-Whitney U rather than a t-test because citation distributions are heavily right-skewed.

Content validation: HTTP 200 responses were validated to exclude HTML error pages, soft 404s, and login redirects that return 200 status.

Confound check: We verified that llms.txt adopters don't systematically differ in website audit scores, controlling for site quality.

File quality: Among adopters, 89% include a title, 98% contain URLs, and 79% score 4/4 on our content quality rubric. These are well-implemented files - they just don't move citations.

Data source: Production citation data from the Trakkr platform, representing real-world AI visibility monitoring across 882 brands.