The llms.txt Effect
We HTTP-scanned 37,894 AI-cited domains from a corpus of 102,857. Of those, 5,035 - 13.3% - have llms.txt. The citation advantage? Statistically zero.
The Landscape
Based on 37,894 most-cited domains in AI responses
The Inverse Pattern
The most-cited domains in AI responses don't have llms.txt: among the top 50 most-cited sites, only 6% have adopted the standard. Moving down the citation rankings, adoption actually increases - suggesting that llms.txt is being adopted by sites hoping to improve their visibility, not by the sites that already dominate AI citations.
The Verdict
Medians are identical - both exactly 3.0
Nearly Identical Averages
Across 37,894 domains, sites with llms.txt average 6.8 citations while sites without average 6.7 - a difference so small it is indistinguishable from noise. The Mann-Whitney U test gives p=0.85, about as far from statistical significance as a result can get.
The medians confirm the story: both groups land at exactly 3.0 citations. At the full 38K scale the test does turn technically significant (p<0.001), but only because of sheer sample size: the effect size is r=-0.065, well below the 0.1 threshold for even a "small" effect. Statistical significance without practical significance.
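The distinction between a tiny p-value and a tiny effect size can be reproduced with SciPy. The citation arrays below are synthetic placeholders, not the study's data; the effect size r is derived from the U statistic via the normal approximation, r = Z / sqrt(N).

```python
import numpy as np
from scipy.stats import mannwhitneyu, norm

rng = np.random.default_rng(0)
# Placeholder right-skewed citation counts - NOT the study's data.
with_llms = rng.lognormal(mean=1.0, sigma=1.2, size=5_035).round()
without_llms = rng.lognormal(mean=1.0, sigma=1.2, size=32_859).round()

u, p = mannwhitneyu(with_llms, without_llms, alternative="two-sided")

# Effect size via the normal approximation: recover |Z| from the
# two-sided p-value, then r = Z / sqrt(N). With ~38K samples, even a
# "significant" p can coexist with a negligible r.
n = len(with_llms) + len(without_llms)
z = norm.isf(p / 2)          # |Z| from two-sided p
r = z / np.sqrt(n)
print(f"U={u:.0f}  p={p:.3f}  |r|={r:.3f}")
```

With identical underlying distributions, r lands far below the 0.1 "small effect" threshold regardless of what the p-value does - the same pattern the full-scale test shows.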
Who's Adopting
The Tech Echo Chamber
llms.txt adoption is led by SaaS and developer tools at 24% - the exact community that proposed the standard. Government and academic sites sit at just 1.5%, while review sites and reference wikis are at 0%.
This creates a selection bias problem: the sites most likely to adopt llms.txt are already technically sophisticated, well-structured, and API-friendly - qualities that independently correlate with AI visibility.
The Leaderboard
Most-cited domains that have adopted llms.txt
Most-cited domains that have not adopted llms.txt
Brand Visibility
Composite AI visibility score (0-100)
Mean across all brands analyzed
Based on 205 brands with both audit data and visibility reports
Same Scores, Different File
Using Trakkr's multi-dimensional visibility scoring - which combines presence, rank, mentions, and sentiment across multiple AI models - brands with llms.txt post a median visibility score of 23.15, versus 23.55 for brands without.
This 0.4-point difference is well within noise. Whether you look at raw citation counts or composite visibility metrics, the result is the same: llms.txt is not currently a factor in AI recommendation engines.
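A composite score of this kind can be sketched as a weighted blend of the four dimensions. The dimension names follow the text, but the weights, normalization, and `BrandSignals` structure below are illustrative assumptions, not Trakkr's actual formula.

```python
from dataclasses import dataclass

@dataclass
class BrandSignals:
    presence: float   # share of prompts where the brand appears, 0..1
    rank: float       # average position when present (1 = first)
    mentions: float   # mention volume, normalized 0..1 against cohort max
    sentiment: float  # -1 (negative) .. +1 (positive)

def visibility_score(s: BrandSignals, max_rank: int = 10) -> float:
    """Blend four dimensions into a 0-100 score.
    Weights are illustrative guesses, not Trakkr's real formula."""
    rank_component = max(0.0, (max_rank - s.rank + 1) / max_rank)  # 1st place -> 1.0
    sentiment_component = (s.sentiment + 1) / 2                    # map -1..1 to 0..1
    blended = (0.40 * s.presence
               + 0.25 * rank_component
               + 0.20 * s.mentions
               + 0.15 * sentiment_component)
    return round(100 * blended, 2)

score = visibility_score(BrandSignals(presence=0.3, rank=4, mentions=0.2, sentiment=0.1))
```

The point of a composite like this is that no single dimension dominates - which is also why a 0.4-point median gap is noise rather than signal.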
What This Means
What This Actually Means
The data tells a clear story. Here are the four takeaways that matter for anyone building an AI visibility strategy.
llms.txt is a signal, not a lever
Having llms.txt tells AI systems "we care about being understood by LLMs." But current AI models don't actually read or prioritize llms.txt when generating citations. The standard is early - adoption is a bet on the future, not a present-day advantage.
AI citations are driven by training data
AI models cite sources they encountered during training: authoritative domains, frequently linked content, structured data, and topically relevant pages. A text file at /llms.txt doesn't retroactively change what the model learned.
Don't skip it - just don't expect miracles
llms.txt is low-cost to implement and good practice for structured content. As AI models evolve and potentially start using it during retrieval-augmented generation, early adopters may benefit. The cost of adoption is near zero; the potential future upside is real.
Focus on what actually works
The domains that dominate AI citations share common traits: deep, authoritative content, strong backlink profiles, structured data, consistent publishing, and topical expertise. These fundamentals drive AI visibility today - not technical signals.
Methodology
Aggregated citation data from 882 brand snapshots containing 337K+ citations across 102K+ unique domains
Ranked all domains by total AI citation appearances and selected the 37,894 with 2+ appearances for analysis
Async HTTP checks against /llms.txt with content validation to reject HTML error pages and soft 404s
Mann-Whitney U test comparing citation distributions between adopters and non-adopters (non-parametric, suitable for skewed data)
Cross-referenced with Trakkr visibility reports (205 brands) and website audit data to check for confounding factors
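The adoption check in the pipeline above can be approximated with a small validator. The heuristics here - rejecting HTML bodies and requiring a heading or URLs - are our reading of "content validation," not the exact production rules.

```python
import asyncio
import urllib.request

def looks_like_llms_txt(status: int, body: str) -> bool:
    """Reject soft 404s: HTML error pages and login redirects that
    return 200. A plausible llms.txt is plain text with a heading or URLs."""
    if status != 200:
        return False
    head = body.lstrip()[:200].lower()
    if head.startswith(("<!doctype", "<html")):
        return False  # HTML page served at /llms.txt -> soft 404
    return head.startswith("#") or "http" in body

async def has_llms_txt(domain: str) -> bool:
    """Fetch https://<domain>/llms.txt off the event loop and validate it."""
    url = f"https://{domain}/llms.txt"
    def fetch():
        with urllib.request.urlopen(url, timeout=10) as resp:
            return resp.status, resp.read(65536).decode("utf-8", "replace")
    try:
        status, body = await asyncio.to_thread(fetch)
    except OSError:
        return False  # DNS failure, timeout, TLS error, HTTP error
    return looks_like_llms_txt(status, body)
```

In a real scan, `has_llms_txt` would be fanned out across thousands of domains with `asyncio.gather` plus a semaphore for concurrency limits.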
Non-parametric test: We used Mann-Whitney U rather than a t-test because citation distributions are heavily right-skewed.
Content validation: HTTP 200 responses were validated to exclude HTML error pages, soft 404s, and login redirects that return 200 status.
Confound check: We verified that llms.txt adopters don't systematically differ in website audit scores, controlling for site quality.
File quality: Among adopters, 89% include a title, 98% contain URLs, and 79% score 4/4 on our content quality rubric. These are well-implemented files - they just don't move citations.
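A rubric behind the "title" and "contains URLs" figures can be approximated like this. The four checks are assumptions about what a 4/4 file looks like, based on the markdown shape the llms.txt proposal describes (H1 title, blockquote summary, H2 link sections).

```python
import re

def quality_rubric(text: str) -> dict:
    """Score an llms.txt body on four illustrative checks (assumed rubric)."""
    lines = [line.strip() for line in text.splitlines() if line.strip()]
    checks = {
        "has_title": any(line.startswith("# ") for line in lines),    # H1 title
        "has_summary": any(line.startswith("> ") for line in lines),  # blockquote summary
        "has_urls": bool(re.search(r"https?://\S+", text)),
        "has_sections": any(line.startswith("## ") for line in lines),  # H2 sections
    }
    checks["score"] = sum(checks.values())  # 0..4
    return checks

sample = "# Acme\n> Widgets for AI.\n## Docs\n- [API](https://acme.dev/api)\n"
print(quality_rubric(sample)["score"])  # 4
```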
Data source: Production citation data from the Trakkr platform, representing real-world AI visibility monitoring across 882 brands.
See how your brand performs in AI search