Study 006

AI Crawls Your Product Pages. It Cites Your Blog.

337K AI citations. 11.4M crawler visits. 882 brands. The data reveals a systematic mismatch between what AI reads and what AI recommends.

19.2%
blog citation share
0.46x
product page citation efficiency
320K+
citations analyzed
10.8M
crawler visits tracked
Last updated · Mar 14, 2026

To measure this mismatch, we use a single metric: citation efficiency. It captures how well each page type converts crawler attention into actual AI citations. A score of 1.0x means a page type is cited in proportion to how often it gets crawled. Above 1.0x, the content outperforms. Below 1.0x, AI reads the pages but doesn't recommend them.

[01]

The Mismatch

The Mismatch

Product pages receive 13% of all crawler visits but earn just 6.3% of citations. AI reads your catalog obsessively. When it comes time to recommend, it points somewhere else.

Blog content shows the inverse: 14.4% of crawls, 20.2% of citations. Review sites punch 5x above their weight. FAQ pages get read but never recommended. About pages - rarely crawled - get cited far more than expected.

The pages AI reads most are not the pages it recommends. This gap defines where content investment creates real visibility.

From the dataset
AI Citations
337K
unique URL-brand pairs
Crawler Visits
11.4M
tracked bot visits
Brands Analyzed
882
across industries
URLs Classified
58%
into 13 page types
[02]

The Efficiency Matrix

Citation efficiency for all 13 classifiable page types. Scores above 1.0x mean the content earns more citations than its crawl traffic would predict. Below 1.0x, AI reads the pages but rarely recommends them.

Cited more than crawled
Blog
14.4%
20.2%
1.4x
Service
1.4%
2.6%
1.9x
Review
0.3%
1.2%
4.8x
About
0.4%
1.7%
4.6x
Resource
0.4%
1.4%
3.8x
Balanced
Docs
0.7%
0.8%
1.1x
Home
10.7%
7.9%
0.7x
Integration
0.4%
0.4%
1.0x
Crawled more than cited
Product
13.0%
6.3%
0.5x
Category
1.9%
0.9%
0.5x
FAQ
1.9%
0.4%
0.2x
Legal
0.4%
0.2%
0.6x
Compare
0.4%
0.2%
0.6x

The five overperformers share a trait: they provide context, opinion, or authority rather than product information. AI engines use them as evidence for recommendations, not as the recommendations themselves.

[03]

The Big Three

Three page types tell the full story of citation efficiency. One converts crawler attention into recommendations at 1.4x. One wastes half its traffic. And one is nearly invisible.

The Workhorse
Blog / Editorial
1.40x
Citation 20.2%
Crawl 14.4%
Avg. queries 8.0

One in five AI citations lands on a blog post. No other page type comes close. Blog content is also the stickiest - once AI cites a post, it tends to cite it again across an average of 8 different queries.

The Laggard
Product Pages
0.49x
Citation 6.3%
Crawl 13.0%
Avg. queries 7.2

AI crawlers are obsessed with your catalog - 13% of all visits go to product pages. But only 6.3% of citations point there. Product pages are read twice as often as they're recommended.

The Ghost
Help / FAQ
0.21x
Citation 0.4%
Crawl 1.9%
Avg. queries 6.1

The lowest citation efficiency of any page type in the study. FAQ pages receive 1.9% of crawl traffic but earn just 0.4% of citations. AI reads your FAQ for context but never surfaces it as an answer.

[04]

Crawler Personalities

Each AI Has a Reading Style

GPTBot obsessively scans product catalogs - 84% of its visits land on product pages. Meta's crawler is even more extreme at 96%. OpenAI Search is lighter but still product-focused at 77%.

ClaudeBot is the outlier. It's the only crawler that reads broadly across page types - products, homepages, blogs, and categories all receive meaningful attention. "AI-optimized" isn't one strategy. It's four.

Optimizing for ChatGPT means product page structure. Optimizing for Claude means content diversity. There is no single "AI-optimized" strategy - only specific ones.

Page Type Distribution by Crawler
OpenAIE-commerce scanner
461K visits
Product Pages 84%Category 12%Other 3%
Meta AIPure product crawler
77K visits
Product Pages 96%Other 2%Homepage 2%
OpenAI SearchLighter product focus
21K visits
Product Pages 77%Other 17%Blog 4%
AnthropicMost diverse reader
2K visits
Other 62%Product Pages 16%Homepage 10%
[05]

Owned vs Third-Party

Your Own Content Gets Better Treatment

Only 1.8% of AI citations point to your own domain. The rest cite third-party sources about your category. But that small slice tells an important story.

When AI cites your own content, sentiment averages 76/100 - a +18 point lift versus third-party citations (58/100). Owned citations don't just increase visibility - they improve how AI portrays you.

Wikipedia alone captures 9.6% of all third-party citations. For most brands, the citation competition isn't competitors - it's the encyclopedia.

Your homepage is the stickiest citation. Once AI starts citing your homepage, it averages 35x repeat appearances. Invest in making your homepage AI-readable.

Sentiment Comparison
Your Content1.8% of citations
76
Third-Party Sources98.2% of citations
58
When AI cites your domain, it cites
Blog / Editorial
19.3%
Product Pages
7.9%
Homepage
6.6%
Service / Use Case
3.7%
About / Contact
2.8%
[06]

The Outliers

Beyond the big three, the matrix reveals outliers - page types with surprisingly high or low citation efficiency that most content strategies overlook.

1.9x
Service / Use Case Pages

The quiet overperformer. Solutions pages that frame your product around what it solves - not what it does - are cited nearly 2x more than they're crawled.

4.8x
Review / Directory Sites

The highest citation efficiency in the entire study. G2, Capterra, and Trustpilot are almost never crawled but frequently cited as authority signals by AI.

4.6x
About / Contact Pages

AI cites "who you are" far more than expected. Your about page isn't just for humans - it's a key AI trust signal.

3.8x
Resource / Report Pages

Whitepapers, templates, and research reports earn citations far beyond their crawl share. Authority content punches well above its weight.

0.6x
Comparison Pages

Surprisingly low despite high SEO value. AI engines may avoid brand-created comparisons, preferring third-party sources for impartiality.

0.6x
Legal / Terms Pages

Dead weight for citations. AI reads them but won't recommend them. Pure compliance, zero visibility value.

[07]

The Playbook

The citation efficiency data points to six investment priorities, each grounded in specific findings from above.

01

Invest in blog content

1.4x

20.2% of all AI citations point to blog content, with the highest repeat rate (8.0x) of any page type. Authoritative, data-rich editorial earns both first citations and repeat appearances across queries.

02

Enrich product pages

0.5x

Product pages receive 13% of crawls but earn only 6.3% of citations. Add expert context, use-case framing, and structured data. AI wants to recommend products - give it reasons to.

03

Build use-case pages

1.9x

Service and use-case pages are the quiet overperformer. Pages that explain what you solve - not what you sell - are cited nearly 2x more than they're crawled.

04

Don't count on FAQ

0.2x

The lowest citation efficiency of any page type. FAQ pages receive 1.9% of crawl visits but just 0.4% of citations. Use FAQ for user experience and support cost reduction, not AI visibility.

05

Get on review sites

4.8x

G2, Capterra, and Trustpilot earn AI citations far beyond their crawl share. Third-party credibility signals matter more to AI than first-party claims.

06

Know your AI audience

GPTBot reads 84% product pages. ClaudeBot reads broadly across types. Meta only sees products. Your optimization strategy depends on which AI engines your audience actually uses.

[08]

Methodology

Data Sources

Citation Data

337,362 unique URL-brand citation pairs extracted from AI-generated responses across 882 brands. Collected via Trakkr's citation snapshot pipeline from multiple AI providers.

Classification

Domain-aware URL classifier (v3) using path pattern matching combined with domain override sets. 58% of URLs classified into 13 page types; 42% remain "other." Citation efficiency scores are brand-normalized so each brand contributes equally regardless of volume.

Limitations

42% "Other" Classification

Many corporate pages have generic URL structures that resist URL-based classification. True distribution of classified types may be higher if we could resolve "other."

Snapshot-Based Analysis

Data reflects a point-in-time snapshot. AI citation patterns shift as models update - results indicate current trends, not permanent rankings.

Which of your page types are overperforming?

Track your citation efficiency by page type, see which crawlers visit what, and find the content gaps where AI reads but doesn't recommend.

Start monitoring free

Free trial included