AI Site Grade

mckessoncompile.com — AI Site Grade

McKesson Compile's content-rich data site is invisible to AI knowledge systems due to zero schema markup, no AI bot references in robots.txt, and no external citations.

McKesson Compile operates a legitimate, content-rich data provider site that remains invisible to AI knowledge systems because of zero schema markup, no AI bot references in robots.txt, and no external citations.

Findings: 8
Evidence checks: 26
Completed: 30 May 2026

Analysis

I have enough data to write a thorough audit. Let me compile the findings.

McKesson Compile: A data-rich site invisible to AI knowledge

The cold LLM has zero prior knowledge of mckessoncompile.com — it cannot confirm the domain's legitimacy, purpose, or relationship to McKesson Corporation — despite the site being a fully operational, content-rich data provider owned by one of the largest healthcare companies in the U.S.

Crawler Access

All major AI crawlers — GPTBot, ClaudeBot, PerplexityBot, Google-Extended, OAI-SearchBot, Bytespider, Applebot-Extended, ChatGPT-User, anthropic-ai — receive a full 200 response with identical byte payload (97,537 bytes) as a browser baseline. No UA-based blocking, no Cloudflare challenge, no JS shell. The robots.txt is permissive (only HubSpot CMS preview paths disallowed) but mentions zero AI bots by name. The /llms.txt returns a 404 (HubSpot-branded error page). The site runs on HubSpot CMS behind Cloudflare with HSTS and CSP headers present.

Cold-Knowledge Gap

The LLM queried cold on mckessoncompile.com returned: *"I do not have any specific, verifiable information... cannot confirm this site is an official or legitimate McKesson property."* This is a complete knowledge vacuum. The site itself is a legitimate McKesson subsidiary (acquired January 2024) offering Provider 360 ReadyData (6.8M+ HCPs, 300M+ longitudinal patients, 98% payer coverage for Medicare claims), real-world data analytics, and oncology-specific datasets. The gap between the LLM's blank slate and the site's substantive positioning as "analytics-ready data crafted for life sciences" is total.

Schema Posture

Every page inspected — homepage, about-us, provider-360, oncology, case-studies, newsroom, blog posts — contains zero JSON-LD schema of any type. No Organization, WebSite, Product, FAQPage, Article, or BreadcrumbList markup exists. The site has rich structured content (statistics, comparison language, use-case descriptions, customer testimonials) that would map naturally to schema types, but none is deployed. The blog listing page (/blog) returns zero visible text — a JS-rendered shell that AI crawlers see as empty.

External Signals

External search returns zero indexed results for "McKesson Compile" across multiple queries — no press coverage, no reviews, no Reddit threads, no industry analyst mentions surfaced via DuckDuckGo. The newsroom page does link to external press (Forbes, CDO Magazine, Endpoints News, McKesson corporate blog), but these are not discoverable through general web search for the brand name. The site's LinkedIn presence (linkedin.com/company/compile-inc/) and customer login (accounts.compile.com) point to the pre-acquisition "Compile" brand identity, creating a fragmented citation trail.

Findings

Zero JSON-LD schema markup on any page High
Every inspected page lacks any JSON-LD schema (Organization, WebSite, Product, FAQPage, Article, BreadcrumbList), despite rich structured content that would naturally map to these types.
What to change: Add JSON-LD schema for Organization, WebSite, Product, and Article types on all pages, using the McKesson Compile brand and Provider 360 product details.
Complete cold-knowledge vacuum for the domain High
The LLM has no prior knowledge of mckessoncompile.com, cannot confirm its legitimacy or relationship to McKesson, despite the site being a fully operational data provider owned by McKesson.
What to change: Implement schema markup, publish an llms.txt file, and build external citations to establish the domain's identity in AI knowledge bases.
No /llms.txt file for AI guidance Medium
The /llms.txt endpoint returns a 404 error, missing an opportunity to provide AI crawlers with a structured summary of the site's content and resources.
What to change: Create an llms.txt file that describes the site's purpose, key pages, and data offerings for AI crawlers.
Robots.txt does not reference any AI bots Medium
The robots.txt file is permissive but mentions zero AI crawlers by name, missing the chance to explicitly welcome or guide them.
What to change: Add explicit directives for GPTBot, ClaudeBot, PerplexityBot, and other AI crawlers to ensure optimal crawling.
Blog listing page renders as empty JS shell High
The /blog page returns zero visible text content, appearing as a JavaScript-rendered shell that AI crawlers see as empty.
What to change: Implement server-side rendering or static HTML for the blog listing to ensure content is visible to crawlers.
Zero external search results for the brand High
Multiple web searches for 'McKesson Compile' and related terms return no indexed results, indicating no press coverage, reviews, or industry mentions are discoverable.
What to change: Build external citations through press releases, industry publications, and backlinks from McKesson's main domain.
Fragmented brand identity across domains Medium
The site uses mckessoncompile.com, but customer login and LinkedIn presence point to the pre-acquisition 'Compile' brand (compile.com, linkedin.com/company/compile-inc/), creating a fragmented citation trail.
What to change: Consolidate brand assets under the mckessoncompile.com domain and update external profiles to reflect the McKesson Compile name.
No breadcrumb or navigation schema Low
The site lacks BreadcrumbList schema, which helps AI crawlers understand site structure and page relationships.
What to change: Add BreadcrumbList schema to all pages to improve navigation understanding.

What's working

All major AI crawlers receive full content access — All 11 tested AI crawlers receive a 200 response with identical content as a browser, with no UA-based blocking or Cloudflare challenges.
Rich, substantive content on key pages — Pages like Provider 360 and Oncology contain detailed, data-rich content (6.8M HCPs, 300M+ patients, 98% payer coverage) that is valuable for AI training and retrieval.
Newsroom page links to external press coverage — The newsroom page includes links to Forbes, CDO Magazine, Endpoints News, and McKesson corporate blog, providing citation sources.
Permissive robots.txt allows full crawling — The robots.txt file only disallows HubSpot CMS preview paths, allowing all crawlers to access the entire site.
Sitemap available with 44 URLs — A sitemap is present and lists 44 URLs, helping crawlers discover site content.

Track mckessoncompile.com across AI search

This is one snapshot. Open the interactive report to inspect evidence, or grade another site free.

Open this AI Site Grade Grade another site Track your brand

Analysis

McKesson Compile: A data-rich site invisible to AI knowledge

Crawler Access

Cold-Knowledge Gap

Schema Posture

External Signals

Findings

Zero JSON-LD schema markup on any page High

Complete cold-knowledge vacuum for the domain High

No /llms.txt file for AI guidance Medium

Robots.txt does not reference any AI bots Medium

Blog listing page renders as empty JS shell High

Zero external search results for the brand High

Fragmented brand identity across domains Medium

No breadcrumb or navigation schema Low

What's working

Track mckessoncompile.com across AI search