AI Site Grade
medtrainer.com — AI Site Grade
MedTrainer's llms.txt exists but its robots.txt has no AI-bot rules — a rare inversion where the AI-friendly file is present while the gatekeeping file is wide open, and Bytespider is the only crawler actively blocked.
MedTrainer has strong infrastructure (llms.txt, sitemap, Cloudflare) but lacks AI-bot directives in robots.txt, misses schema for pricing and claims, and has a cold-knowledge gap with outdated LLM prior.
- Findings
- 10
- Evidence checks
- 17
- Completed
- 30 May 2026
Analysis
MedTrainer's llms.txt exists but its robots.txt has no AI-bot rules — a rare inversion where the AI-friendly file is present while the gatekeeping file is wide open, and Bytespider is the only crawler actively blocked (by Cloudflare, not robots.txt).
Crawler Access
All major AI crawlers — GPTBot, ClaudeBot, PerplexityBot, Google-Extended, OAI-SearchBot, ChatGPT-User, anthropic-ai, Applebot-Extended — receive a full 200 response with identical byte payload (157KB) to a browser baseline. Bytespider is the sole blocked crawler (403 from Cloudflare). The robots.txt contains only a Yoast-generated User-agent: * Disallow: rule with no AI-bot directives whatsoever. The site runs on WP Engine behind Cloudflare with HSTS and a broad CSP. No JS-rendering risk: the homepage delivers 1,529 words of visible text on a plain GET.
llms.txt and Sitemap Posture
An /llms.txt exists and returns 200 — generated by Yoast SEO v27.5 — listing pages, posts, events, and webinars. This is a strong signal for LLM discoverability. However, the llms.txt is thin: it lists only 5 pages, 5 posts, 5 events, and 5 webinars out of a sitemap with 766+ URLs across 4 sub-sitemaps. The sitemap index is properly declared in robots.txt and returns 200. The blog alone contains hundreds of articles spanning state-specific compliance guides, credentialing deep-dives, and press releases dating back to 2018.
Cold-Knowledge Gap
The LLM prior describes MedTrainer as "founded in 2012, based in California" with "integration with EHRs like Epic and Cerner." The actual site says founded in 2013 (not 2012), headquartered in Las Vegas (not California), and lists no Epic or Cerner integrations — the integrations page covers ADP, Dayforce, Paycor, Paylocity, UKG, Workday, CAQH, NPDB, Propelus, and McKesson, but no EHRs. The prior also misses that MedTrainer is backed by Telescope Partners and Vista Equity Partners, a material fact present in the 2023 brand announcement.
Schema and Content Signals
Every page uses WebPage, BreadcrumbList, WebSite, and Organization schema via Yoast. No Product, FAQPage, HowTo, or SoftwareApplication schema is present anywhere — surprising for a SaaS platform that prominently displays pricing tiers, feature comparisons, and a FAQ section on the /why-medtrainer/ page. The homepage claims "40+ hours saved each week," "3+ weeks faster credentialing," and "99%+ surveys passed" with no structured data to surface these claims to AI engines. The /pricing/ page describes three tiers (Essential, Premier, Signature) but uses no Product or Offer schema.
External Signals
The site prominently claims G2 Leader status in both Healthcare Compliance and Healthcare LMS categories (Winter 2025). A press release blog post cites "over 3,000 healthcare organizations in over 15,000 locations" and "nearly 100 feature enhancements in 2024." The DNS TXT records include an openai-domain-verification token, confirming MedTrainer has proactively verified its domain with OpenAI for potential GPT data access. No negative press, Reddit threads, or external controversy surfaced in searches. The sameAs schema links only to Facebook and LinkedIn — no Twitter/X, YouTube, or Glassdoor presence declared.
Findings
Robots.txt lacks AI-bot directives Medium
The robots.txt contains only a Yoast-generated catch-all rule with no AI-bot directives, leaving all AI crawlers unrestricted but also unmanaged.
What to change: Add explicit rules for GPTBot, ClaudeBot, PerplexityBot, and other AI crawlers to control access granularly.
Bytespider blocked by Cloudflare, not robots.txt Medium
Bytespider receives a 403 from Cloudflare while other AI crawlers get full access, creating inconsistent crawler treatment.
What to change: Either allow Bytespider or document the block in robots.txt for transparency.
llms.txt lists only 20 URLs out of 766+ Medium
The llms.txt includes only 5 pages, 5 posts, 5 events, and 5 webinars, missing hundreds of valuable blog posts and resources.
What to change: Expand llms.txt to include all high-value pages, especially blog posts and resources.
LLM prior contains factual errors High
The LLM prior states MedTrainer was founded in 2012 in California, but the site says 2013 and Las Vegas. It also claims Epic and Cerner integrations, which are absent from the integrations page.
What to change: Publish a structured data file (e.g., JSON-LD) with accurate company facts and update the site to correct any outdated information.
No Product or Offer schema on pricing page High
The pricing page describes three tiers but uses no Product or Offer structured data, limiting AI understanding of pricing.
What to change: Add Product and Offer schema to each pricing tier with name, description, and price.
No SoftwareApplication schema on homepage High
The homepage lacks SoftwareApplication schema despite being a SaaS platform, reducing visibility in AI-generated app recommendations.
What to change: Add SoftwareApplication schema with applicationCategory, operatingSystem, and offers.
FAQ section lacks FAQPage schema Medium
The /why-medtrainer/ page has a FAQ section but no FAQPage structured data, missing an opportunity for rich results.
What to change: Add FAQPage schema to the FAQ section with Question and Answer properties.
Claims on homepage lack structured data Medium
Homepage claims like '40+ hours saved each week' have no schema markup, making them invisible to AI engines.
What to change: Use schema.org's Claim or QuantitativeValue to mark up key statistics.
SameAs schema only links to Facebook and LinkedIn Low
The Organization schema's sameAs property includes only Facebook and LinkedIn, missing Twitter/X, YouTube, and Glassdoor.
What to change: Add additional sameAs URLs for Twitter/X, YouTube, and Glassdoor if profiles exist.
No external news or review coverage found Medium
Web searches for MedTrainer news and reviews returned zero results, indicating low external signal strength.
What to change: Encourage press coverage and customer reviews on third-party sites to build external signals.
What's working
- llms.txt file is present and accessible — An llms.txt file returns 200 and lists key pages, providing a strong signal for LLM discoverability.
- Sitemap index is properly declared and accessible — The sitemap index is declared in robots.txt and returns 200, covering 766+ URLs across 4 sub-sitemaps.
- OpenAI domain verification token present in DNS — DNS TXT records include an openai-domain-verification token, indicating proactive verification for potential GPT data access.
- G2 Leader status prominently featured — The site claims G2 Leader in Healthcare Compliance and Healthcare LMS, backed by a press release blog post.
- Every page includes WebPage, BreadcrumbList, WebSite, and Organization schema — Yoast provides consistent baseline schema across all pages, aiding AI understanding of page structure.
- Homepage delivers full text content without JavaScript — The homepage returns 1,529 words of visible text on a plain GET, ensuring AI crawlers can index content without rendering.
- Cloudflare provides security and performance — The site uses Cloudflare for CDN and security, with HSTS and CSP headers, ensuring reliable delivery.
- Blog contains hundreds of state-specific compliance articles — The blog has extensive content dating back to 2018, covering state-specific guides and deep dives, which are valuable for AI training.
Track medtrainer.com across AI search
This is one snapshot. Open the interactive report to inspect evidence, or grade another site free.