transitionshealthcarellc.com — AI Site Grade
Transitions Healthcare's Cloudflare configuration silently blocks ClaudeBot and GPTBot while allowing other AI crawlers. The site also lacks LocalBusiness schema on facility pages and showed no external search footprint in this check, both of which limit its AI visibility.
- Findings: 10
- Evidence checks: 23
- Completed: 30 May 2026
Analysis
ClaudeBot and GPTBot are blocked at the Cloudflare layer while most other AI crawlers pass through. This selective blockade leaves the site partially invisible to two of the most widely used training crawlers.
Crawler Access
The site runs on Cloudflare (A records point to 104.26.8.168 and 172.67.74.116) in front of a WordPress installation using the All in One SEO plugin. compare_bot_access on the homepage reveals a stark split. ClaudeBot and GPTBot both receive HTTP 429 (Too Many Requests, the status used for rate limiting) with only 162 bytes returned. Google-Extended, OAI-SearchBot, ChatGPT-User, PerplexityBot, Perplexity-User, anthropic-ai, and Applebot-Extended all get a full 200 response with a 53,838-byte payload, the same size as the browser baseline. Bytespider gets HTTP 520, Cloudflare's status for an unknown or unexpected response from the origin server. The robots.txt contains no AI-bot-specific directives, only a blanket Disallow: /wp-admin/ and a Crawl-delay: 10. The selective blocking is therefore not coming from robots.txt. It most likely happens at the Cloudflare edge (a WAF, rate-limiting, or bot-management rule), so the site's operators may not be aware that ClaudeBot and GPTBot are being silently turned away.
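The split described above can be expressed as a small classification over status code and payload size. The sketch below is illustrative only: the `Probe` type, the `classify` function, and the 5% size tolerance are assumptions and are not part of compare_bot_access. The report does not state Bytespider's body size, so 0 is a placeholder.

```python
from dataclasses import dataclass


@dataclass
class Probe:
    agent: str   # user-agent token sent with the request
    status: int  # HTTP status code returned
    size: int    # response body size in bytes


def classify(probe: Probe, baseline_size: int, tolerance: float = 0.05) -> str:
    """Label a probe relative to the browser baseline payload size."""
    if probe.status == 200:
        close = abs(probe.size - baseline_size) <= tolerance * baseline_size
        return "served" if close else "altered"
    if probe.status in (401, 403, 429):
        return "blocked"
    return "error"


BASELINE = 53_838  # browser baseline size reported for the homepage

probes = [
    Probe("ClaudeBot", 429, 162),
    Probe("GPTBot", 429, 162),
    Probe("PerplexityBot", 200, 53_838),
    Probe("Bytespider", 520, 0),  # size not reported; placeholder value
]

report = {p.agent: classify(p, BASELINE) for p in probes}
for agent, label in report.items():
    print(f"{agent}: {label}")
```

Separating "blocked" from "error" keeps the 429 responses, which look like a deliberate rule, apart from the 520, which may have a different cause.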
Content & Schema Posture
The homepage and all subpages carry consistent Organization JSON-LD schema with name, URL, description, and sameAs links to Facebook, X (Twitter), and LinkedIn. However, no individual facility page uses LocalBusiness or MedicalBusiness schema. The seven communities (Allens Cove, Autumn Grove, Gettysburg, North Huntingdon, Oakland Manor, Shook Home, Washington PA) are listed as plain text with addresses and phone numbers only. No FAQPage, Product, Service, or Review schema exists anywhere on the site. The blog is active (most recent post dated May 27, 2026) and the sitemap is dynamically generated with 438 KB of URLs, including image entries. The llms.txt file (159 KB, auto-generated by All in One SEO) is present and well-populated with full blog post summaries, a strong signal for LLM indexing that partially compensates for the bot blockade.
Cold-Knowledge Gap
The LLM's prior knowledge describes Transitions Healthcare as operating "about 10-15 facilities" with a focus on skilled nursing, post-acute care, and memory support. The site itself lists exactly 7 communities (6 in Pennsylvania, 1 in Maryland), so the model overestimates the facility count by roughly 1.4x to 2.1x. The model also mentions "Transitions to Home" as a program name, whereas the site calls it "Transitions Rehab to Home" (a subtle but consistent branding mismatch). The model knows nothing about the American Heart Association Heart Failure Certification earned by North Huntingdon and Gettysburg, the on-site dialysis partnership with Dialyze Direct, or the WellSpan Health collaboration at Gettysburg, all of which are prominent on the facility pages.
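As a quick check on the size of the overestimate, the model's stated range can be divided by the seven communities the site lists. The variable names below are illustrative.

```python
listed = 7                   # communities listed on the site
model_low, model_high = 10, 15  # range from the model's prior knowledge

# Overestimation factor at each end of the model's range
ratios = (round(model_low / listed, 2), round(model_high / listed, 2))
print(ratios)
```

The upper end of the range is where the "2x" figure comes from, while the lower end is closer to 1.4x.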
External Signals
Web searches returned zero indexed results for the brand name combined with location or review keywords: no Reddit threads, no news articles, and no CMS rating citations surfaced through DuckDuckGo. The site has no markup referencing a Google Business Profile and no review aggregation. The only external links visible on the site are outbound: to the payment portal (collectcheckout.com), the AHA certification page, and the site developer (advantageim.com). This near-total absence of external footprint means AI models have almost no third-party signals to triangulate against, which makes the site's own content and schema the sole source of truth. Because ClaudeBot and GPTBot are blocked, models that depend on those two crawlers are working from stale or absent data.
Findings
ClaudeBot and GPTBot blocked by Cloudflare WAF High
ClaudeBot and GPTBot receive HTTP 429 (rate-limited) responses with only 162 bytes, while other AI crawlers get full 200 responses. The robots.txt has no AI-bot directives, so the blocking most likely occurs at the Cloudflare layer (a WAF, rate-limiting, or bot-management rule), possibly without the site operators' knowledge.
What to change: Identify the Cloudflare rule that blocks ClaudeBot and GPTBot and remove it, or add explicit allow rules for these crawlers.
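An explicit allow rule would need an expression that matches the two crawlers by user agent. The sketch below builds such an expression in Cloudflare's rules language, using the `http.user_agent` field and the `contains` operator. The helper name `user_agent_expression` is hypothetical. Which rule is responsible for the 429 responses is not known from this check, so the expression is a starting point under that assumption, to be paired with an allow-style action such as Skip once the blocking rule has been identified.

```python
def user_agent_expression(tokens: list[str]) -> str:
    """Build a rule expression matching any of the given user-agent tokens."""
    clauses = [f'(http.user_agent contains "{token}")' for token in tokens]
    return " or ".join(clauses)


expr = user_agent_expression(["ClaudeBot", "GPTBot"])
print(expr)
```

User-agent strings are easy to spoof, so a rule like this trades some protection for crawler access. Whether that trade is acceptable is a decision for the site's operators.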
No LocalBusiness or MedicalBusiness schema on facility pages High
Each of the seven community pages lists addresses and phone numbers as plain text without any structured data markup. This prevents AI crawlers from extracting facility details as structured entities.
What to change: Add LocalBusiness or MedicalBusiness JSON-LD schema to each facility page with name, address, phone, and relevant certifications.
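A minimal sketch of what such markup could look like, generated here with Python for illustration. The street address, postal code, phone number, and facility URL are placeholders, not values taken from the site. The facility name, the parent organization name, and the `https` scheme are also assumptions. `MedicalBusiness` is a subtype of `LocalBusiness` in schema.org.

```python
import json


def facility_jsonld(name, street, city, region, postal_code, phone, url):
    """Return a schema.org MedicalBusiness description as a dict."""
    return {
        "@context": "https://schema.org",
        "@type": "MedicalBusiness",
        "name": name,
        "url": url,
        "telephone": phone,
        "address": {
            "@type": "PostalAddress",
            "streetAddress": street,
            "addressLocality": city,
            "addressRegion": region,
            "postalCode": postal_code,
            "addressCountry": "US",
        },
        "parentOrganization": {
            "@type": "Organization",
            "name": "Transitions Healthcare",  # assumed display name
            "url": "https://transitionshealthcarellc.com/",
        },
    }


# All values below are placeholders for illustration only.
doc = facility_jsonld(
    name="Gettysburg",
    street="123 Example Street",
    city="Gettysburg",
    region="PA",
    postal_code="00000",
    phone="+1-000-000-0000",
    url="https://transitionshealthcarellc.com/example-facility/",
)

snippet = (
    '<script type="application/ld+json">\n'
    + json.dumps(doc, indent=2)
    + "\n</script>"
)
print(snippet)
```

The printed `<script>` element is what would be embedded in each facility page, with real values substituted for the placeholders.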
Zero indexed search results for brand and locations High
Web searches (run through DuckDuckGo) for the brand name combined with location or review keywords returned no results. The site also has no markup referencing a Google Business Profile, no review aggregation, and no visible backlinks from authoritative healthcare directories.
What to change: Create and verify Google Business Profiles for each facility, encourage reviews, and build backlinks from healthcare directories and local news.
LLM prior knowledge overestimates facility count by up to 2x Medium
The LLM's prior knowledge states Transitions Healthcare operates 'about 10-15 facilities', but the site lists exactly 7 communities. This discrepancy may cause AI-generated answers to fabricate facilities.
What to change: Add a clear facility count and list on the homepage or about page, and ensure the llms.txt file includes an accurate facility list.
Program name mismatch: 'Transitions Rehab to Home' vs 'Transitions to Home' Medium
The LLM's prior knowledge refers to a program called 'Transitions to Home', but the site uses 'Transitions Rehab to Home'. This mismatch can cause AI-generated content to use the wrong program name.
What to change: Ensure the program name is consistently used across all site content and included in the llms.txt file.
AHA Heart Failure Certification and partnerships not marked up with schema Medium
North Huntingdon and Gettysburg pages mention American Heart Association Heart Failure Certification and partnerships with Dialyze Direct and WellSpan Health, but these are not represented in structured data.
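What to change: Add structured data for the certifications, for example a hasCredential or award property on each facility's LocalBusiness entry. For the partnerships, note that subOrganization describes a child organization rather than an external partner, so name Dialyze Direct and WellSpan Health in the facility's description field instead.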
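One way to represent the certification is schema.org's `hasCredential` property, which accepts an `EducationalOccupationalCredential`. The sketch below is an assumption about how that could be modelled, not a confirmed best practice for healthcare certifications. The helper `with_credential` and the minimal facility record are hypothetical.

```python
import copy
import json

# Credential description; property choices are an illustrative assumption.
AHA_CREDENTIAL = {
    "@type": "EducationalOccupationalCredential",
    "name": "American Heart Association Heart Failure Certification",
    "credentialCategory": "certification",
    "recognizedBy": {
        "@type": "Organization",
        "name": "American Heart Association",
    },
}


def with_credential(facility: dict, credential: dict) -> dict:
    """Return a copy of the facility markup with the credential attached."""
    updated = copy.deepcopy(facility)
    updated.setdefault("hasCredential", []).append(credential)
    return updated


# Minimal placeholder record for one of the two certified facilities.
facility = {
    "@context": "https://schema.org",
    "@type": "MedicalBusiness",
    "name": "North Huntingdon",
}

marked_up = with_credential(facility, AHA_CREDENTIAL)
print(json.dumps(marked_up, indent=2))
```

Returning a copy keeps the base facility record reusable for pages that do not hold the certification.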
No FAQPage, Product, or Service schema on the site Medium
The site lacks any FAQPage, Product, or Service schema, which limits the ability of AI crawlers to extract common questions or service offerings as structured entities.
What to change: Add FAQPage schema to the blog or FAQ section, and Service schema for each type of care offered (skilled nursing, memory support, etc.).
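A minimal sketch of FAQPage markup, assuming the questions and answers are taken from text already visible on the page. The helper name `faq_jsonld` is hypothetical, and the question and answer shown are placeholders, not content from the site.

```python
import json


def faq_jsonld(pairs):
    """Build schema.org FAQPage markup from (question, answer) pairs."""
    return {
        "@context": "https://schema.org",
        "@type": "FAQPage",
        "mainEntity": [
            {
                "@type": "Question",
                "name": question,
                "acceptedAnswer": {"@type": "Answer", "text": answer},
            }
            for question, answer in pairs
        ],
    }


# Placeholder content; real entries should mirror the page's visible text.
faq = faq_jsonld([
    ("Example question?", "Example answer copied from the page."),
])
print(json.dumps(faq, indent=2))
```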
Bytespider receives HTTP 520 error from Cloudflare Medium
Bytespider (ByteDance's crawler) gets an HTTP 520 on the homepage, the status Cloudflare returns when the origin server sends an unknown or unexpected response. In this check it could not retrieve the homepage. Other pages were not tested.
What to change: Review Cloudflare and origin server logs to determine why Bytespider requests fail, and add an allow rule if access is desired.
No review or rating schema anywhere on the site Low
The site does not implement Review or AggregateRating schema, missing an opportunity to display star ratings or testimonials in search results and AI answers.
What to change: Add Review schema for testimonials and AggregateRating schema if any third-party ratings exist.
Blog page contains only 48 words of visible content Low
The blog listing page at /blog/ has only 48 words of visible text, which may limit its value for AI crawlers despite the llms.txt file including full summaries.
What to change: Add more descriptive text or excerpts on the blog listing page to improve crawlability.
What's working
- llms.txt file present with full blog summaries — The site has a 159KB llms.txt file auto-generated by All in One SEO, containing full blog post summaries that help LLMs index content despite crawler blocks.
- Consistent Organization JSON-LD schema on all pages — Every page includes Organization schema with name, URL, description, and sameAs links to social media, providing a solid foundation for entity recognition.
- Most AI crawlers allowed through Cloudflare — Google-Extended, OAI-SearchBot, ChatGPT-User, PerplexityBot, Perplexity-User, anthropic-ai, and Applebot-Extended all receive full 200 responses, ensuring broad AI visibility.
- Robots.txt has no AI-bot blocking directives — The robots.txt only disallows /wp-admin/ and sets a crawl delay, meaning no AI crawlers are explicitly blocked at the robots.txt level.
- Large dynamic sitemap with 438KB of URLs — The sitemap is dynamically generated and includes many URLs with image entries, helping crawlers discover all pages.
- Active blog with recent posts — The blog has a recent post dated May 27, 2026, indicating fresh content that can be indexed by AI crawlers.
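The robots.txt point above can be checked with Python's standard `urllib.robotparser`. The sketch below is a minimal illustration. The robots.txt text is reconstructed from the two directives the report describes and is an assumption about the file's exact layout, including the `User-agent: *` line.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the directives described in the report (assumed layout).
ROBOTS_TXT = """\
User-agent: *
Disallow: /wp-admin/
Crawl-delay: 10
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

home = "https://transitionshealthcarellc.com/"
admin = "https://transitionshealthcarellc.com/wp-admin/"

for agent in ("GPTBot", "ClaudeBot"):
    print(agent, parser.can_fetch(agent, home), parser.can_fetch(agent, admin))

delay = parser.crawl_delay("GPTBot")
print("crawl delay:", delay)
```

Under this reconstruction both crawlers are permitted on the homepage by robots.txt, which is consistent with the blocking happening elsewhere.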