patriotsubaru.com — AI Site Grade
Patriot Subaru's Akamai WAF returns a 403 to every tested AI crawler except the anthropic-ai user-agent, so GPTBot, PerplexityBot, and the others cannot see the site's inventory, its sitemap, or its detailed llms.txt.
- Findings: 9
- Evidence checks: 32
- Completed: 30 May 2026
Analysis
Akamai Blocks Every AI Crawler Except anthropic-ai: A Selective Gate
Patriot Subaru's site is invisible to every major AI crawler tested except anthropic-ai, which receives a 200 with full HTML. GPTBot, ClaudeBot, PerplexityBot, Google-Extended, ChatGPT-User, and OAI-SearchBot all receive a 403 from AkamaiGHost. The robots.txt also returns 403 to browser user-agents, and sitemap.xml is blocked as well, yet the site maintains a 338KB llms.txt that only anthropic-ai can read.
Crawler Access
The homepage at patriotsubaru.com and www.patriotsubaru.com both return 403 Access Denied from Akamai for every tested user-agent except anthropic-ai. The robots.txt (accessible only to anthropic-ai) explicitly disallows GPTBot, OAI-SearchBot, ChatGPT-User, Claude-User, Claude-SearchBot, CCBot, and PerplexityBot from /api/, /apis/, /pixall/, and static resource paths, but the Akamai WAF blocks these bots before they can even reach robots.txt. The sitemap.xml (218KB, containing hundreds of URLs including inventory pages and blog posts) also returns 403 to every tested user-agent other than anthropic-ai. The llms.txt exists and is well structured, with a description for each URL, but is served only to anthropic-ai.
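A check like this can be reproduced with a short script. The sketch below is an illustration only, not the tooling behind this report: it sends one GET per user-agent and records the status code. The function names are hypothetical, and sending the bare crawler token as the User-Agent header is a simplifying assumption, since real crawlers send longer strings.

```python
import urllib.error
import urllib.request
from typing import Callable, Dict, Iterable, List

# User-agent tokens named in this report. Real crawlers send longer
# User-Agent strings; using the bare token is a simplifying assumption.
AGENTS = [
    "anthropic-ai", "GPTBot", "ClaudeBot", "PerplexityBot",
    "Google-Extended", "ChatGPT-User", "OAI-SearchBot",
]


def fetch_status(url: str, user_agent: str, timeout: float = 10.0) -> int:
    """Return the HTTP status code for a GET sent with the given User-Agent."""
    request = urllib.request.Request(url, headers={"User-Agent": user_agent})
    try:
        with urllib.request.urlopen(request, timeout=timeout) as response:
            return response.status
    except urllib.error.HTTPError as error:
        # urllib raises on 4xx/5xx responses; the code is still the result.
        return error.code


def probe(url: str, agents: Iterable[str],
          fetch: Callable[[str, str], int] = fetch_status) -> Dict[str, int]:
    """Map each user-agent to the status code it receives for url."""
    return {agent: fetch(url, agent) for agent in agents}


def blocked_agents(results: Dict[str, int]) -> List[str]:
    """List the agents that received a 403, sorted for stable output."""
    return sorted(agent for agent, status in results.items() if status == 403)
```

Calling `probe("https://www.patriotsubaru.com/robots.txt", AGENTS)` would repeat the robots.txt check; the `fetch` parameter is injectable so the logic can be exercised without touching the network.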
Cold-Knowledge Gap
LLM knowledge describes Patriot Subaru as a "car dealership based in Saco, Maine" with "generally positive customer reviews" and notes it is "part of the Patriot Automotive Group." The site itself claims "Exclusive Lifetime Warranty, 1,700+ google reviews with a 4.8 rating. Highest rated Subaru dealer in Maine." These specific claims (the lifetime warranty, the exact review count and rating, and the "highest rated" positioning) are entirely absent from cold LLM knowledge. The model knows the dealership exists but cannot cite its primary differentiators.
Schema Posture
The homepage HTML (served to anthropic-ai) contains no visible JSON-LD schema in the extracted snippet. The page uses the ddc-site and v9-oem-subaru-0001-v1 CSS classes, indicating a Dealer.com platform build. Inventory pages like the Forester SRP and used Outback page also lack visible structured data in the extracted head sections. No AutoDealer, LocalBusiness, or Product schema types were detected on any fetched page.
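For readers who want to repeat the schema check on a fetched page, here is a minimal sketch using only the Python standard library. It is an assumption about how such a check could be done, not the method used for this report. The class and function names are hypothetical, and the sketch only reads top-level `@type` values, so types nested under `@graph` would be missed.

```python
import json
from html.parser import HTMLParser


class JsonLdCollector(HTMLParser):
    """Collect the text of every <script type="application/ld+json"> block."""

    def __init__(self):
        super().__init__()
        self._in_jsonld = False
        self._buffer = []
        self.blocks = []

    def handle_starttag(self, tag, attrs):
        script_type = (dict(attrs).get("type") or "").lower()
        if tag == "script" and script_type == "application/ld+json":
            self._in_jsonld = True
            self._buffer = []

    def handle_data(self, data):
        # Script content can arrive in several chunks, so buffer it.
        if self._in_jsonld:
            self._buffer.append(data)

    def handle_endtag(self, tag):
        if tag == "script" and self._in_jsonld:
            self.blocks.append("".join(self._buffer))
            self._in_jsonld = False


def schema_types(html: str) -> set:
    """Return the top-level @type values declared in a page's JSON-LD."""
    collector = JsonLdCollector()
    collector.feed(html)
    found = set()
    for raw in collector.blocks:
        try:
            data = json.loads(raw)
        except json.JSONDecodeError:
            continue  # Skip malformed blocks rather than fail the whole check.
        items = data if isinstance(data, list) else [data]
        for item in items:
            if not isinstance(item, dict):
                continue
            declared = item.get("@type")
            if isinstance(declared, str):
                found.add(declared)
            elif isinstance(declared, list):
                found.update(t for t in declared if isinstance(t, str))
    return found
```

An empty set from `schema_types(page_html)` corresponds to the "no schema types detected" result described above.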
External Signals
Web search returned zero indexed results for patriotsubaru.com across multiple queries: no reviews, no Reddit threads, and no press mentions surfaced. The llms.txt claims 1,700+ Google reviews at 4.8 stars, but no external verification was reachable. The site's DNS points to Akamai (2.19.248.22) and uses the ns1.dealer.com / ns2.dealer.com name servers, confirming the Dealer.com platform. The llms.txt references "serving Auburn, Portland area" and "serving Portland, Bath area shoppers," but cold model knowledge places the dealership only in "southern Maine and New England."
Findings
Akamai WAF blocks all AI crawlers except anthropic-ai (Severity: High)
Every major AI crawler (GPTBot, ClaudeBot, PerplexityBot, Google-Extended, ChatGPT-User, OAI-SearchBot) receives a 403 from AkamaiGHost. Only anthropic-ai gets a 200 with full HTML. This selective blocking prevents the site from being indexed or understood by most AI systems.
What to change: Remove the Akamai WAF rules that block non-Anthropic AI crawlers, or replace them with rate-limiting instead of blanket 403s. Ensure robots.txt is accessible to all bots and contains appropriate directives.
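robots.txt returns 403 to standard user-agents (Severity: High)
The robots.txt file at /robots.txt returns a 403 Access Denied when fetched with a standard browser user-agent. Only anthropic-ai can retrieve it. This undermines the Robots Exclusion Protocol rather than enforcing it: compliant crawlers cannot read the crawl rules, and how they react to a 403 varies. RFC 9309 allows a crawler to treat a 4xx response as an unavailable file and crawl without restrictions, while other clients treat a 403 as a full disallow.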
What to change: Configure the server to serve robots.txt with a 200 status to all user-agents, including standard browsers. Remove any WAF rules that block access to robots.txt.
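How a crawler reacts to a 403 on robots.txt depends on its implementation. As a rough sketch, the function below encodes the status handling described in RFC 9309. The function name and the returned labels are hypothetical, and individual crawlers may deviate from the RFC; Python's own `urllib.robotparser`, for example, treats 401 and 403 as disallow-all.

```python
def robots_fetch_outcome(status: int) -> str:
    """Classify a robots.txt fetch by HTTP status, following RFC 9309."""
    if 200 <= status < 300:
        # Successful fetch: the crawler parses and follows the rules.
        return "parse"
    if 300 <= status < 400:
        # Redirects should be followed before deciding anything.
        return "follow-redirect"
    if 400 <= status < 500:
        # "Unavailable": the crawler MAY access any resource on the server.
        return "unavailable: may crawl without restrictions"
    if 500 <= status < 600:
        # "Unreachable": the crawler MUST assume complete disallow.
        return "unreachable: assume complete disallow"
    return "undefined"
```

Under this reading, a 403 leaves the published rules unread, so the site's own disallow directives never reach the crawlers they name.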
Sitemap.xml blocked to all crawlers except anthropic-ai (Severity: High)
The sitemap.xml (218KB, containing hundreds of URLs) returns 403 to every tested user-agent other than anthropic-ai. This prevents search engines and AI crawlers from discovering the site's inventory and content pages.
What to change: Serve sitemap.xml with a 200 status to all user-agents. Remove any WAF rules blocking it.
llms.txt only accessible to anthropic-ai (Severity: High)
The site maintains a 338KB llms.txt with well-structured descriptions for each URL, but it is only served to anthropic-ai. Other AI crawlers receive a 403, defeating the purpose of the file.
What to change: Serve llms.txt with a 200 status to all AI crawlers. Remove any WAF rules blocking it.
No JSON-LD schema detected on homepage or inventory pages (Severity: Medium)
The homepage HTML (served to anthropic-ai) contains no visible JSON-LD schema. Inventory pages for Forester and used Outback also lack structured data. No AutoDealer, LocalBusiness, or Product schema types were found, which limits AI understanding of the site's offerings.
What to change: Add JSON-LD structured data for LocalBusiness, AutoDealer, and Product/Vehicle types to all relevant pages. Include schema for inventory items with make, model, year, price, and availability.
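As an illustration of what such markup could look like, the sketch below builds AutoDealer and Vehicle JSON-LD with the Python standard library. The helper names are hypothetical, and the vehicle values used in any call are placeholders rather than real inventory data. The street address and phone are omitted because they are not quoted in this report, and the rating is optional because the site states its review count only as "1,700+".

```python
import json


def auto_dealer_jsonld(name, url, locality, region,
                       rating_value=None, review_count=None):
    """Build a schema.org AutoDealer object as a plain dict."""
    data = {
        "@context": "https://schema.org",
        "@type": "AutoDealer",
        "name": name,
        "url": url,
        "address": {
            "@type": "PostalAddress",
            "addressLocality": locality,
            "addressRegion": region,
        },
    }
    # Only emit a rating when both exact figures are known.
    if rating_value is not None and review_count is not None:
        data["aggregateRating"] = {
            "@type": "AggregateRating",
            "ratingValue": rating_value,
            "reviewCount": review_count,
        }
    return data


def vehicle_jsonld(make, model, year, price, currency="USD", in_stock=True):
    """Build a schema.org Vehicle object with a nested Offer."""
    availability = "InStock" if in_stock else "OutOfStock"
    return {
        "@context": "https://schema.org",
        "@type": "Vehicle",
        "brand": {"@type": "Brand", "name": make},
        "model": model,
        "vehicleModelDate": str(year),
        "offers": {
            "@type": "Offer",
            "price": price,
            "priceCurrency": currency,
            "availability": "https://schema.org/" + availability,
        },
    }


def as_script_tag(data):
    """Serialize a JSON-LD dict into a script tag for the page head."""
    return ('<script type="application/ld+json">'
            + json.dumps(data, indent=2) + "</script>")
```

For example, `as_script_tag(auto_dealer_jsonld("Patriot Subaru", "https://www.patriotsubaru.com/", "Saco", "ME"))` yields a tag that could be placed in the homepage head, with the rating added once an exact review count is available.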
Cold LLM knowledge lacks site's primary differentiators (Severity: Medium)
LLM knowledge describes Patriot Subaru as a dealership in Saco, Maine with generally positive reviews, but misses the site's claimed 'Exclusive Lifetime Warranty, 1,700+ google reviews with a 4.8 rating, highest rated Subaru dealer in Maine.' These differentiators are absent from model knowledge.
What to change: Ensure these claims are prominently featured in crawlable text and structured data on the homepage and about page. The llms.txt already cites the review count and rating; confirm it also covers the lifetime warranty and the "highest rated" claim, and make the file accessible to all crawlers.
Zero indexed pages in web search results (Severity: High)
Multiple web searches for patriotsubaru.com and related queries returned zero results. The site appears to have no indexed pages in major search engines, likely due to the Akamai blocking.
What to change: Resolve the Akamai blocking issue to allow search engine crawlers to index the site. Submit sitemap to Google Search Console and Bing Webmaster Tools.
robots.txt explicitly disallows several AI crawlers from key paths (Severity: Medium)
The robots.txt (accessible only to anthropic-ai) disallows GPTBot, OAI-SearchBot, ChatGPT-User, Claude-User, Claude-SearchBot, CCBot, and PerplexityBot from /api/, /apis/, /pixall/, and static resource paths. Because Akamai already blocks these bots, the directives have no practical effect today; they would start to restrict access once the WAF block is lifted.
What to change: Review and remove unnecessary disallow directives for AI crawlers. Allow access to content pages while blocking only sensitive paths like /api/ if needed.
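One way to review the directives is to test them offline with Python's `urllib.robotparser`. The sketch below is illustrative: `SAMPLE_ROBOTS` is a hypothetical file written from the description in this report, not a copy of the site's actual robots.txt, and the tested URLs are example paths.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical rules modeled on the report's description: named AI bots
# are kept out of API-style paths, everything else stays open.
SAMPLE_ROBOTS = """\
User-agent: GPTBot
User-agent: PerplexityBot
Disallow: /api/
Disallow: /apis/
Disallow: /pixall/

User-agent: *
Allow: /
"""


def load_rules(robots_text: str) -> RobotFileParser:
    """Parse robots.txt text without fetching anything over the network."""
    parser = RobotFileParser()
    parser.parse(robots_text.splitlines())
    return parser


def allowed(parser: RobotFileParser, agent: str, url: str) -> bool:
    """Report whether the parsed rules let this agent fetch this URL."""
    return parser.can_fetch(agent, url)
```

With these sample rules, GPTBot is refused under /api/ but may still fetch content pages such as contact.htm, which matches the recommendation to block only sensitive paths.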
No external reviews or mentions found in web search (Severity: Low)
Web searches for 'Patriot Subaru Saco Maine' and related terms returned zero results. The site claims 1,700+ Google reviews with a 4.8 rating, but no external verification was reachable.
What to change: Encourage customers to leave reviews on Google and other platforms. Ensure the site's review count and rating are verifiable via structured data and linked review pages.
What's working
- Well-structured llms.txt with detailed descriptions — The site maintains a 338KB llms.txt file with descriptions for each URL, providing a clear AI-friendly overview of the site's content. This is a strong asset for AI visibility, though currently only accessible to anthropic-ai.
- Anthropic crawler receives full HTML content — The anthropic-ai crawler successfully retrieves full HTML pages (e.g., homepage, inventory pages, contact page) with substantial content, indicating the site's content is well-structured for at least one AI crawler.
- Comprehensive sitemap.xml with hundreds of URLs — The sitemap.xml (218KB) contains hundreds of URLs including inventory pages and blog posts, providing a complete map of the site's content. This is a strong foundation for indexing once access is opened.
- Contact page with address and phone accessible to anthropic — The contact page (contact.htm) is served to anthropic-ai and contains the dealership's address and phone number, which helps AI models provide accurate location information.