motorworldgroup.com — AI Site Grade

MotorWorld Group's Akamai WAF returns 403 to every major AI crawler tested except anthropic-ai, leaving the site invisible to GPTBot, ClaudeBot, PerplexityBot, and Google-Extended. The block looks like a WAF misconfiguration. The site also has zero structured data, no external search presence, and a total cold-knowledge gap.

Findings: 12
Evidence checks: 43
Completed: 30 May 2026

Analysis

MotorWorld / MileOne Autogroup — AI-Visibility Audit

The site is invisible to every major AI crawler tested except anthropic-ai, and even that access appears accidental rather than intentional. The Akamai WAF returns a 403 to GPTBot, ClaudeBot, PerplexityBot, Google-Extended, OAI-SearchBot, and standard browsers, while anthropic-ai (the training crawler, not Claude-User or Claude-SearchBot) passes through to nginx on the backend and receives the full 336KB page. This points to a firewall misconfiguration, not a deliberate AI strategy.
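The 403-versus-200 split described above can be reproduced in spirit with a small script. The following is a minimal sketch, not the audit's own tooling: the function names (`fetch_status`, `probe_user_agents`, `blocked_agents`) and the shortened User-Agent strings are assumptions made for illustration, and real crawlers send longer identifying strings.

```python
import urllib.error
import urllib.request
from typing import Callable, Dict, List

# Hypothetical, shortened User-Agent strings used only for illustration.
USER_AGENTS: Dict[str, str] = {
    "GPTBot": "GPTBot/1.0",
    "ClaudeBot": "ClaudeBot/1.0",
    "PerplexityBot": "PerplexityBot/1.0",
    "anthropic-ai": "anthropic-ai",
}


def fetch_status(url: str, user_agent: str, timeout: float = 10.0) -> int:
    """Return the HTTP status code of one GET sent with the given User-Agent."""
    request = urllib.request.Request(url, headers={"User-Agent": user_agent})
    try:
        with urllib.request.urlopen(request, timeout=timeout) as response:
            return response.status
    except urllib.error.HTTPError as err:
        # 403, 404 and similar responses still carry a status code.
        return err.code


def probe_user_agents(
    url: str, fetch: Callable[[str, str], int] = fetch_status
) -> Dict[str, int]:
    """Request the same URL once per agent and collect the status codes."""
    return {name: fetch(url, ua) for name, ua in USER_AGENTS.items()}


def blocked_agents(statuses: Dict[str, int]) -> List[str]:
    """Names of agents that received a 403, sorted alphabetically."""
    return sorted(name for name, code in statuses.items() if code == 403)
```

Passing `fetch` as a parameter keeps the comparison logic testable without network access. A live run would call `probe_user_agents("https://motorworldgroup.com/")` and compare the resulting codes per agent.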

Crawler Access

compare_bot_access on the homepage returned 403 for every tested agent except one: GPTBot, OAI-SearchBot, ChatGPT-User, ClaudeBot, PerplexityBot, Google-Extended, Bytespider, Applebot-Extended, and even a standard browser were all blocked by AkamaiGHost (edgesuite.net). Only anthropic-ai (the model-training crawler) received a 200, with 336KB of HTML served by nginx. The robots.txt sits behind the same Akamai wall: browser and most bot UAs get a 403, but anthropic-ai can read it. The file contains explicit Disallow rules for GPTBot, OAI-SearchBot, ChatGPT-User, Claude-User, Claude-SearchBot, PerplexityBot, and CCBot, but those rules cover only JS/CSS/JSON/API/pixall paths, not the main content. The real block is the Akamai WAF layer. A request for llms.txt also returned 403, so no llms.txt is reachable.
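To illustrate why path-scoped Disallow rules leave the main content crawlable, here is a minimal sketch using Python's standard `urllib.robotparser`. The robots.txt excerpt and the two URL paths are hypothetical stand-ins, not the site's actual file, and the helper name `is_allowed` is an assumption.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical excerpt: asset and API paths are disallowed for GPTBot,
# but nothing covers ordinary content pages.
SAMPLE_ROBOTS_TXT = """\
User-agent: GPTBot
Disallow: /api/
Disallow: /pixall/
"""


def is_allowed(robots_txt: str, agent: str, url: str) -> bool:
    """Parse robots.txt text and report whether `agent` may fetch `url`."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(agent, url)


if __name__ == "__main__":
    # Hypothetical paths chosen for illustration.
    for path in ("/new-inventory/index.htm", "/api/widget"):
        url = "https://motorworldgroup.com" + path
        print(path, is_allowed(SAMPLE_ROBOTS_TXT, "GPTBot", url))
```

Under rules of this shape, robots.txt would permit a content page and refuse only the asset path, which is consistent with the observation that the WAF, not robots.txt, is what keeps crawlers out.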

Cold-Knowledge Gap

A frontier LLM queried cold about "Motor World Group" returned zero knowledge: no awareness of the brand, its locations, the makes it sells, or its parent company MileOne Autogroup. The business is a 12-brand, 12-dealership multi-franchise auto group in Wilkes-Barre/Allentown, PA, selling Acura, Cadillac, Chrysler, Dodge, Jeep, Ram, Genesis, Honda, Hyundai, Lexus, Mercedes-Benz, and Toyota. The model has no record of any of this. The gap between the site's actual scale (1,363 new vehicles listed, 10 service locations, a body shop) and the model's blank slate is total.

Schema Posture

The homepage and inventory pages contain zero JSON-LD structured data of any type. The Wayback snapshot confirms that no AutoDealer, Organization, LocalBusiness, or Vehicle schema is present. The site runs on the Dealer.com (DDC) platform, a common automotive CMS, but the default DDC schema templates appear to be disabled or stripped. There is no Product schema on inventory pages, no FAQPage schema despite FAQ content on the site, and no BreadcrumbList. The <title> tags and meta descriptions are populated, but structured data is absent across all checked pages.
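As a rough illustration of what the missing group-level markup could look like, the sketch below builds a schema.org AutoDealer object and wraps it in a JSON-LD script tag. The helper names are assumptions, the property selection is one reasonable choice rather than a DDC template, and address, phone, and hours are deliberately left out because they are not part of this audit's evidence.

```python
import json
from typing import Any, Dict, List

# Makes listed in the audit's description of the group.
BRANDS: List[str] = [
    "Acura", "Cadillac", "Chrysler", "Dodge", "Jeep", "Ram",
    "Genesis", "Honda", "Hyundai", "Lexus", "Mercedes-Benz", "Toyota",
]


def auto_dealer_jsonld(
    name: str, url: str, parent_name: str, brands: List[str]
) -> Dict[str, Any]:
    """Build a schema.org AutoDealer object as a plain dictionary."""
    return {
        "@context": "https://schema.org",
        "@type": "AutoDealer",
        "name": name,
        "url": url,
        "parentOrganization": {"@type": "Organization", "name": parent_name},
        "brand": [{"@type": "Brand", "name": brand} for brand in brands],
    }


def to_script_tag(data: Dict[str, Any]) -> str:
    """Serialize a JSON-LD object into a script tag for the page head."""
    body = json.dumps(data, indent=2)
    return f'<script type="application/ld+json">\n{body}\n</script>'


if __name__ == "__main__":
    dealer = auto_dealer_jsonld(
        "MotorWorld Group", "https://motorworldgroup.com/",
        "MileOne Autogroup", BRANDS,
    )
    print(to_script_tag(dealer))
```

Each dealership location would likely need its own object with a real postal address, telephone, and opening hours supplied from the site's own records.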

External Signals

Web searches for "motorworldgroup.com", "MotorWorld MileOne", and "MileOne Autogroup" returned zero results across multiple queries. No external press, reviews, Reddit threads, or directory listings for the brand surfaced in search. The DNS TXT records include an anthropic-domain-verification token, which indicates the brand has registered with Anthropic for some purpose, yet Claude's user-facing bots (Claude-User, Claude-SearchBot) are blocked at the WAF level, which contradicts that registration. The domain also has google-site-verification, apple-domain-verification, and adobe-idp-site-verification tokens, indicating broad platform registration that is not matched by actual crawler access.
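The TXT-record check described above amounts to scanning record values for known verification prefixes. A minimal sketch follows, assuming the records have already been retrieved by some DNS tool. The sample values are placeholders, not the domain's real tokens, and the function name is hypothetical.

```python
from typing import Dict, Iterable, List

# Prefixes named in the audit; matching is by prefix only, so the exact
# token layout after the prefix does not matter here.
VERIFICATION_PREFIXES = (
    "anthropic-domain-verification",
    "google-site-verification",
    "apple-domain-verification",
    "adobe-idp-site-verification",
)

# Placeholder records for illustration only.
SAMPLE_TXT_RECORDS = [
    '"google-site-verification=PLACEHOLDER"',
    '"v=spf1 -all"',
    '"anthropic-domain-verification-PLACEHOLDER=PLACEHOLDER"',
]


def find_verification_tokens(txt_records: Iterable[str]) -> Dict[str, List[str]]:
    """Group TXT record values by the verification prefix they start with."""
    found: Dict[str, List[str]] = {}
    for record in txt_records:
        value = record.strip().strip('"')  # DNS tools often quote TXT values
        for prefix in VERIFICATION_PREFIXES:
            if value.startswith(prefix):
                found.setdefault(prefix, []).append(value)
    return found


if __name__ == "__main__":
    for prefix, values in find_verification_tokens(SAMPLE_TXT_RECORDS).items():
        print(prefix, len(values))
```

A token's presence shows only that someone started a verification flow with that platform. It says nothing about whether the platform's crawlers can reach the site, which is the mismatch noted above.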

Findings

  1. Akamai WAF blocks all major AI crawlers except anthropic-ai (High)

    The Akamai WAF returns 403 for GPTBot, OAI-SearchBot, ChatGPT-User, ClaudeBot, PerplexityBot, Google-Extended, Bytespider, Applebot-Extended, and standard browsers. Only anthropic-ai (training crawler) receives a 200 response, indicating a firewall misconfiguration rather than an intentional AI strategy.

    What to change: Reconfigure the Akamai WAF to allow access for all major AI crawlers (GPTBot, ClaudeBot, PerplexityBot, Google-Extended, etc.) to the main content paths, while maintaining security for sensitive areas.

  2. robots.txt is blocked by Akamai for most crawlers (High)

    The robots.txt file is served behind the Akamai WAF, returning 403 for standard browsers and most bots. Only anthropic-ai can read it, defeating its purpose as a crawler directive.

    What to change: Move robots.txt outside the Akamai WAF so all crawlers can access it, or configure the WAF to allow all known bot user agents.

  3. No llms.txt file published (Medium)

    The site returns 403 for llms.txt, meaning no AI-specific content guidance is provided.

    What to change: Create an llms.txt file that lists key pages (inventory, locations, about) for AI crawlers to reference.

  4. No JSON-LD structured data on homepage or inventory pages (High)

    The homepage and inventory pages contain zero JSON-LD structured data of any type. No AutoDealer, Organization, LocalBusiness, Vehicle, Product, FAQPage, or BreadcrumbList schema is present, even though the site is a multi-franchise auto group with 1,363 new vehicles listed.

    What to change: Implement JSON-LD structured data for AutoDealer, Organization, LocalBusiness, Vehicle, Product, and BreadcrumbList across all pages.

  5. Frontier LLM has zero knowledge of MotorWorld Group (High)

    A cold query about 'Motor World Group' returned no awareness of the brand, its 12 dealerships, locations, or parent company MileOne Autogroup. The model has no record of the site's scale or offerings.

    What to change: Improve AI visibility by fixing crawler access, adding structured data, and building external signals (press, reviews, directory listings).

  6. No external search results for brand or domain (High)

    Web searches for 'motorworldgroup.com', 'MotorWorld MileOne', and 'MileOne Autogroup' returned zero results across multiple queries. The brand has no indexed external press, reviews, or directory listings.

    What to change: Build external signals through press releases, local directory listings, review platforms, and social media presence.

  7. Claude user-facing bots blocked despite Anthropic domain verification (Medium)

    The DNS TXT records include an anthropic-domain-verification token, but Claude-User and Claude-SearchBot are blocked at the WAF level, contradicting the registration.

    What to change: Allow Claude-User and Claude-SearchBot through the Akamai WAF to match the domain verification.

  8. About page returns 404 for anthropic-ai (Medium)

    The about-us.htm page returns a 404 error when fetched as anthropic-ai, preventing crawlers from accessing company information.

    What to change: Restore the about-us.htm page or implement a redirect to a working page.

  9. No Product or Vehicle schema on inventory pages (High)

    New and used inventory pages lack Product or Vehicle structured data, making it impossible for AI to extract vehicle details (make, model, year, price, VIN).

    What to change: Add JSON-LD Product and Vehicle schema to each inventory listing page.

  10. No BreadcrumbList schema on any page (Low)

    Despite having breadcrumb navigation in the HTML, no BreadcrumbList structured data is present, reducing AI understanding of site hierarchy.

    What to change: Add JSON-LD BreadcrumbList schema to all pages with breadcrumb navigation.

  11. No FAQPage schema despite FAQ content (Low)

    The site contains FAQ content but no FAQPage structured data, missing an opportunity for AI-enhanced search results.

    What to change: Add JSON-LD FAQPage schema to pages with FAQ content.

  12. No LocalBusiness schema on location pages (High)

    Location pages lack LocalBusiness structured data, preventing AI from extracting address, phone, hours, and services for each dealership.

    What to change: Add JSON-LD LocalBusiness schema to each location page with address, phone, hours, and services.
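Several of the findings above recommend per-page JSON-LD. As a minimal sketch of two of them, the code below builds a schema.org Vehicle object for an inventory listing and a BreadcrumbList for page hierarchy. All sample values (model name, VIN, price, URLs) are hypothetical placeholders, and the function names are assumptions rather than part of any dealer platform.

```python
import json
from typing import Any, Dict, List, Tuple


def vehicle_jsonld(
    name: str, brand: str, model: str, year: int,
    vin: str, price: float, currency: str = "USD",
) -> Dict[str, Any]:
    """Build a schema.org Vehicle object with an attached Offer."""
    return {
        "@context": "https://schema.org",
        "@type": "Vehicle",
        "name": name,
        "brand": {"@type": "Brand", "name": brand},
        "model": model,
        "vehicleModelDate": str(year),
        "vehicleIdentificationNumber": vin,
        "offers": {"@type": "Offer", "price": price, "priceCurrency": currency},
    }


def breadcrumb_jsonld(trail: List[Tuple[str, str]]) -> Dict[str, Any]:
    """Build a BreadcrumbList from (name, url) pairs ordered root to leaf."""
    return {
        "@context": "https://schema.org",
        "@type": "BreadcrumbList",
        "itemListElement": [
            {"@type": "ListItem", "position": position, "name": name, "item": url}
            for position, (name, url) in enumerate(trail, start=1)
        ],
    }


if __name__ == "__main__":
    # Placeholder listing; real values would come from the inventory feed.
    listing = vehicle_jsonld(
        "2026 Honda Example Model", "Honda", "Example Model",
        2026, "PLACEHOLDER-VIN", 0.0,
    )
    crumbs = breadcrumb_jsonld([
        ("Home", "https://motorworldgroup.com/"),
        ("New Inventory", "https://motorworldgroup.com/new-inventory/"),
    ])
    print(json.dumps([listing, crumbs], indent=2))
```

Generating these objects from the same data that renders the page keeps the markup and the visible listing in agreement, which matters because mismatched prices or VINs undermine the value of the schema.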

What's working

  • anthropic-ai crawler can access most site content: The anthropic-ai training crawler receives a 200 response with full HTML (336KB) for the homepage and for every other tested page except about-us.htm, which returns 404. This allows Anthropic's model training to ingest site content.
  • Sitemap.xml is accessible and contains URLs: The sitemap.xml (342KB) is accessible to anthropic-ai and provides a comprehensive list of site pages for crawling.
  • Domain verification tokens for major platforms are present: DNS TXT records include verification tokens for Anthropic, Google, Apple, and Adobe, indicating proactive platform registration.
  • robots.txt includes AI-specific Disallow rules: The robots.txt explicitly disallows GPTBot, OAI-SearchBot, ChatGPT-User, Claude-User, Claude-SearchBot, PerplexityBot, and CCBot from JS/CSS/JSON/API/pixall paths, showing awareness of AI crawlers.
  • Wayback Machine has a recent snapshot of the homepage: A Wayback Machine snapshot from 2026-03-30 shows the homepage content, providing a fallback source for AI training data.
  • Contact page is accessible and contains location info: The contact.htm page returns 200 for anthropic-ai and contains dealership location information.
  • New and used inventory pages return 200 with full HTML: Both the new-inventory and used-inventory pages return 200 responses with full HTML content (330KB+) for anthropic-ai, so vehicle listings are reachable once a crawler gets past the WAF.
  • Sales and service location pages are accessible: The sales and service location pages return 200 for anthropic-ai, providing dealership-specific content.
