AI Site Grade

stevewhiteag.com — AI Site Grade

Steve White Auto Group's site is exclusively accessible to Anthropic's AI crawler, blocking all other bots and human visitors behind an Akamai WAF, leaving it invisible to Google, OpenAI, and Perplexity.

The site is locked behind an Akamai WAF that only allows Anthropic's AI crawler, blocking all other bots and human visitors, resulting in zero search engine indexing and a complete cold-knowledge mismatch about the dealership's brands and location.

Findings
9
Evidence checks
48
Completed
30 May 2026

Analysis

The only AI crawler that can access this site is anthropic-ai — every other bot and every browser gets a 403 from Akamai, creating a bizarre single-bot exclusivity that leaves the site invisible to Google, OpenAI, Perplexity, and human visitors alike.

Crawler Access

The site sits behind Akamai (edgesuite.net) with a WAF rule that blocks every User-Agent except anthropic-ai. compare_bot_access confirmed: GPTBot, OAI-SearchBot, ChatGPT-User, ClaudeBot, Google-Extended, PerplexityBot, Perplexity-User, Bytespider, Applebot-Extended, and even a standard Browser all receive 403 Access Denied. Only anthropic-ai gets a 200 with full 312KB content served from an nginx origin behind the Akamai shield. The robots.txt is itself blocked to non-anthropic bots (403), but when fetched as anthropic-ai it reveals explicit Disallow rules for GPTBot, OAI-SearchBot, ChatGPT-User, Claude-User, Claude-SearchBot, and PerplexityBot — though those rules are moot since Akamai blocks them before they ever reach the robots.txt. The llms.txt exists and is a 505KB auto-generated directory of inventory pages, but is likewise only accessible to anthropic-ai.

Cold-Knowledge Gap

An LLM queried cold about "Steve White Auto Group" described it as a Ford/Hyundai/Kia dealership in Greensboro, North Carolina serving the Piedmont Triad. The actual site is a Volkswagen, Volvo, and Audi dealership in Greenville, South Carolina. The model's prior is entirely wrong on location, brands, and market — a complete identity mismatch. This means any AI system relying on parametric knowledge (without live retrieval) will describe a different business than what exists.

Content & Schema Posture

The site is built on the DDC (Dealer.com) platform — a common automotive CMS. The homepage title and meta description correctly state "New Volkswagen, Volvo, Audi Dealership in Greenville, SC." However, critical pages return 404: /about/index.htm, /contact/index.htm, /audi/index.htm, /volkswagen/index.htm, /volvo/index.htm, /new-volkswagen/index.htm, and /new-volvo/index.htm. The sitemap lists 80+ URLs including individual vehicle detail pages, but the brand-specific landing pages that AI crawlers would naturally seek are broken. No JSON-LD structured data was detected on the homepage (the page is JS-heavy and renders content dynamically). The llms.txt is auto-generated and lists inventory pages but lacks any curated "about" or "core content" section that would help an LLM understand the business.

External Signals

The domain has zero indexed results in web search — no external mentions, reviews, Reddit threads, or press coverage surfaced. The DNS points to dealer.com nameservers and uses Google MX for email. The site has two Google Search Console verification TXT records, suggesting it was once submitted to Google, but the Akamai block prevents Googlebot from crawling it. The Wayback Machine shows a snapshot from May 2026, indicating the site is actively maintained on the DDC platform despite being effectively invisible to search engines and most AI crawlers.

Findings

  1. Akamai WAF blocks all bots except anthropic-ai High

    The site's Akamai WAF allows only the anthropic-ai user agent, returning 403 Access Denied to GPTBot, OAI-SearchBot, ChatGPT-User, ClaudeBot, Google-Extended, PerplexityBot, Perplexity-User, Bytespider, Applebot-Extended, and standard browsers. This makes the site invisible to all major AI crawlers and search engines.

    What to change: Remove the Akamai WAF rule that blocks non-anthropic user agents, or replace it with a permissive rule that allows all legitimate crawlers and browsers.

  2. robots.txt is blocked to non-anthropic bots High

    The robots.txt file returns 403 for any user agent except anthropic-ai, preventing other crawlers from reading crawl directives. When fetched as anthropic-ai, it reveals explicit Disallow rules for GPTBot, OAI-SearchBot, ChatGPT-User, Claude-User, Claude-SearchBot, and PerplexityBot, though these are moot due to the Akamai block.

    What to change: Make robots.txt publicly accessible and remove Disallow rules for AI crawlers that should be allowed.

  3. llms.txt is only accessible to anthropic-ai Medium

    The llms.txt file (505KB auto-generated inventory directory) returns 403 to all other user agents, limiting its utility for AI crawlers that could use it to understand the site's content.

    What to change: Make llms.txt publicly accessible to all crawlers.

  4. Zero search engine indexing and external mentions High

    Web searches for the domain and brand return zero results. No external mentions, reviews, or press coverage were found. The Akamai block prevents Googlebot from crawling, despite Google Search Console verification TXT records suggesting past submission.

    What to change: Remove the Akamai block to allow search engine crawlers, and build external backlinks and citations.

  5. Cold LLM knowledge describes wrong location and brands High

    An LLM queried about 'Steve White Auto Group' described it as a Ford/Hyundai/Kia dealership in Greensboro, NC, while the actual site is a Volkswagen, Volvo, and Audi dealership in Greenville, SC. This complete identity mismatch means AI systems relying on parametric knowledge will misrepresent the business.

    What to change: Increase online presence through citations, reviews, and structured data to correct the LLM's knowledge.

  6. Brand-specific landing pages return 404 High

    Critical pages for Audi, Volkswagen, Volvo, about, contact, and service center all return 404 when fetched as anthropic-ai. These are the pages AI crawlers would naturally seek to understand the dealership's offerings.

    What to change: Restore these pages with proper content and ensure they return 200.

  7. No JSON-LD structured data detected on homepage Medium

    The homepage is JavaScript-heavy and renders content dynamically. No JSON-LD structured data was found, which would help AI crawlers understand the business type, location, and inventory.

    What to change: Add JSON-LD structured data for LocalBusiness, AutoDealer, and VehicleOffer schemas on relevant pages.

  8. llms.txt is auto-generated and lacks curated core content Medium

    The llms.txt file is a 505KB auto-generated directory of inventory pages but does not include curated 'about' or 'core content' sections that would help an LLM understand the business.

    What to change: Curate the llms.txt to include a summary of the business, key pages, and structured data hints.

  9. Sitemap returns 403 to standard request High

    The sitemap.xml returns 403 when fetched without the anthropic-ai user agent, preventing search engines from discovering the site's URLs.

    What to change: Make sitemap.xml publicly accessible.

What's working

  • Anthropic AI crawler is allowed and served full content — The anthropic-ai crawler receives a 200 response with full 312KB homepage content, including inventory pages and llms.txt, enabling Claude to index the site.
  • llms.txt exists with 505KB of inventory data — The site provides an llms.txt file containing a comprehensive directory of inventory pages, which can help AI crawlers discover vehicle listings.
  • Sitemap lists 80+ URLs including vehicle detail pages — The sitemap.xml contains over 80 URLs, including individual vehicle detail pages, providing a roadmap for crawlers.
  • Homepage title and meta description correctly state brands and location — The homepage title and meta description accurately describe the dealership as 'New Volkswagen, Volvo, Audi Dealership in Greenville, SC', providing correct context for crawlers that can access it.
  • Google Search Console verification TXT records present — Two Google Search Console verification TXT records exist in DNS, indicating past or intended submission to Google for indexing.
  • Wayback Machine snapshot from May 2026 available — The site has a Wayback Machine snapshot from May 2026, confirming it is actively maintained on the DDC platform despite access restrictions.

Track stevewhiteag.com across AI search

This is one snapshot. Open the interactive report to inspect evidence, or grade another site free.

Open this AI Site Grade Grade another site Track your brand