AI Site Grade

dobbspeterbilt.com — AI Site Grade

Dobbs Peterbilt's Imperva WAF blocks all AI crawlers with 403 errors, making the site invisible to search engines and AI systems.

An Imperva WAF blocks all automated traffic to the site, so AI crawlers cannot reach any of its content. No pages are indexed in Google, and no external mentions of the domain were found.

Findings: 10
Evidence checks: 29
Completed: 30 May 2026

Analysis

The Imperva Wall: A Site AI Crawlers Cannot Reach

Every major AI crawler (GPTBot, ClaudeBot, PerplexityBot, Google-Extended, ChatGPT-User, OAI-SearchBot, Applebot-Extended, anthropic-ai, Bytespider) receives a 403 Forbidden from Cloudflare when attempting to access dobbspeterbilt.com. The site's own robots.txt and llms.txt URLs return the same Imperva "Pardon Our Interruption" challenge page that the homepage serves. No AI-specific rules can be read from robots.txt, and none would take effect in the current setup, because the WAF blocks all automated traffic before any content is served.
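The per-crawler check described above can be approximated with a short script. This is a minimal sketch under stated assumptions, not the tooling behind this report: the `classify` labels and the `probe` helper are hypothetical names, and sending the bare token as the User-Agent header is a simplification of what real crawlers send.

```python
from dataclasses import dataclass
from urllib.error import HTTPError, URLError
from urllib.request import Request, urlopen

# Tokens named in this report. Google-Extended and Applebot-Extended are
# documented by their vendors as robots.txt control tokens rather than
# separate HTTP user agents, so probing with them is only an approximation.
AI_CRAWLER_TOKENS = [
    "GPTBot", "ClaudeBot", "PerplexityBot", "Google-Extended", "ChatGPT-User",
    "OAI-SearchBot", "Applebot-Extended", "anthropic-ai", "Bytespider",
]

# Phrase the report quotes from the interstitial page.
CHALLENGE_MARKER = "Pardon Our Interruption"


@dataclass
class ProbeResult:
    user_agent: str
    status: int
    verdict: str


def classify(status: int, body: str) -> str:
    """Label a response as 'blocked', 'challenge', 'ok', or 'other'."""
    if status in (401, 403):
        return "blocked"
    if CHALLENGE_MARKER.lower() in body.lower():
        return "challenge"
    if 200 <= status < 300:
        return "ok"
    return "other"


def probe(url: str, user_agent: str, timeout: float = 10.0) -> ProbeResult:
    """Fetch the URL once with the given User-Agent and classify the result."""
    request = Request(url, headers={"User-Agent": user_agent})
    try:
        with urlopen(request, timeout=timeout) as response:
            status = response.status
            body = response.read(65536).decode("utf-8", errors="replace")
    except HTTPError as err:  # 4xx and 5xx responses still carry a body
        status = err.code
        body = err.read(65536).decode("utf-8", errors="replace")
    except URLError:  # DNS failure, refused connection, timeout
        return ProbeResult(user_agent, 0, "unreachable")
    return ProbeResult(user_agent, status, classify(status, body))
```

Calling `probe("https://dobbspeterbilt.com/robots.txt", token)` for each entry in `AI_CRAWLER_TOKENS` would give one row per crawler. A WAF may also key on IP reputation or TLS fingerprint, so results from a single machine are indicative rather than conclusive.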

Crawler Access

The site is protected by Imperva (Incapsula) WAF fronted by Cloudflare (DNS: hugh.ns.cloudflare.com, tegan.ns.cloudflare.com; A record 104.17.90.30). A plain browser GET returns the challenge page with noindex, nofollow meta tags, three language variants of the bot-blocking interstitial, and a CAPTCHA requirement. The sitemap at /sitemap.xml also returns the challenge page with zero URLs. The Wayback Machine holds no snapshots. Google search returns zero indexed results for site:dobbspeterbilt.com. The site is effectively invisible to search engines and AI crawlers alike.
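Two of the observations above, the noindex, nofollow meta tags and the sitemap with zero URLs, can be checked against a fetched response body. The sketch below assumes the body has already been downloaded as text; `robots_meta_directives` and `sitemap_urls` are hypothetical helper names, and the namespace is the standard one from the sitemaps.org protocol.

```python
import xml.etree.ElementTree as ET
from html.parser import HTMLParser

SITEMAP_NS = "{http://www.sitemaps.org/schemas/sitemap/0.9}"


class RobotsMetaParser(HTMLParser):
    """Collect the directives from every <meta name="robots"> tag."""

    def __init__(self):
        super().__init__()
        self.directives = set()

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        attributes = {key: (value or "") for key, value in attrs}
        if attributes.get("name", "").lower() != "robots":
            return
        for directive in attributes.get("content", "").split(","):
            if directive.strip():
                self.directives.add(directive.strip().lower())


def robots_meta_directives(html):
    """Return the set of robots meta directives found in an HTML document."""
    parser = RobotsMetaParser()
    parser.feed(html)
    return parser.directives


def sitemap_urls(body):
    """Return <loc> values from a sitemap, or [] if the body is not sitemap XML."""
    try:
        root = ET.fromstring(body)
    except ET.ParseError:  # e.g. an HTML challenge page served at /sitemap.xml
        return []
    return [loc.text.strip() for loc in root.iter(SITEMAP_NS + "loc") if loc.text]
```

An empty list from `sitemap_urls` combined with `"noindex"` in `robots_meta_directives` for the same body is consistent with an interstitial being served in place of the real file.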

Cold-Knowledge Gap

An LLM queried cold described Dobbs Peterbilt as a "family-owned" dealership group with "multiple locations in the Southeast and Midwest" specializing in Peterbilt trucks, parts, leasing, and service. None of this prior knowledge can be traced to the site itself, and the model has no way to verify or update it because the site blocks all AI access. The gap between what the model "knows" (a regional dealer with a reputation for reliability) and what the site exposes (nothing accessible) is absolute: the site provides zero content to confirm, correct, or enrich any AI-generated description.

External Signals

No external mentions of dobbspeterbilt.com were found across web search, reviews, Reddit, or press. The domain has no discoverable off-domain footprint. The MX records point to Mimecast for email, and SPF is configured via smart.ondmarc.com, suggesting the domain is actively used for business communications — but its public web presence is a complete dead end for any automated consumer or AI engine.

Findings

  1. Imperva WAF blocks all AI crawlers with 403 errors (High)

    Every major AI crawler (GPTBot, ClaudeBot, PerplexityBot, Google-Extended, ChatGPT-User, OAI-SearchBot, Applebot-Extended, anthropic-ai, Bytespider) receives a 403 Forbidden from Cloudflare when attempting to access dobbspeterbilt.com. The site's own robots.txt and llms.txt URLs return the same Imperva challenge page.

    What to change: Configure the Imperva WAF to allow known AI crawler user agents (e.g., GPTBot, ClaudeBot) to access the site, or serve them a static, crawlable version of the content.

  2. No AI-specific rules in robots.txt (High)

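    The response at /robots.txt contains zero user-agent rules and names no AI bots, because the URL returns the Imperva challenge page rather than a real robots.txt file. Rules would have no effect in any case while the WAF blocks all automated traffic before any content is served.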

    What to change: Once the WAF serves the real robots.txt file to crawlers, add explicit allow rules for AI crawlers. This is secondary to fixing the WAF configuration.

  3. Sitemap returns challenge page with zero URLs (High)

    The sitemap at /sitemap.xml returns the Imperva challenge page instead of actual URLs, making it impossible for crawlers to discover site content.

    What to change: Ensure the sitemap is served without WAF challenge for known crawlers and contains all public URLs.

  4. Homepage response has noindex, nofollow meta tags (High)

    The page returned for the homepage URL (the Imperva challenge page) carries noindex, nofollow meta tags, which instruct search engines not to index it or follow its links, further reducing visibility.

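    What to change: Stop serving the challenge page to legitimate crawlers, then confirm that the real homepage and other public pages do not carry noindex, nofollow meta tags, and remove any that do.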

  5. Zero pages indexed in Google (High)

    A site:dobbspeterbilt.com search returns zero results, confirming the site is invisible to search engines.

    What to change: Resolve WAF blocking and noindex tags to allow Googlebot to crawl and index the site.

  6. No Wayback Machine snapshots exist (Medium)

    The Wayback Machine has no snapshots of the site, which suggests its archival crawler has never been able to retrieve the site's content.

    What to change: Allow crawlers to access the site so that archival services can capture content.

  7. No external mentions or backlinks found (Medium)

    Web searches for the domain and brand returned zero results across reviews, social media, and press, indicating no off-domain footprint.

    What to change: Build external signals through local listings, social media, and press releases to improve discoverability.

  8. LLM cold knowledge about the dealership is unsourced (Medium)

    An LLM queried cold described Dobbs Peterbilt as a family-owned dealership group with multiple locations, but this information cannot be verified from the site because it blocks all AI access.

    What to change: Make site content accessible to AI crawlers so that AI systems can retrieve accurate, up-to-date information.

  9. llms.txt returns challenge page instead of content (Medium)

    The llms.txt file, intended to provide AI-friendly content, returns the Imperva challenge page, making it useless.

    What to change: Serve llms.txt without WAF challenge and populate it with structured site information for AI consumption.

  10. No internal URLs discovered via crawling (Medium)

    The list_known_urls tool found zero URLs, indicating no crawlable internal links or sitemap data.

    What to change: Ensure the site has a proper internal linking structure and a valid sitemap accessible to crawlers.
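For finding 2, a proposed robots.txt can be checked offline before it is deployed. The sketch below uses Python's standard `urllib.robotparser`. The robots.txt content is a hypothetical example (the `/private/` path and the choice of named bots are placeholders, not taken from the site), and it only becomes relevant once the WAF serves the real file to crawlers.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt for illustration; paths and bot list are placeholders.
PROPOSED_ROBOTS_TXT = """\
User-agent: GPTBot
Allow: /

User-agent: ClaudeBot
Allow: /

User-agent: *
Disallow: /private/
"""


def allowed_agents(robots_txt, url, agents):
    """Map each user-agent token to whether the rules let it fetch the URL."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return {agent: parser.can_fetch(agent, url) for agent in agents}


if __name__ == "__main__":
    report = allowed_agents(
        PROPOSED_ROBOTS_TXT,
        "https://example.com/",
        ["GPTBot", "ClaudeBot", "PerplexityBot"],
    )
    for agent, allowed in report.items():
        print(agent, "allowed" if allowed else "disallowed")
```

Note that a crawler matching a named group follows only that group, so any path that should stay off limits for GPTBot or ClaudeBot has to be repeated inside their own groups rather than left to the `*` group.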

What's working

  • Domain actively used for business email: MX records point to Mimecast and SPF is configured via smart.ondmarc.com, indicating the domain is live and maintained for business communications.
  • llms.txt URL responds on the server: The site appears to host an llms.txt file, which would show awareness of AI-friendly content standards, although the URL currently returns the challenge page instead of the file's content.
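Once /llms.txt is reachable, it needs real content (finding 9). The sketch below renders a file in the layout of the llms.txt proposal: an H1 title, a blockquote summary, then H2 sections of annotated links. `render_llms_txt` is a hypothetical helper, and every name, URL, and note in the usage example is a placeholder, because the site's actual pages could not be read for this report.

```python
def render_llms_txt(name, summary, sections):
    """Render llms.txt text from a title, a summary, and link sections.

    sections maps a heading to a list of (title, url, note) tuples.
    """
    lines = ["# " + name, "", "> " + summary, ""]
    for heading, links in sections.items():
        lines.append("## " + heading)
        lines.append("")
        for title, url, note in links:
            lines.append("- [{}]({}): {}".format(title, url, note))
        lines.append("")
    return "\n".join(lines).rstrip() + "\n"


if __name__ == "__main__":
    # Placeholder content only; replace with the site's real pages.
    print(render_llms_txt(
        "Example Dealer",
        "Placeholder one-sentence description of the business.",
        {"Pages": [("Locations", "https://example.com/locations", "placeholder note")]},
    ))
```

The file should be served as plain text with a 200 status and without the WAF challenge, otherwise it stays as unreadable to AI systems as it is today.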
