AI Site Grade
smailauto.com — AI Site Grade
Smail Auto Group's live website returns HTTP 403 to every crawler and browser. Cloudflare blocks all AI bots at the edge, so no page content, robots.txt, or sitemap is reachable. Combined with zero external search presence, this leaves the site completely invisible to AI models.
- Findings: 11
- Evidence checks: 31
- Completed: 30 May 2026
Analysis
Smail Auto Group's live website returns HTTP 403 to every crawler and browser — including GPTBot, ClaudeBot, Google-Extended, PerplexityBot, and all other major AI user-agents — blocked at the Cloudflare edge before any content is served.
Crawler Access
The entire domain sits behind Cloudflare with no bypass for AI crawlers. Requests for robots.txt and llms.txt both return 403: instead of the expected directives, the server sends a 236KB HTML shell with nothing a crawler can use. Every bot tested (GPTBot, ClaudeBot, Google-Extended, PerplexityBot, ChatGPT-User, OAI-SearchBot, Applebot-Extended, Bytespider, anthropic-ai) receives the same 403 response. The sitemap.xml is also blocked. None of the tested crawlers could retrieve a single page of real content from the live site.
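The per-bot check described above can be reproduced with a short script. The following is a minimal sketch using only the Python standard library, not the tool that produced this report. The user-agent values are the bare tokens named in the report (real crawlers send longer strings), and the `fetch` parameter is a hypothetical hook added so the logic can be exercised without network access. Note that Google-Extended is a robots.txt control token rather than a distinct HTTP user-agent, so sending it as a header only approximates that case.

```python
import urllib.error
import urllib.request

# Bare user-agent tokens named in the report; real crawlers send longer strings.
AI_BOTS = ["GPTBot", "ClaudeBot", "Google-Extended", "PerplexityBot",
           "ChatGPT-User", "OAI-SearchBot", "Applebot-Extended",
           "Bytespider", "anthropic-ai"]


def http_status(url, user_agent, timeout=10):
    """Return the HTTP status code for a GET sent with the given User-Agent."""
    req = urllib.request.Request(url, headers={"User-Agent": user_agent})
    try:
        with urllib.request.urlopen(req, timeout=timeout) as resp:
            return resp.status
    except urllib.error.HTTPError as err:
        return err.code  # 4xx/5xx responses are raised as exceptions


def probe(url, bots=AI_BOTS, fetch=http_status):
    """Map each bot token to the status code it receives for url."""
    return {bot: fetch(url, bot) for bot in bots}


def blocked(results):
    """Return the bots that received a 403, sorted by name."""
    return sorted(bot for bot, code in results.items() if code == 403)
```

Calling `blocked(probe("https://smailauto.com/robots.txt"))` would list the tokens that are refused for that path. The result depends on the site's configuration at the time of the request.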
Cold-Knowledge Gap
The model knows Smail Auto Group as a "family-owned automotive dealership group based in western Pennsylvania" operating Ford, Honda, Hyundai, Nissan, and Kia franchises with a "Smail Lifetime Warranty." The Wayback Machine snapshot of the homepage tells a different story: the site represents 10 franchises (Acura, Buick, Cadillac, Ford, GMC, Honda, Kia, Lincoln, Mazda, and Mercedes-Benz) and says the group has been in business since 1956, not the generic "over 75 years" the model recalls. The model's knowledge is outdated: it misses seven of the ten archived brands, lists two (Hyundai and Nissan) that do not appear in the snapshot, and lacks the specific founding year. The model also has no awareness of the site's six service centers, collision center, auto glass, car wash, or commercial truck center.
Schema Posture
The archived homepage contains valid AutoDealer JSON-LD schema with name, address, telephone, and price range. However, this schema is only visible in the Wayback Machine — the live site serves zero schema to any crawler. The schema that does exist lacks openingHoursSpecification, areaServed, sameAs (social profiles), aggregateRating, and review fields that would strengthen AI knowledge extraction.
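A gap check like the one above can be automated. The sketch below assumes the JSON-LD block has already been extracted from the page as text. The sample values are placeholders, not the contents of the archived schema, and `missing_fields` is a hypothetical helper name.

```python
import json

# Fields the report lists as absent from the archived AutoDealer schema.
RECOMMENDED_FIELDS = ["openingHoursSpecification", "areaServed", "sameAs",
                      "aggregateRating", "review"]


def missing_fields(json_ld_text, recommended=RECOMMENDED_FIELDS):
    """Return the recommended top-level keys absent from a JSON-LD object."""
    data = json.loads(json_ld_text)
    return [field for field in recommended if field not in data]


# Placeholder document with only the four fields the report says are present.
SAMPLE = json.dumps({
    "@context": "https://schema.org",
    "@type": "AutoDealer",
    "name": "PLACEHOLDER_NAME",
    "address": "PLACEHOLDER_ADDRESS",
    "telephone": "PLACEHOLDER_PHONE",
    "priceRange": "PLACEHOLDER_RANGE",
})
```

Running `missing_fields(SAMPLE)` returns all five recommended field names, which mirrors the gap described in this section.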
External Signals
DuckDuckGo returns zero search results for "Smail Auto Group," "smailauto.com," or any combination of the brand name with location. The brand has no detectable press coverage, Reddit threads, or review-site citations in the search index. This total absence of external signals means AI models have almost no corroborating material to cross-reference, making the site's own blocked content the only source — which they cannot reach.
Findings
All AI crawlers blocked by Cloudflare with HTTP 403 (High)
The live site returns HTTP 403 to every tested AI crawler (GPTBot, ClaudeBot, Google-Extended, PerplexityBot, ChatGPT-User, OAI-SearchBot, Applebot-Extended, Bytespider, anthropic-ai) at the Cloudflare edge. No content is served to any bot.
What to change: Configure Cloudflare to allow AI crawler user-agents (e.g., GPTBot, ClaudeBot, Google-Extended) to access the site, or serve a static version of the site to bots.
robots.txt returns HTTP 403 instead of directives (High)
The robots.txt file returns a 403 error, so crawlers cannot read any access rules. This prevents even well-behaved bots from knowing which paths are allowed.
What to change: Serve a valid robots.txt that allows AI crawlers to access the site, or at minimum returns a 200 with appropriate directives.
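Before deploying a new robots.txt, the draft can be checked against the crawlers it is meant to admit. The sketch below uses Python's standard `urllib.robotparser`. The robots.txt body, the `/admin/` path, and the page URLs are illustrative assumptions, not the site's actual rules or structure.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt draft; paths and the sitemap URL are placeholders.
PROPOSED_ROBOTS_TXT = """\
User-agent: GPTBot
Allow: /

User-agent: ClaudeBot
Allow: /

User-agent: *
Disallow: /admin/

Sitemap: https://smailauto.com/sitemap.xml
"""


def allowed(robots_txt, user_agent, url):
    """Return True if the given robots.txt text lets user_agent fetch url."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)
```

For example, `allowed(PROPOSED_ROBOTS_TXT, "GPTBot", "https://smailauto.com/inventory")` evaluates the draft for GPTBot. The file only helps once the server returns it with a 200 status, so the Cloudflare block on the path has to be lifted as well.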
llms.txt returns HTTP 403 (Medium)
The llms.txt file, intended to guide AI crawlers to useful content, returns a 403 error. No AI-friendly content index is provided.
What to change: Create and serve an llms.txt file that lists key pages (e.g., inventory, about, service) for AI crawlers.
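An llms.txt file is a short Markdown document: an H1 title, a blockquote summary, and sections of annotated links. The sketch below renders one from a list of pages. The page URLs and descriptions are hypothetical, since the report could not reach the live site to confirm its URL structure. The summary reuses facts from the archived homepage.

```python
def build_llms_txt(site_name, summary, pages):
    """Render an llms.txt body: H1 title, blockquote summary, one link list."""
    lines = [f"# {site_name}", "", f"> {summary}", "", "## Key pages", ""]
    for title, url, note in pages:
        lines.append(f"- [{title}]({url}): {note}")
    return "\n".join(lines) + "\n"


# Hypothetical URLs; replace with the site's real paths.
PAGES = [
    ("Inventory", "https://smailauto.com/inventory", "New and used vehicles"),
    ("About", "https://smailauto.com/about", "Company history and franchises"),
    ("Service", "https://smailauto.com/service", "Service centers"),
]

LLMS_TXT = build_llms_txt(
    "Smail Auto Group",
    "Automotive dealership group representing 10 franchises, in business since 1956.",
    PAGES,
)
```

The resulting text would be served at `/llms.txt` as plain text or Markdown.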
Sitemap.xml blocked by Cloudflare (High)
The sitemap.xml returns HTTP 403, preventing crawlers from discovering the site's URL structure. This hinders indexing of any pages that might become accessible.
What to change: Allow access to sitemap.xml for all crawlers, or at least for known AI bot user-agents.
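If the sitemap needs to be regenerated once access is restored, a minimal one can be built with the standard library. This is a sketch under the assumption that the list of canonical URLs is already known. The function name is hypothetical, and any URLs passed to it here are examples rather than confirmed pages.

```python
import xml.etree.ElementTree as ET

# Namespace defined by the sitemaps.org protocol.
SITEMAP_NS = "http://www.sitemaps.org/schemas/sitemap/0.9"


def build_sitemap(urls):
    """Return a sitemap XML document with one <url><loc> entry per URL."""
    urlset = ET.Element("urlset", xmlns=SITEMAP_NS)
    for url in urls:
        entry = ET.SubElement(urlset, "url")
        ET.SubElement(entry, "loc").text = url
    body = ET.tostring(urlset, encoding="unicode")
    return '<?xml version="1.0" encoding="UTF-8"?>\n' + body
```

Optional elements such as `lastmod` are omitted to keep the example short.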
No AI crawler has accessed any real content (High)
Every tested AI bot receives a 403 response. The live site serves zero pages of real content to any crawler, making the site completely invisible to AI models.
What to change: Remove the blanket Cloudflare block for AI crawlers and allow them to access key pages like inventory, about, and service.
Zero search results for brand and domain (High)
DuckDuckGo returns zero results for 'Smail Auto Group', 'smailauto.com', or any combination of brand name and location. No press, reviews, or social mentions are indexed.
What to change: Build external signals through press releases, local citations, social media profiles, and review sites to create a web of corroborating content.
LLM knowledge of Smail Auto Group is outdated and incomplete (Medium)
The model knows only 5 franchises (Ford, Honda, Hyundai, Nissan, Kia) and a generic 'over 75 years' history, while the site represents 10 franchises and was founded in 1956. The model also lacks awareness of the six service centers, collision center, auto glass, car wash, and commercial truck center.
What to change: Ensure the live site is accessible to AI crawlers and includes structured data (schema) that accurately reflects all franchises, services, and founding year.
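One way to expose the franchise list and founding year is through the `brand` and `foundingDate` properties of the AutoDealer schema. The sketch below builds such an object. The brand names and the year come from the archived homepage as described in this report, while the function name and the choice of properties are assumptions about how the site might encode them.

```python
import json

# Franchises listed on the archived homepage, per this report.
BRANDS = ["Acura", "Buick", "Cadillac", "Ford", "GMC", "Honda", "Kia",
          "Lincoln", "Mazda", "Mercedes-Benz"]


def dealer_schema(name, url, founding_year, brands):
    """Build an AutoDealer JSON-LD object carrying brands and founding year."""
    return {
        "@context": "https://schema.org",
        "@type": "AutoDealer",
        "name": name,
        "url": url,
        "foundingDate": str(founding_year),
        "brand": [{"@type": "Brand", "name": brand} for brand in brands],
    }


SCHEMA_JSON = json.dumps(
    dealer_schema("Smail Auto Group", "https://smailauto.com/", 1956, BRANDS),
    indent=2,
)
```

The serialized text would go inside a `<script type="application/ld+json">` element on the page.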
Live site serves zero structured data to crawlers (High)
The archived homepage contains valid AutoDealer JSON-LD schema, but the live site returns 403 to all crawlers, so no schema is served. AI models cannot extract any structured information from the live site.
What to change: Allow crawlers to access the live site and ensure JSON-LD schema (AutoDealer) is present on all key pages.
Existing schema lacks key fields for AI extraction (Medium)
The archived homepage's AutoDealer schema is missing openingHoursSpecification, areaServed, sameAs, aggregateRating, and review fields. These fields would strengthen AI knowledge extraction and improve visibility in AI-generated answers.
What to change: Add openingHoursSpecification, areaServed, sameAs (social profiles), aggregateRating, and review fields to the AutoDealer schema.
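The missing fields could be added along the following lines. This is a sketch, not the site's actual markup: `extend_schema` is a hypothetical helper, and every value (hours, service area, profile URLs, rating figures, reviews) has to come from the dealership's own records, so none is filled in here.

```python
def extend_schema(base, hours, area_served, profiles,
                  rating_value, review_count, reviews=()):
    """Return a copy of base with the fields the report lists as missing.

    hours is a list of (day_of_week, opens, closes) tuples supplied by the
    caller, for example ("Monday", "09:00", "18:00").
    """
    extended = dict(base)  # shallow copy so the input dict is left untouched
    extended["openingHoursSpecification"] = [
        {"@type": "OpeningHoursSpecification",
         "dayOfWeek": day, "opens": opens, "closes": closes}
        for day, opens, closes in hours
    ]
    extended["areaServed"] = area_served
    extended["sameAs"] = list(profiles)
    extended["aggregateRating"] = {
        "@type": "AggregateRating",
        "ratingValue": rating_value,
        "reviewCount": review_count,
    }
    if reviews:  # only emit the key when real reviews are provided
        extended["review"] = list(reviews)
    return extended
```

Rating and review markup should reflect real, verifiable reviews. Leaving a field out is safer than publishing placeholder numbers.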
No URLs discovered for the domain (Medium)
The URL discovery tool found zero URLs for smailauto.com, indicating the site has no indexed pages or discoverable links. This compounds the invisibility problem.
What to change: Ensure the site is crawlable and has internal links; submit a sitemap to search engines.
Cloudflare configuration blocks all bots indiscriminately (High)
The site uses Cloudflare with a configuration that returns 403 to all bots, including legitimate AI crawlers. This is a blanket block rather than a selective one.
What to change: Update Cloudflare WAF rules to allow known AI crawler user-agents while still blocking malicious bots.
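A selective rule could match crawlers by user-agent using Cloudflare's rule expression syntax (`http.user_agent contains "..."`). The sketch below only generates such an expression as text. Which action to pair it with (for example a Skip rule placed ahead of the blocking rule) depends on the setting that causes the 403, which this report does not identify. User-agent strings can be spoofed, so matching on them alone is an assumption that should be combined with Cloudflare's bot verification where available.

```python
def allow_expression(bot_tokens):
    """Build a rule expression that matches any of the given bot tokens."""
    clauses = []
    for token in bot_tokens:
        # Reject characters that would break out of the quoted string.
        if '"' in token or "\\" in token:
            raise ValueError(f"unsafe token: {token!r}")
        clauses.append(f'(http.user_agent contains "{token}")')
    return " or ".join(clauses)


# Tokens taken from the list of crawlers tested in this report.
EXPRESSION = allow_expression(["GPTBot", "ClaudeBot", "PerplexityBot"])
```

The generated string can be pasted into the expression editor of a custom rule. It is worth re-running the per-bot status check afterwards to confirm the change took effect.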
What's working
- Archived homepage contains valid AutoDealer JSON-LD schema — The Wayback Machine snapshot of the homepage includes valid AutoDealer schema with name, address, telephone, and price range. This demonstrates the site has historically implemented structured data correctly.
- Wayback Machine has multiple snapshots of the site — The Wayback Machine has archived snapshots of the homepage and robots.txt, providing a historical record of the site's content and configuration.
- Archived homepage contains rich content about franchises and services — The Wayback Machine snapshot reveals the site represents 10 franchises and offers multiple services (six service centers, collision center, auto glass, car wash, commercial truck center), providing substantial material for AI extraction if made accessible.