AI Site Grade
crepprotect.com — AI Site Grade
Crep Protect's Cloudflare JS challenge wall blocks all AI crawlers and human browsers, causing complete de-indexation and zero AI knowledge of the brand despite strong schema and social presence.
- Findings: 8
- Evidence checks: 24
- Completed: 30 May 2026
Analysis
Crep Protect's live site is entirely invisible to every AI crawler — and to every human browser — blocked behind a Cloudflare JS challenge wall that returns 403 to all traffic.
Crawler Access
Every AI crawler tested (GPTBot, ClaudeBot, PerplexityBot, Google-Extended, OAI-SearchBot, Bytespider, Applebot-Extended, ChatGPT-User, anthropic-ai, Perplexity-User) receives a 403 from Cloudflare's JS challenge gate. The homepage, /robots.txt, /sitemap.xml, and /llms.txt all return the same Cloudflare "Verifying your connection..." wall. No robots.txt directives can be read because the file is unreachable, no usable llms.txt is served, and the sitemap is inaccessible. The site runs on Shopify (A record 23.227.38.65, Shopify's standard IP) behind Cloudflare in managed challenge mode, so no bot or browser can reach content without executing JavaScript and passing a Cloudflare cookie challenge.
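The access check described above can be approximated with a short script. This is a minimal sketch, not the grader's actual method: `probe`, `classify`, and `CHALLENGE_MARKER` are hypothetical names, and the crawler names are sent as bare tokens rather than the vendors' full User-Agent strings (some of them, such as Google-Extended, are robots.txt control tokens rather than request headers). A request with a spoofed user agent may also be treated differently by Cloudflare than a genuine crawler.

```python
from urllib.error import HTTPError, URLError
from urllib.request import Request, urlopen

# Crawler names as listed in this report (assumption: sent as bare tokens).
AI_CRAWLERS = [
    "GPTBot", "ClaudeBot", "PerplexityBot", "Google-Extended", "OAI-SearchBot",
    "Bytespider", "Applebot-Extended", "ChatGPT-User", "anthropic-ai",
    "Perplexity-User",
]
PATHS = ["/", "/robots.txt", "/sitemap.xml", "/llms.txt"]

# Text the report says appears on the Cloudflare wall.
CHALLENGE_MARKER = "Verifying your connection"


def classify(status, body):
    """Label a response as 'challenge', 'blocked', 'ok', or 'other'."""
    if CHALLENGE_MARKER in body:
        return "challenge"
    if status == 403:
        return "blocked"
    if 200 <= status < 300:
        return "ok"
    return "other"


def probe(base_url, user_agent, path, timeout=10):
    """Fetch one path with the given User-Agent and classify the result."""
    request = Request(base_url + path, headers={"User-Agent": user_agent})
    try:
        with urlopen(request, timeout=timeout) as response:
            status = response.status
            body = response.read(4096).decode("utf-8", "replace")
    except HTTPError as err:  # 4xx/5xx responses still carry a body
        status = err.code
        body = err.read(4096).decode("utf-8", "replace")
    except URLError:  # DNS failure, refused connection, timeout
        return "unreachable"
    return classify(status, body)
```

Looping `probe("https://crepprotect.com", ua, path)` over `AI_CRAWLERS` and `PATHS` would produce a grid of labels; a grid of only `challenge` or `blocked` entries would match the behaviour this section reports.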
Cold-Knowledge Gap
A frontier LLM queried cold about crepprotect.com knows nothing — it cannot confirm what the brand does, what products it sells, or any reputational signals. The model states the domain may be "obscure, newly launched, or not widely referenced." In reality, Crep Protect is an established sneaker care brand selling sprays, cleaning kits, laces, insoles, and storage products with thousands of reviews (19.6K on the spray alone). The gap between the brand's actual commercial presence and what AI engines can recall is total — the site's Cloudflare wall has effectively erased it from AI training data and retrieval surfaces.
Schema Posture
The archived site content reveals strong structured data — product pages carry Product schema with brand, offers, shippingDetails, and MerchantReturnPolicy. The homepage carries LocalBusiness, Organization, and WebSite schemas with full address (7714 North Lehigh Avenue, Chicago), phone (941-730-9340), and social profiles (Facebook, Instagram, YouTube, TikTok). BreadcrumbList schema is present on product pages. FAQ schema appears on product pages with Q&A about product features. This schema is well-constructed but completely inaccessible to crawlers behind the Cloudflare wall.
External Signals
The brand maintains active social presence on Instagram (@crepprotect), TikTok (@crep_protect), YouTube, and Facebook. DNS records show integrations with Klaviyo (email marketing), Mailgun, Google Workspace, and Microsoft 365 (Outlook mail). The blog ("CrepDaily") publishes sneaker culture content with articles about Air Jordan releases and sneaker news. Despite this, web search returns zero indexed results for the domain, the brand name, or its products — the Cloudflare wall has caused complete de-indexation from search engines.
Findings
Cloudflare JS challenge blocks all AI crawlers and human browsers (severity: High)
The live site returns 403 to every AI crawler tested (GPTBot, ClaudeBot, PerplexityBot, Google-Extended, OAI-SearchBot, Bytespider, Applebot-Extended, ChatGPT-User, anthropic-ai, Perplexity-User) and to human browsers, due to Cloudflare's managed challenge mode. No content is accessible without executing JavaScript and passing a cookie challenge.
What to change: Disable Cloudflare's JS challenge mode for the site, or configure it to allow known AI crawler user agents and search engine bots. Use Cloudflare's bot management to selectively block malicious traffic while permitting legitimate crawlers.
robots.txt is unreachable, preventing any crawler directives (severity: High)
The robots.txt file returns a 403 error, so crawlers cannot read any directives it may contain. The site therefore cannot communicate which paths to crawl or avoid, and AI bots have no guidance.
What to change: Ensure robots.txt is publicly accessible and includes directives for AI crawlers, such as allowing GPTBot and ClaudeBot to crawl the site.
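As a rough illustration of the recommendation, the sketch below holds a candidate robots.txt as a string and checks it with Python's standard `urllib.robotparser` before publishing. The rules in `ROBOTS_TXT`, the `allowed` helper, and the example URLs are assumptions made for illustration, not the site's actual configuration.

```python
from urllib.robotparser import RobotFileParser

# Illustrative rules only: named AI crawlers are allowed everywhere,
# all other bots are kept out of cart and checkout paths.
ROBOTS_TXT = """\
User-agent: GPTBot
Allow: /

User-agent: ClaudeBot
Allow: /

User-agent: *
Disallow: /cart
Disallow: /checkout
Allow: /
"""


def allowed(robots_txt, user_agent, url):
    """Return True if the given robots.txt text lets user_agent fetch url."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)
```

Checks such as `allowed(ROBOTS_TXT, "GPTBot", "https://crepprotect.com/products/example")` only validate the file's logic. They say nothing about whether Cloudflare lets the crawler reach the file in the first place.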
Sitemap is inaccessible, preventing crawlers from discovering pages (severity: High)
The sitemap.xml returns a 403 error, so crawlers cannot discover the site's page structure. This contributes to the complete de-indexation.
What to change: Make sitemap.xml publicly accessible and submit it to search engines (for example through Google Search Console and Bing Webmaster Tools) and, where offered, to AI crawler dashboards.
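Once the sitemap is reachable, a quick sanity check is to confirm that the response body is real sitemap XML and not the challenge page. The sketch below assumes the standard sitemaps.org namespace; `sitemap_locations` and the sample URLs are hypothetical and used only for illustration.

```python
import xml.etree.ElementTree as ET

# Namespace defined by the sitemaps.org protocol.
SITEMAP_NS = {"sm": "http://www.sitemaps.org/schemas/sitemap/0.9"}

# Hypothetical sample; the real sitemap was not retrievable for this report.
SAMPLE_SITEMAP = """\
<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
  <url><loc>https://crepprotect.com/</loc></url>
  <url><loc>https://crepprotect.com/products/example-product</loc></url>
</urlset>"""


def sitemap_locations(xml_text):
    """Return every <loc> URL, or [] if the body is not parseable sitemap XML."""
    try:
        root = ET.fromstring(xml_text)
    except ET.ParseError:
        # An HTML challenge page typically ends up here.
        return []
    return [
        loc.text.strip()
        for loc in root.iterfind(".//sm:loc", SITEMAP_NS)
        if loc.text
    ]
```

An empty result for a URL that should serve a sitemap is a hint that crawlers are still receiving something other than the sitemap itself.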
llms.txt is missing, reducing AI discoverability (severity: Medium)
The site does not serve an llms.txt file, a proposed convention for helping AI crawlers understand a site's content and structure.
What to change: Create an llms.txt file that lists key pages and provides a summary of the site's content for AI crawlers.
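One way to draft such a file is to render it from a small data structure. The sketch below follows the commonly described llms.txt layout (a title, a one-line summary, then sections of links) as an assumption, not a guaranteed reading of the proposal; `render_llms_txt` is a hypothetical helper, and any URLs passed to it are placeholders to be replaced with real page addresses.

```python
def render_llms_txt(site_name, summary, sections):
    """Render llms.txt text from a title, a summary, and sections of links.

    `sections` maps a heading to a list of (title, url, note) tuples.
    """
    lines = [f"# {site_name}", "", f"> {summary}", ""]
    for heading, links in sections.items():
        lines.append(f"## {heading}")
        lines.append("")
        for title, url, note in links:
            lines.append(f"- [{title}]({url}): {note}")
        lines.append("")
    # Collapse trailing blank lines into a single final newline.
    return "\n".join(lines).rstrip() + "\n"


# Placeholder content: the product categories come from this report,
# the URLs are invented for illustration.
EXAMPLE = render_llms_txt(
    "Crep Protect",
    "Sneaker care brand selling sprays, cleaning kits, laces, insoles, and storage products.",
    {
        "Products": [
            ("Sprays", "https://example.com/sprays", "Protective sprays"),
        ],
        "Blog": [
            ("CrepDaily", "https://example.com/blog", "Sneaker culture articles"),
        ],
    },
)
```

Like robots.txt and the sitemap, the resulting file only helps once it is served without the challenge page in front of it.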
Complete de-indexation from search engines and AI retrieval (severity: High)
Web searches for the domain, brand name, and products return zero results. The Cloudflare wall has caused the site to be completely removed from search engine indexes and AI training data.
What to change: Resolve the Cloudflare access issue to allow search engine bots to crawl and index the site. Submit the site to Google Search Console and Bing Webmaster Tools.
AI models have zero knowledge of the brand (severity: High)
A frontier LLM queried about crepprotect.com cannot confirm what the brand does or sells, despite the brand being an established sneaker care company with thousands of reviews. The Cloudflare wall has effectively erased the site from AI training data.
What to change: Allow AI crawlers to access the site so that content can be included in future training data and retrieval indexes.
Well-constructed schema is completely inaccessible to crawlers (severity: High)
The site has strong structured data (Product, LocalBusiness, Organization, WebSite, BreadcrumbList, FAQ) but it is behind the Cloudflare wall, so no crawler can read it. This means AI engines cannot use the schema to understand the site's content.
What to change: Ensure the live site is accessible to crawlers so that structured data can be parsed and used by search engines and AI models.
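After access is restored, it is worth confirming that the structured data actually appears in the HTML a crawler receives. The following is a minimal sketch that assumes the schemas are embedded as JSON-LD `<script>` blocks, which the report does not state explicitly. `JsonLdCollector` and `schema_types` are hypothetical names, and `@graph` containers are not unpacked.

```python
import json
from html.parser import HTMLParser


class JsonLdCollector(HTMLParser):
    """Collect parsed JSON-LD payloads from <script type="application/ld+json">."""

    def __init__(self):
        super().__init__()
        self._in_jsonld = False
        self._buffer = []
        self.blocks = []

    def handle_starttag(self, tag, attrs):
        script_type = (dict(attrs).get("type") or "").lower()
        if tag == "script" and script_type == "application/ld+json":
            self._in_jsonld = True
            self._buffer = []

    def handle_data(self, data):
        if self._in_jsonld:
            self._buffer.append(data)

    def handle_endtag(self, tag):
        if tag == "script" and self._in_jsonld:
            self._in_jsonld = False
            try:
                self.blocks.append(json.loads("".join(self._buffer)))
            except json.JSONDecodeError:
                pass  # skip malformed blocks instead of failing the whole page


def schema_types(html):
    """Return the top-level @type values found in a page's JSON-LD blocks."""
    collector = JsonLdCollector()
    collector.feed(html)
    collector.close()
    types = []
    for block in collector.blocks:
        items = block if isinstance(block, list) else [block]
        for item in items:
            if isinstance(item, dict) and "@type" in item:
                value = item["@type"]
                types.extend(value if isinstance(value, list) else [value])
    return types
```

Run against a challenge page, `schema_types` returns an empty list. Run against a product page served normally, it should list types such as Product and BreadcrumbList if the archived markup is still in place.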
No external backlinks or mentions are indexed (severity: Medium)
Web searches for the brand on Reddit, product reviews, and social media return zero results. The site's external signals are not being indexed, likely due to the domain's de-indexation.
What to change: Build backlinks from reputable sites and ensure the domain is accessible so that search engines can discover and index these signals.
What's working
- Strong structured data on archived product pages and homepage: archived pages show well-constructed Product, LocalBusiness, Organization, WebSite, BreadcrumbList, and FAQ schemas with accurate brand, offer, shipping, and return policy details. This schema is a strong asset once crawlers can access it.
- Active social media presence on Instagram, TikTok, YouTube, and Facebook: the brand's profiles on these platforms can drive external signals and brand awareness once the site is accessible.
- Blog (CrepDaily) publishes sneaker culture content: articles about sneaker releases and culture can attract organic traffic and backlinks once indexed.
- Established brand with thousands of product reviews: the review volume (e.g., 19.6K on the spray) indicates strong customer engagement and social proof.