AI Site Grade

dreibach.co.uk — AI Site Grade

Dreibach's Cloudflare JS challenge blocks every AI crawler, making the Shopify store completely invisible to LLMs and search engines.

Dreibach's Cloudflare JS challenge wall blocks all AI crawlers and search bots, resulting in zero LLM knowledge and no external web footprint despite claims of 1M+ customers.

Findings
10
Evidence checks
29
Completed
30 May 2026

Analysis

Dreibach: A Shopify store rendered invisible by Cloudflare's JS challenge wall

The site is a Shopify-powered general merchandise retailer ("Your Destination for Everyday Living") that has made itself completely invisible to every AI crawler, every search engine, and every web tool through Cloudflare's JavaScript challenge — a configuration that blocks all non-browser traffic at the edge, including GPTBot, ClaudeBot, Google-Extended, PerplexityBot, and even standard browser UAs that don't execute JS.

Crawler Access

Every single AI crawler tested — GPTBot, OAI-SearchBot, ChatGPT-User, ClaudeBot, anthropic-ai, PerplexityBot, Perplexity-User, Google-Extended, Bytespider, Applebot-Extended — receives a 403 with a Cloudflare JS challenge page at the homepage. The robots.txt and llms.txt endpoints return the same 403 wall. The sitemap.xml is also blocked. The site runs on Shopify (IP 23.227.38.65, Cloudflare-managed) with Cloudflare's "Under Attack" or JS challenge mode enabled, which means no crawler that cannot execute JavaScript can read a single byte of content. This is not a selective block — it is a total AI-crawler blackout.

Cold-Knowledge Gap

A frontier LLM queried cold about "Dreibach" returns zero knowledge — no awareness of the brand, its products, its positioning, or its existence. The site itself claims "Trusted by 1,000,000+ Customers" and "Amazon's #1 Bestsellers" on its homepage, yet the brand has no detectable external footprint: zero search results on DuckDuckGo, no Trustpilot presence, no Reddit mentions, no press coverage, no social media profiles found. The gap between the site's self-described scale ("1M+ customers") and its total absence from the public web is extreme.

Content & Schema

The homepage (visible only via Wayback Machine snapshot) is a standard Shopify storefront selling home, garden, pet, lighting, kitchen, and travel products. It contains a single Organization JSON-LD schema with only name and url — no Product, Offer, AggregateRating, Review, FAQPage, or BreadcrumbList schemas despite the page displaying product listings, customer reviews, and an FAQ section. The FAQ section (covering returns, shipping, and contact) is rendered as plain HTML with no FAQPage schema markup. The heading structure uses H1 for the brand name and H2/H3 for categories and products, which is functional but lacks semantic depth.

External Signals

The domain has zero external signals discoverable through web search. No reviews, no forum discussions, no press articles, no social media links, no backlink mentions. The DNS shows Google Workspace mail (MX records) and a Microsoft verification TXT record (MS=ms84110774), suggesting the business uses Google for email and may have a Microsoft relationship, but none of this is publicly visible. The Wayback Machine captured a single snapshot from January 2026, indicating the site has existed at least since then but has not been widely crawled or archived.

Findings

  1. Cloudflare JS challenge blocks all AI crawlers and search bots High

    Every AI crawler tested (GPTBot, OAI-SearchBot, ChatGPT-User, ClaudeBot, anthropic-ai, PerplexityBot, Perplexity-User, Google-Extended, Bytespider, Applebot-Extended) receives a 403 with a Cloudflare JS challenge page at the homepage. The robots.txt, llms.txt, and sitemap.xml endpoints are also blocked. This configuration makes the entire site invisible to any crawler that cannot execute JavaScript.

    What to change: Disable the JS challenge for known AI crawler user agents by configuring Cloudflare WAF to allow GPTBot, ClaudeBot, Google-Extended, and other listed bots, or serve a static HTML version of the site to non-JS clients.

  2. Robots.txt and llms.txt endpoints return 403 High

    The robots.txt and llms.txt files are inaccessible, returning 403 errors. This prevents crawlers from discovering allowed paths and signals a lack of AI-specific guidance.

    What to change: Serve a robots.txt that allows AI crawlers (e.g., GPTBot, ClaudeBot) and create an llms.txt file listing key pages for LLM consumption.

  3. Sitemap.xml blocked by Cloudflare High

    The sitemap.xml endpoint returns a 403 error, preventing search engines and AI crawlers from discovering the site's pages.

    What to change: Ensure sitemap.xml is accessible to crawlers by excluding it from the JS challenge or serving it statically.

  4. Zero LLM knowledge of the brand High

    A frontier LLM queried cold about 'Dreibach' returns no information, indicating the brand has no presence in AI training data.

    What to change: Allow AI crawlers to index the site and build external signals (reviews, press, social media) to improve LLM knowledge.

  5. No external web footprint despite claims of 1M+ customers High

    Web searches for 'dreibach' return zero results across multiple queries, including brand name, reviews, social media, and press. The site claims 'Trusted by 1,000,000+ Customers' and 'Amazon's #1 Bestsellers', but no external validation exists.

    What to change: Build external signals through customer reviews on third-party platforms, social media presence, and press coverage to validate brand claims.

  6. Missing Product and Offer schema on product pages Medium

    The homepage contains only an Organization schema with name and URL. No Product, Offer, AggregateRating, Review, FAQPage, or BreadcrumbList schemas are present, despite the page displaying product listings, customer reviews, and an FAQ section.

    What to change: Add Product, Offer, AggregateRating, Review, and FAQPage structured data to relevant pages to improve AI understanding and rich snippet eligibility.

  7. FAQ section lacks FAQPage schema markup Medium

    The FAQ section covering returns, shipping, and contact is rendered as plain HTML with no FAQPage schema, missing an opportunity for AI crawlers to extract structured Q&A content.

    What to change: Add FAQPage schema markup to the FAQ section to enable rich results and improve AI extraction.

  8. No detectable social media profiles Medium

    Web searches for Dreibach on Facebook, Instagram, and other platforms return no results, limiting brand visibility and external signals.

    What to change: Create and link social media profiles to build brand presence and external signals.

  9. No press coverage or third-party reviews Medium

    No press articles, blog mentions, or reviews on platforms like Trustpilot or Reddit were found, despite the site claiming 'Amazon's #1 Bestsellers'.

    What to change: Encourage customer reviews on third-party platforms and seek press coverage to build credibility and external signals.

  10. Only one Wayback Machine snapshot available Low

    The Wayback Machine has only a single snapshot from January 2026, indicating the site has not been widely crawled or archived, likely due to the Cloudflare block.

    What to change: Allow crawlers to access the site to increase archival frequency and visibility.

What's working

  • Shopify hosting provides reliable infrastructure — The site runs on Shopify, a robust e-commerce platform that offers built-in SEO features and scalability.
  • Organization JSON-LD schema present on homepage — The homepage includes an Organization schema with name and URL, providing basic entity identification for search engines.
  • Heading structure uses H1, H2, H3 appropriately — The homepage uses H1 for the brand name and H2/H3 for categories and products, providing a clear content hierarchy.
  • Google Workspace email configured — DNS records show Google Workspace mail (MX records), indicating professional email setup.

Track dreibach.co.uk across AI search

This is one snapshot. Open the interactive report to inspect evidence, or grade another site free.

Open this AI Site Grade Grade another site Track your brand