AI Site Grade
bluebella.com — AI Site Grade
Bluebella.com is entirely invisible to AI crawlers due to a Cloudflare JS challenge that blocks all bots and even browser UAs from accessing any content.
The entire bluebella.com domain is locked behind a Cloudflare JS challenge that returns 403 to every AI crawler User-Agent and to a request sent with a browser User-Agent, leaving robots.txt, the sitemap, and all structured data inaccessible.
- Findings: 9
- Evidence checks: 34
- Completed: 30 May 2026
Analysis
The key finding is clear: the entire site is locked behind Cloudflare's JS challenge, which blocks every AI crawler and even requests sent with a browser User-Agent from accessing any content.
Bluebella.com: The Invisible Storefront
Every AI crawler User-Agent tested (GPTBot, ClaudeBot, PerplexityBot, Google-Extended, OAI-SearchBot, Bytespider, Applebot-Extended, and ChatGPT-User) receives a 403 with a Cloudflare JS challenge page when requesting bluebella.com. A request sent with a standard browser User-Agent gets the same 403, which suggests the block depends on whether the client can execute the JavaScript challenge rather than on the User-Agent string. No URL on the domain, including robots.txt, sitemap.xml, llms.txt, products.json, or collections.json, returns any actual content. The site is a Shopify store (confirmed by DNS TXT records including shopify-verification-code) fronted by Cloudflare in "Managed Challenge" mode, and the challenge never resolves for clients that cannot run JavaScript.
Crawler Access
The compare_bot_access test returned a uniform 403 status with ~8.8 KB of Cloudflare challenge HTML for all 11 User-Agents. No AI crawler reaches any page content. The robots.txt is inaccessible (403), so no crawler can even read crawl directives. The llms.txt request returns the same 403, so any AI-friendly content map is unreachable whether or not the file exists. The sitemap.xml is also blocked. This is not selective blocking; it is a blanket denial of all machine-readable access.
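The check described above can be approximated with a short script. This is a minimal sketch, not the audit tool itself: the function names are hypothetical, and it assumes that a challenge response either carries Cloudflare's cf-mitigated: challenge header or, as a fallback heuristic, a 403/503 status with "Just a moment" in the body.

```python
import urllib.error
import urllib.request


def is_cloudflare_challenge(status, headers, body):
    """Return True if a response looks like a Cloudflare challenge page."""
    lowered = {key.lower(): value for key, value in headers.items()}
    # Cloudflare marks challenge responses with this header.
    if lowered.get("cf-mitigated", "").lower() == "challenge":
        return True
    # Fallback heuristic (assumption): challenge pages are served with a
    # 403 or 503 status and a "Just a moment..." title.
    return status in (403, 503) and "just a moment" in body.lower()


def probe(url, user_agent, timeout=10.0):
    """Fetch url with the given User-Agent and summarise the response."""
    request = urllib.request.Request(url, headers={"User-Agent": user_agent})
    try:
        with urllib.request.urlopen(request, timeout=timeout) as response:
            status = response.status
            headers = dict(response.headers)
            raw = response.read()
    except urllib.error.HTTPError as error:
        # 4xx/5xx responses are raised as exceptions but still carry
        # a status code, headers and a body.
        status = error.code
        headers = dict(error.headers)
        raw = error.read()
    body = raw.decode("utf-8", errors="replace")
    return {
        "status": status,
        "bytes": len(raw),
        "challenge": is_cloudflare_challenge(status, headers, body),
    }
```

Calling probe("https://bluebella.com/robots.txt", "GPTBot") once per User-Agent string and comparing the resulting dictionaries gives the same kind of side-by-side view as the test above. A uniform status and byte count across all agents points to a blanket challenge rather than selective blocking.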
Cold-Knowledge Gap
The frontier LLM queried about Bluebella describes a UK-based lingerie brand with bold designs, inclusive sizing (A-G cups, XS-2XL), collections like "Lydia" and "Natalie," collaborations with Emily Ratajkowski and Jourdan Dunn, and a "Love Your Lingerie" campaign. This knowledge is detailed but entirely derived from pre-training data — none of it can be verified or refreshed from the live site. The model mentions "ethical manufacturing" and "sustainable materials" as reputational signals, but the site itself offers no way to confirm these claims. The gap between what AI knows cold and what the site actually communicates is total: the site communicates nothing.
Schema Posture
No JSON-LD schema or structured data of any kind was extractable from any page. The homepage, collections, and about page all return only the Cloudflare challenge wall. No Organization, WebSite, Product, or BreadcrumbList markup appears in the accessible DOM, and no meta descriptions, OG tags, or canonical URLs are served.
External Signals
DuckDuckGo search returned zero results for queries including "Bluebella," "Bluebella lingerie," and "bluebella.com" — an extreme absence of indexed external mentions. The Wayback Machine has no archived snapshot of the homepage. The DNS records confirm Shopify hosting (23.227.38.32), Microsoft 365 mail, and Klaviyo email marketing integration, but none of these backend services are exposed to crawlers. The site exists as a commercial entity with no discoverable web footprint.
Findings
Cloudflare JS challenge blocks all AI crawlers and browsers High
Every AI crawler (GPTBot, ClaudeBot, PerplexityBot, Google-Extended, OAI-SearchBot, Bytespider, Applebot-Extended, ChatGPT-User) and a standard browser User-Agent receive a 403 with Cloudflare challenge HTML. No content is served to any non-JS-capable client.
What to change: Configure Cloudflare to allow AI crawler User-Agents (e.g., GPTBot, ClaudeBot) through the JS challenge, or serve a static version of the site to bots.
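One way to express such an allowance is a Cloudflare rule expression that matches crawler User-Agents. The helper below is a hypothetical sketch that only builds the expression string. The audit does not show which rule or setting produces the challenge, so where this expression belongs (for example, in a custom rule with a skip-style action) is an assumption. Matching on User-Agent alone can also be spoofed, so a verified-bot signal may be preferable where available.

```python
# Crawler tokens taken from the audit's list. Matching on the User-Agent
# string alone is an assumption and does not verify the caller's identity.
AI_CRAWLER_TOKENS = [
    "GPTBot",
    "ClaudeBot",
    "PerplexityBot",
    "OAI-SearchBot",
    "ChatGPT-User",
]


def build_user_agent_expression(tokens):
    """Join one `http.user_agent contains "<token>"` clause per token with `or`."""
    if not tokens:
        raise ValueError("at least one crawler token is required")
    clauses = []
    for token in tokens:
        # Refuse characters that would break out of the quoted string.
        if '"' in token or "\\" in token:
            raise ValueError(f"unsupported character in token: {token!r}")
        clauses.append(f'(http.user_agent contains "{token}")')
    return " or ".join(clauses)
```

The resulting string can be pasted into the expression editor of a rule. The contains operator is case-sensitive, so tokens should match the casing the crawlers actually send.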
Robots.txt returns 403, blocking crawl directives High
The robots.txt file at bluebella.com/robots.txt returns a 403 error, preventing crawlers from reading any crawl rules. This means no AI bot can even determine which paths are allowed or disallowed.
What to change: Ensure robots.txt is publicly accessible and returns a 200 status with appropriate directives for AI crawlers.
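As an illustration of what "appropriate directives" might look like, the sketch below parses a sample robots.txt with Python's standard urllib.robotparser and checks what a given crawler may fetch. The sample rules and the /cart, /checkout, and /collections/all paths are assumptions for illustration, not the store's actual configuration.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt: AI crawlers may read everything except cart and
# checkout. Disallow lines come before "Allow: /" because Python's parser
# applies the first matching rule.
SAMPLE_ROBOTS_TXT = """\
User-agent: GPTBot
User-agent: ClaudeBot
User-agent: PerplexityBot
Disallow: /cart
Disallow: /checkout
Allow: /

User-agent: *
Disallow: /cart
Disallow: /checkout

Sitemap: https://bluebella.com/sitemap.xml
"""


def parse_robots(text):
    """Parse robots.txt text into a RobotFileParser without any network access."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


def allowed(parser, user_agent, url):
    """Return True if user_agent may fetch url under the parsed rules."""
    return parser.can_fetch(user_agent, url)
```

Once the live file returns 200, the same check can be pointed at the real robots.txt by fetching its text and passing it to parse_robots.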
Sitemap.xml is blocked by Cloudflare High
The sitemap.xml returns a 403 error, so crawlers cannot discover the site's URL structure. No URLs were found via sitemap or other means.
What to change: Make sitemap.xml publicly accessible and submit it to search engines and AI crawler dashboards.
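After the sitemap is reachable, a quick way to confirm that crawlers can discover URLs from it is to parse it and list its loc entries. This is a minimal sketch with a hypothetical function name; the sample document and its URLs are invented for illustration.

```python
import xml.etree.ElementTree as ET

# Namespace defined by the sitemaps.org protocol.
SITEMAP_NS = {"sm": "http://www.sitemaps.org/schemas/sitemap/0.9"}

# Hypothetical sitemap content; the URLs are placeholders.
SAMPLE_SITEMAP = """
<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
  <url><loc>https://bluebella.com/</loc></url>
  <url><loc>https://bluebella.com/collections/all</loc></url>
</urlset>
""".strip()


def extract_locs(xml_text):
    """Return every <loc> URL from a sitemap or sitemap index document."""
    root = ET.fromstring(xml_text)
    return [
        loc.text.strip()
        for loc in root.findall(".//sm:loc", SITEMAP_NS)
        if loc.text
    ]
```

The same function works for a sitemap index, where each loc points to a child sitemap that can be fetched and parsed in turn. An empty result, or a parse error on challenge HTML, indicates the sitemap is still blocked.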
No llms.txt file for AI-friendly content map Medium
The llms.txt request returns a 403, so any AI-friendly content map is unreachable whether or not the file exists. This is a missed opportunity to guide AI crawlers to key pages.
What to change: Create an llms.txt file listing important pages (e.g., collections, about, size guide) for AI crawlers.
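A file of this kind could be generated along the following lines. This sketch follows the commonly proposed llms.txt layout (a title, a short summary, then sections of links). The function name, page paths, and descriptions are hypothetical placeholders to be replaced with the store's real pages.

```python
def build_llms_txt(title, summary, sections):
    """Render an llms.txt document from a title, summary and link sections.

    sections maps a heading to a list of (name, url, note) tuples.
    """
    lines = [f"# {title}", "", f"> {summary}", ""]
    for heading, links in sections.items():
        lines.append(f"## {heading}")
        lines.append("")
        for name, url, note in links:
            lines.append(f"- [{name}]({url}): {note}")
        lines.append("")
    return "\n".join(lines).rstrip() + "\n"


# Hypothetical content: paths and descriptions are placeholders.
EXAMPLE_SECTIONS = {
    "Shop": [
        ("Collections", "https://bluebella.com/collections", "All product collections"),
    ],
    "Help": [
        ("Size guide", "https://bluebella.com/pages/size-guide", "Sizing information"),
        ("About", "https://bluebella.com/pages/about", "Brand background"),
    ],
}
```

Calling build_llms_txt("Bluebella", "One-sentence description of the store.", EXAMPLE_SECTIONS) returns text that can be served at /llms.txt, provided the path is exempt from the challenge.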
Zero JSON-LD or structured data accessible High
No JSON-LD schema, meta descriptions, OG tags, or canonical URLs were extractable from any page because all pages return only the Cloudflare challenge wall. This prevents AI from understanding the site's content and structure.
What to change: Implement JSON-LD schema (Organization, WebSite, Product, BreadcrumbList) on all pages and ensure they are served to crawlers.
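For illustration, the sketch below builds Organization and WebSite JSON-LD and wraps each in the script tag that crawlers look for. The helper names are hypothetical and only the name and URL fields are filled in. A real Shopify theme would more likely emit this markup from its templates, which is outside the scope of this sketch.

```python
import json


def organization_jsonld(name, url):
    """Build a minimal schema.org Organization object."""
    return {
        "@context": "https://schema.org",
        "@type": "Organization",
        "name": name,
        "url": url,
    }


def website_jsonld(name, url):
    """Build a minimal schema.org WebSite object."""
    return {
        "@context": "https://schema.org",
        "@type": "WebSite",
        "name": name,
        "url": url,
    }


def render_jsonld_script(data):
    """Serialise data as JSON-LD inside a script tag for the page head."""
    payload = json.dumps(data, indent=2, ensure_ascii=False)
    # Escape "</" so a value can never close the script element early;
    # "\/" is a valid JSON escape for "/".
    payload = payload.replace("</", "<\\/")
    return f'<script type="application/ld+json">\n{payload}\n</script>'
```

Product and BreadcrumbList objects can be built the same way with their own required properties. None of this helps until the pages themselves are served to crawlers instead of the challenge wall.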
Zero indexed external mentions across search engines High
DuckDuckGo searches for 'Bluebella', 'Bluebella lingerie', and 'bluebella.com' returned zero results. The site has no discoverable web footprint, indicating no backlinks or indexed content.
What to change: Build backlinks through PR, partnerships, and content marketing; ensure site is indexed by search engines.
No archived snapshot in Wayback Machine Medium
The Wayback Machine has no archived snapshot of bluebella.com, further confirming the site's lack of historical web presence.
What to change: Ensure the site is crawlable so that archiving services can capture it.
AI cold knowledge about Bluebella cannot be verified from live site Medium
LLMs have detailed pre-training knowledge about Bluebella (e.g., collections, collaborations, inclusive sizing), but none of this can be confirmed or refreshed from the live site, which returns no content.
What to change: Allow AI crawlers to access the site so that AI models can verify and update their knowledge with current information.
Shopify JSON API endpoints return 403 Medium
The Shopify store's JSON endpoints (products.json, collections.json) return 403, preventing programmatic access to product and collection data.
What to change: Whitelist AI crawler IPs or serve JSON data to bots via a separate endpoint.
What's working
- Shopify hosting confirmed with DNS verification — DNS records confirm Shopify hosting (23.227.38.32) and a Shopify verification code, indicating a legitimate e-commerce setup.
- Klaviyo email marketing integration detected — DNS TXT records include Klaviyo email marketing verification, showing the site uses a reputable email marketing platform.
- Microsoft 365 mail configuration in place — DNS records show Microsoft 365 mail setup, indicating professional email infrastructure.