AI Site Grade

thisisbeauty.us — AI Site Grade

Cloudflare JS challenge blocks every AI crawler and browser from accessing any content on thisisbeauty.us, leaving a 4,400-product catalog invisible to AI and search engines.

Thisisbeauty.us is completely inaccessible to AI crawlers and search engines due to a Cloudflare JS challenge wall, resulting in zero AI knowledge, zero external footprint, and no structured product data.

Findings: 9
Evidence checks: 30
Completed: 30 May 2026

Analysis


Cloudflare JS challenge blocks every AI crawler — and every browser — from seeing any content

The site thisisbeauty.us is a Shopify-hosted cosmetics overstock/imperfect-goods retailer ("This is Beauty US") that sells discounted beauty products with damaged packaging. Every single request — from GPTBot, ClaudeBot, PerplexityBot, Google-Extended, OAI-SearchBot, Applebot-Extended, Bytespider, and even a standard browser — returns a 403 Cloudflare JS challenge wall. No AI crawler, search engine bot, or human browser can access the homepage, products, collections, sitemap, robots.txt, or llms.txt without passing a JavaScript challenge that no crawler can solve.

Crawler Access

All eleven tested user-agents (GPTBot, ClaudeBot, anthropic-ai, PerplexityBot, Perplexity-User, Google-Extended, OAI-SearchBot, ChatGPT-User, Applebot-Extended, Bytespider, and a plain Browser) receive a 403 status with ~8.8KB of Cloudflare challenge HTML and zero visible content. The robots.txt and llms.txt endpoints also return 403 Cloudflare walls, meaning no crawler can even read the site's access rules. The site runs on Cloudflare (managed challenge mode) proxying to Shopify (IP 23.227.38.65, a Shopify address). While the challenge is in place, no AI bot can index a single product page from this domain.
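
The per-user-agent check described above can be sketched as a small classifier. This is a minimal illustration, not the audit's actual tooling: the function name, the `Server` and `cf-mitigated` header checks, and the "Just a moment" body marker are assumptions about how a challenge response might be recognised.

```python
from typing import Mapping


def looks_like_cloudflare_challenge(
    status: int, headers: Mapping[str, str], body: str
) -> bool:
    """Heuristic: True when a response resembles a Cloudflare challenge page."""
    if status != 403:
        return False
    # Header names are case-insensitive, so normalise them before lookup.
    lowered = {k.lower(): v.lower() for k, v in headers.items()}
    served_by_cloudflare = "cloudflare" in lowered.get("server", "")
    # Assumption: challenge responses carry a `cf-mitigated: challenge` header.
    flagged = lowered.get("cf-mitigated", "") == "challenge"
    # Assumption: fallback marker taken from the usual interstitial page title.
    interstitial = "just a moment" in body.lower()
    return served_by_cloudflare and (flagged or interstitial)
```

The function takes the status, headers, and body from any HTTP client, so the same check can be repeated once per user-agent string after the Cloudflare configuration changes.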

Cold-Knowledge Gap

A frontier LLM queried cold about thisisbeauty.us has zero knowledge of the brand — it cannot confirm what the site does, who it serves, or any products it offers. This is a complete vacuum. The site's actual positioning — a discount retailer selling "imperfect box" and "missing box" cosmetics from brands like SKINCEUTICALS, Sunday Riley, Estee Lauder, ColourPop, e.l.f., and Lumene — is entirely absent from AI training data. The Wayback Machine snapshot reveals a 4,412-product catalog with categories spanning haircare (1,291 products), skincare (1,199), makeup (693), bath & body (570), and wellness (431), yet no AI model has ever seen any of it.

Schema Posture

The archived homepage contains basic Organization and WebSite JSON-LD schema with the brand name "This is Beauty US" and links to Instagram and TikTok. However, no Product schema, no Offer schema, no AggregateRating schema, and no FAQ schema exist on any archived page. A site selling 4,400+ products with price comparisons, condition labels ("Imperfect Box," "Missing Box," "Imperfect Container"), and discount percentages has zero structured data for any individual product. The sameAs array in the Organization schema has eight empty string entries alongside the two real social links.
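
Both schema gaps can be illustrated with a short sketch, assuming the JSON-LD is available as plain Python dictionaries. The helper names, the example product values, and the choice of `NewCondition` for boxed-damage items are hypothetical; the right `itemCondition` value for "Imperfect Box" stock is a merchandising decision this audit does not settle.

```python
import json
from typing import Optional


def clean_same_as(org: dict) -> dict:
    """Return a copy of an Organization node with blank sameAs entries removed."""
    cleaned = dict(org)
    links = [u.strip() for u in org.get("sameAs", []) if isinstance(u, str) and u.strip()]
    if links:
        cleaned["sameAs"] = links
    else:
        # Drop the key entirely rather than publish an empty array.
        cleaned.pop("sameAs", None)
    return cleaned


def product_jsonld(name: str, brand: str, url: str, price: float,
                   currency: str, condition_label: Optional[str] = None) -> str:
    """Serialise a minimal Product node with one nested Offer."""
    node = {
        "@context": "https://schema.org",
        "@type": "Product",
        "name": name,
        "brand": {"@type": "Brand", "name": brand},
        "offers": {
            "@type": "Offer",
            "url": url,
            "price": f"{price:.2f}",
            "priceCurrency": currency,
            # Assumption: the product itself is new; only the packaging is damaged.
            "itemCondition": "https://schema.org/NewCondition",
        },
    }
    if condition_label:
        # Carry the store's own label (e.g. "Imperfect Box") in the description.
        node["description"] = f"Condition: {condition_label}"
    return json.dumps(node, indent=2)
```

AggregateRating is left out of the sketch on purpose: it should only be emitted when real review counts and scores exist for the product.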

External Signals

The brand has no detectable external footprint. Searches for "thisisbeauty.us," "This is Beauty" cosmetics, and related queries return zero results across web search, social media, and review platforms. The Instagram handle @thisisbeauty_us and TikTok @thisisbeauty_us exist in the schema but have no discoverable public presence. The domain has two Google Search Console verification TXT records but appears to have never been indexed by Google — consistent with the Cloudflare wall blocking all crawlers since deployment.

Findings

  1. Cloudflare JS challenge blocks all AI crawlers and browsers High

    Every request to thisisbeauty.us returns a 403 Cloudflare JS challenge wall, preventing GPTBot, ClaudeBot, PerplexityBot, Google-Extended, OAI-SearchBot, Applebot-Extended, Bytespider, and standard browsers from accessing any content.

    What to change: Disable the JS challenge for known AI crawler user-agents by configuring Cloudflare WAF to allow GPTBot, ClaudeBot, and other listed bots, or serve static HTML content without requiring JavaScript execution.

  2. Robots.txt and llms.txt return 403 Cloudflare walls High

    The robots.txt and llms.txt endpoints are also blocked by Cloudflare, preventing crawlers from reading access rules or discovering allowed paths.

    What to change: Ensure robots.txt and llms.txt are served without Cloudflare challenge, or configure Cloudflare to allow these paths for all user-agents.

  3. Zero AI knowledge of the brand or its products High

    A frontier LLM queried cold about thisisbeauty.us has no information about the site, its products, or its positioning as a discount cosmetics retailer.

    What to change: Allow AI crawlers to index the site to build AI knowledge; consider submitting the site to AI training data sources.

  4. No Product, Offer, or AggregateRating schema on any page High

    Despite a catalog of over 4,400 products with condition labels and discount percentages, no structured data exists for individual products, offers, or ratings.

    What to change: Add Product, Offer, and AggregateRating JSON-LD schema to all product pages, including condition, price, and discount information.

  5. Organization schema contains eight empty sameAs entries Low

    The Organization JSON-LD includes eight empty strings in the sameAs array alongside two real social links, which may confuse parsers.

    What to change: Remove empty sameAs entries from the Organization schema to keep structured data clean.

  6. No detectable external presence on web, social, or review platforms High

    Searches for the brand name, domain, and social handles return zero results across web search, Instagram, TikTok, and review sites.

    What to change: Build external signals by creating social media profiles, earning backlinks, and getting listed on review platforms.

  7. Domain not indexed by Google despite Search Console verification High

    The domain has Google Search Console verification TXT records but appears to have never been indexed, consistent with the Cloudflare wall blocking all crawlers.

    What to change: Remove the Cloudflare JS challenge for Googlebot to allow indexing, and submit the sitemap to Google Search Console.

  8. Sitemap returns 403; zero URLs discovered High

    The sitemap endpoint is blocked by Cloudflare, returning a 403 status with no URLs discovered.

    What to change: Serve the sitemap without Cloudflare challenge and ensure it lists all product and collection URLs.

  9. No URLs discovered via any discovery method High

    All URL discovery methods (crawl, search, sitemap) returned zero URLs, indicating the site is completely opaque to automated discovery.

    What to change: Allow crawlers to access the site and ensure a valid sitemap is available and submitted to search engines.
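
For findings 1 and 2, the fix and its verification could look roughly like the sketch below. It builds a Cloudflare Rules expression that matches the listed crawler user-agents (to be attached to a rule that skips the challenge) and then checks a robots.txt body with Python's standard `urllib.robotparser`. The helper names and the sample robots.txt are hypothetical; the site's real rules could not be read during this audit.

```python
from urllib.robotparser import RobotFileParser


def build_allow_expression(user_agents):
    """Build a Cloudflare Rules expression matching any listed user-agent substring."""
    clauses = [f'(http.user_agent contains "{ua}")' for ua in user_agents]
    return " or ".join(clauses)


def crawler_allowed(robots_txt: str, agent: str, url: str) -> bool:
    """Check whether `agent` may fetch `url` under the given robots.txt text."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(agent, url)


# Hypothetical robots.txt body, used only to exercise the check.
SAMPLE_ROBOTS = """\
User-agent: *
Disallow: /cart
Disallow: /checkout
"""

if __name__ == "__main__":
    print(build_allow_expression(["GPTBot", "ClaudeBot", "PerplexityBot"]))
    print(crawler_allowed(SAMPLE_ROBOTS, "GPTBot", "https://thisisbeauty.us/products/example"))
```

User-agent strings can be spoofed, so matching on them alone is a weak allowlist; it is worth checking whether Cloudflare's verified-bot signals cover the crawlers in question before relying on this expression.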

What's working

  • Basic Organization and WebSite schema present on homepage — The homepage includes JSON-LD for Organization and WebSite with brand name and social links, providing foundational structured data.
  • Real Instagram and TikTok links in Organization schema — The Organization schema includes valid Instagram and TikTok URLs, which can help AI associate the brand with social profiles.
  • Google Search Console verification TXT records present — The domain has two TXT records for Google Search Console verification, indicating prior ownership verification.
  • Wayback Machine snapshots exist for homepage and collections — Archived snapshots of the homepage and collections page are available, preserving historical content for reference.

