AI Site Grade

spacegoods.com — AI Site Grade

Spacegoods.com is a fully operational Shopify store with thousands of reviews that is completely invisible to AI crawlers and search engines due to an aggressive Cloudflare challenge wall.

Spacegoods.com's Cloudflare JS challenge blocks all AI crawlers and search engines, leaving a mature e-commerce brand with 21+ SKUs and 12,000+ reviews entirely absent from AI knowledge and web search.

Findings
12
Evidence checks
31
Completed
30 May 2026

Analysis

Spacegoods.com: A Live Shopify Store Rendered Invisible to the Open Web

The live site returns a 403 Cloudflare challenge wall to every single AI crawler and browser alike — GPTBot, ClaudeBot, PerplexityBot, Google-Extended, and even a plain browser all receive the same "Verifying your connection..." JavaScript shell with zero content. The domain is a functioning e-commerce business (Shopify-hosted, DNS points to 23.227.38.65) selling mushroom-and-adaptogen functional blends under product names like Rainbow Dust, Astro Dust, and Hydro Dust, yet it has no discoverable external footprint anywhere on the indexed web.

Crawler Access

Every AI crawler tested — GPTBot, OAI-SearchBot, ChatGPT-User, ClaudeBot, anthropic-ai, PerplexityBot, Perplexity-User, Google-Extended, Bytespider, Applebot-Extended — receives a 403 with Cloudflare's JS challenge page (cf-ray headers present). The robots.txt and llms.txt endpoints also 403. The sitemap at /sitemap.xml is blocked. There are no AI-bot directives anywhere because Cloudflare's "Under Attack" mode (or equivalent JS challenge) intercepts all requests before the application server. The site is effectively invisible to every text-based crawler — no bot can read a single word of product copy, ingredient lists, or pricing.

Cold-Knowledge Gap

A frontier LLM queried cold about spacegoods.com returns: *"I do not have specific, verifiable information about spacegoods.com... I cannot confirm whether the site is active, legitimate, or notable."* This is a total knowledge vacuum — the model has no prior about the brand, its products, its category positioning, or its claims. The Wayback Machine reveals a fully fleshed-out Shopify store with 21+ SKUs, 12,315 reviews on Rainbow Dust alone (aggregate rating 4.7), and a clear value proposition ("mushroom coffee alternative, no jitters, no crash"). The gap between the live operation and what AI knows is absolute.

Content & Schema Posture

The archived homepage (Wayback Machine, May 2026) shows a polished Shopify store with FAQ sections, ingredient breakdowns, comparison language ("We didn't improve coffee. We made it obsolete"), and a full product taxonomy (Focus, Energy, Sleep, Hydrate, Restore). The Rainbow Dust product page carries a Product schema with AggregateRating (4.7 / 12,315 reviews) and embedded reviews. However, no Organization, WebSite, BreadcrumbList, or FAQPage schema was found on the homepage or collection pages. The site has rich answer-format signals (FAQ, tables, lists, comparison language) but fails to mark them up for structured extraction.

External Signals

The domain has zero indexed external mentions across search engines, Reddit, Trustpilot, or press. Searches for "spacegoods reviews," "spacegoods mushroom coffee," and "rainbow dust spacegoods" return no results. The DNS TXT records show Klaviyo and Google Search Console verification, confirming the site is actively managed for email marketing and search — yet no organic search presence exists. The Wayback snapshot from May 2026 is the only accessible record of the site's content.

Surprising Finding

The most striking finding is the complete contradiction between the site's operational sophistication and its web invisibility. This is a mature Shopify store with thousands of reviews, multiple product lines, subscription mechanics, and a clear brand voice — yet it is entirely walled off from AI crawlers and search engines. The Cloudflare challenge is so aggressive that even Google-Extended cannot index a single page. The brand has built a functioning e-commerce business that, from the perspective of AI knowledge retrieval and web search, does not exist.

Findings

  1. Cloudflare JS challenge blocks all AI crawlers and search engines High

    Every AI crawler tested (GPTBot, OAI-SearchBot, ChatGPT-User, ClaudeBot, anthropic-ai, PerplexityBot, Perplexity-User, Google-Extended, Bytespider, Applebot-Extended) receives a 403 with a Cloudflare JavaScript challenge page. The site is completely inaccessible to text-based crawlers, preventing any indexing or content extraction.

    What to change: Disable Cloudflare's Under Attack mode or JS challenge for known AI crawler user agents. Allow GPTBot, ClaudeBot, Google-Extended, and other major AI bots to access the site without a challenge.

  2. Robots.txt and llms.txt endpoints return 403 High

    The robots.txt and llms.txt files are blocked by Cloudflare, returning 403 errors. This prevents crawlers from discovering allowed paths and prevents AI-specific directives from being read.

    What to change: Ensure robots.txt and llms.txt are served without a challenge. Add clear AI crawler directives to robots.txt and publish an llms.txt file with site overview and key URLs.

  3. Sitemap.xml returns 403, preventing crawler discovery High

    The sitemap at /sitemap.xml is blocked by Cloudflare, returning a 403. This prevents search engines and AI crawlers from discovering the site's pages and structure.

    What to change: Allow access to /sitemap.xml without a challenge. Ensure the sitemap is properly formatted and submitted to Google Search Console.

  4. Zero indexed external mentions across web, social, and review platforms High

    Searches for 'spacegoods reviews', 'spacegoods mushroom coffee', 'rainbow dust spacegoods', and brand-specific queries on Reddit, Trustpilot, and press return no results. The domain has no discoverable external footprint anywhere on the indexed web.

    What to change: Build external signals through PR, influencer partnerships, review platforms (Trustpilot, G2), and social media engagement. Encourage customers to leave reviews on third-party sites.

  5. Frontier LLMs have no knowledge of the brand or products High

    A cold query to a frontier LLM about spacegoods.com returns: 'I do not have specific, verifiable information about spacegoods.com... I cannot confirm whether the site is active, legitimate, or notable.' The model has zero prior knowledge of the brand, its products, or its category positioning.

    What to change: Allow AI crawlers to index the site and publish structured data (Organization, Product, FAQ) to help LLMs learn about the brand. Consider submitting to AI knowledge bases like ChatGPT's browsing feature.

  6. No Organization or WebSite schema on homepage Medium

    The archived homepage lacks Organization, WebSite, and BreadcrumbList structured data. This limits how AI systems and search engines understand the brand's identity and site structure.

    What to change: Add Organization schema (name, logo, URL, social profiles) and WebSite schema (search action, name) to the homepage. Implement BreadcrumbList on all pages.

  7. FAQ content on homepage lacks FAQPage schema Medium

    The homepage contains FAQ-style content (e.g., 'What is Spacegoods?', 'How does it work?') but is not marked up with FAQPage schema. This prevents AI systems from extracting structured Q&A pairs.

    What to change: Wrap FAQ sections in FAQPage schema with Question and Answer properties. This enables rich results and AI-friendly extraction.

  8. Collection pages lack Product schema for listed items Medium

    The 'shop all' collection page lists products but does not include Product schema for each item. Only individual product pages have schema. This limits AI understanding of the full catalog.

    What to change: Add Product schema (name, price, description, image, aggregateRating) to each product listing on collection pages.

  9. Reviews page contains no visible content Medium

    The /pages/reviews page returns only 2 words of content ('Spacegoods | Community'), suggesting reviews are loaded dynamically or the page is empty. This prevents AI crawlers from accessing social proof.

    What to change: Ensure review content is server-side rendered or statically included in the HTML. Add Review schema markup for each review.

  10. Labs page returns 404, breaking trust signals Medium

    The /pages/labs page, which likely contained lab testing or ingredient transparency information, returns a 404. This erodes trust and removes a potential source of authoritative content for AI crawlers.

    What to change: Restore the labs page with lab test results, certificates, or ingredient sourcing details. Add a link in the footer or navigation.

  11. No pages indexed in Google despite Search Console verification High

    A site:spacegoods.com search returns zero results. DNS TXT records show Google Search Console verification, indicating the site is actively managed for search, yet no pages are indexed due to the Cloudflare block.

    What to change: Remove the Cloudflare challenge for Googlebot and other search engine crawlers. Submit the sitemap to Google Search Console and request indexing.

  12. No discoverable social media presence for the brand Medium

    Searches for spacegoods on Instagram, TikTok, and other platforms return no results. The brand has no external social signals that AI crawlers or search engines can reference.

    What to change: Create and actively maintain social media profiles (Instagram, TikTok, LinkedIn) with links back to the website. Embed social feeds on the site.

What's working

  • Product schema with AggregateRating on individual product pages — The Rainbow Dust product page includes Product schema with AggregateRating (4.7 out of 5 from 12,315 reviews) and embedded reviews. This provides rich structured data for AI systems when they can access the page.
  • Polished Shopify store with clear brand messaging and product taxonomy — The archived site shows a well-designed Shopify store with clear product categories (Focus, Energy, Sleep, Hydrate, Restore), compelling copy, and a strong value proposition. This provides a solid foundation for AI-friendly content once access is granted.
  • Active email marketing and Google Search Console verification — DNS TXT records show Klaviyo and Google Search Console verification, indicating the site is actively managed for email marketing and search. This suggests the business is operational and invested in digital presence.
  • Wayback Machine snapshot preserves site content for historical reference — A Wayback Machine snapshot from May 2026 captures the full site content, including product pages, collections, and reviews. This provides a fallback for AI systems that can access archived content.

Track spacegoods.com across AI search

This is one snapshot. Open the interactive report to inspect evidence, or grade another site free.

Open this AI Site Grade Grade another site Track your brand