AI Site Grade
minorfigures.com — AI Site Grade
Minor Figures' live site is completely invisible to all AI crawlers due to a Cloudflare JS challenge, with zero schema markup and no external search footprint.
Minor Figures' live site blocks every AI crawler with a Cloudflare JS challenge, has no schema markup, and lacks any indexed external signals, making it invisible to AI systems.
- Findings
- 12
- Evidence checks
- 29
- Completed
- 30 May 2026
Analysis
Minor Figures: The live site is invisible to every AI crawler
The live minorfigures.com returns 403 with a Cloudflare JS challenge to every user-agent tested — GPTBot, ClaudeBot, PerplexityBot, Google-Extended, OAI-SearchBot, ChatGPT-User, Applebot-Extended, Bytespider, and even a standard browser. No bot or tool can read a single word of content from the live domain. The only accessible version lives in the Wayback Machine, where the most recent snapshot (February 2026) reveals a Shopify storefront.
Crawler Access
Every AI crawler and browser receives the same Cloudflare "Verifying your connection..." JS challenge page. The compare_bot_access test returned 403 blocked for all 11 user-agents with near-identical byte sizes (~8.8KB of challenge HTML). The robots.txt and llms.txt endpoints are also behind the same challenge wall — neither exists as a resolvable file. The DNS resolves to 23.227.38.32 (Shopify's CDN), and the site uses Google Domains nameservers. The Cloudflare challenge is configured at the edge before any Shopify content is served, meaning no crawler ever reaches the actual store.
Cold-Knowledge Gap
The LLM knows Minor Figures as a UK-based oat milk and cold brew coffee brand founded in 2014 by Stuart Forsyth and Simon Wright, with a barista-quality oat milk line, minimalist branding with a cartoon mascot, and a 2023 greenwashing controversy over carbon-neutral claims that led to an ASA ruling. The live site, however, cannot confirm or deny any of this — it is entirely inaccessible. The Wayback snapshot shows the site describes itself as "a coffee company that makes coffee products for coffee lovers" and prominently features B Corp certification, subscription models, and a "Dairy Disloyalty" campaign. The greenwashing controversy is completely absent from the site's own messaging.
Schema Posture
The Wayback snapshots of the homepage, product pages, FAQ, and subscription pages all show zero JSON-LD schema of any type. Despite being a Shopify store with product pages (/products/barista-oat), no Product, Organization, FAQPage, BreadcrumbList, or WebSite schema is present. The FAQ page contains rich Q&A content (ingredients, sustainability, shipping) that would naturally map to FAQPage schema but has none. The homepage has no Organization schema with logo, social links, or B Corp certification data.
External Signals
The brand's external footprint is surprisingly sparse in search results. DuckDuckGo returned zero results for queries combining the brand name with "ASA," "greenwashing," "B Corp," or "founded." The LLM's cold knowledge about the 2023 ASA greenwashing ruling is the most distinctive external signal — but no live web search could corroborate it, suggesting the brand's off-domain presence may be thin or poorly indexed. The site uses Klaviyo (multiple verification TXT records), Mandrill, SendGrid, and Google Workspace — a standard Shopify ecommerce stack. A subdomain at minorfigures.fm hosts a community radio project, and sourcefiles.minorfigures.com appears to host media assets.
Findings
Cloudflare JS challenge blocks all AI crawlers High
The live site returns a 403 with a Cloudflare JavaScript challenge to every tested user-agent, including GPTBot, ClaudeBot, PerplexityBot, and standard browsers. No crawler can access any content.
What to change: Remove the Cloudflare JS challenge for known AI crawler user-agents, or configure a bypass rule to allow GPTBot, ClaudeBot, and other listed bots through to the Shopify origin.
Robots.txt and llms.txt are inaccessible High
The robots.txt and llms.txt endpoints return 403 errors, preventing crawlers from discovering allowed paths or AI-specific instructions.
What to change: Ensure robots.txt and llms.txt are served without a JS challenge, and include directives for AI crawlers.
No JSON-LD schema on any page High
Wayback snapshots of the homepage, product pages, FAQ, and subscription pages show zero JSON-LD schema. No Product, Organization, FAQPage, or WebSite schema is present.
What to change: Add JSON-LD schema for Organization (homepage), Product (product pages), FAQPage (FAQ page), and WebSite across the site.
No external search results for the brand Medium
DuckDuckGo returned zero results for queries combining the brand name with key terms like 'ASA', 'greenwashing', 'B Corp', or 'founded'. The brand's off-domain presence appears thin or poorly indexed.
What to change: Build external signals through PR, backlinks, and social media to improve brand visibility in search and AI knowledge bases.
Greenwashing controversy absent from site messaging Medium
The LLM knows about a 2023 ASA greenwashing ruling regarding carbon-neutral claims, but the site's Wayback content does not address or acknowledge this controversy.
What to change: Add a page or section addressing the ASA ruling and current sustainability practices to provide transparency and accurate information.
Sitemap inaccessible High
The sitemap endpoint returns a 403 error, preventing crawlers from discovering the site's URL structure.
What to change: Serve the sitemap without a JS challenge and ensure it lists all important pages.
No known URLs discovered via search Medium
The list_known_urls tool returned zero URLs, indicating the site has no indexed pages in search engines.
What to change: Ensure the site is indexable by search engines and submit a sitemap to Google Search Console.
FAQ page lacks FAQPage schema Medium
The FAQ page contains rich Q&A content but has no FAQPage schema, missing an opportunity for rich results in search and AI extraction.
What to change: Add FAQPage schema to the FAQ page with all questions and answers.
Product page lacks Product schema High
The Barista Oat product page has no Product schema, preventing AI systems from extracting product details like name, price, and description.
What to change: Add Product schema to all product pages with name, description, price, and availability.
Homepage lacks Organization schema Medium
The homepage has no Organization schema, missing the chance to provide brand name, logo, and social links to AI crawlers.
What to change: Add Organization schema to the homepage with name, logo, and social media URLs.
Subscription page lacks schema Low
The subscription page has no schema markup, missing an opportunity to describe subscription offerings to AI systems.
What to change: Add appropriate schema (e.g., Product or Service) to the subscription page.
Play page has minimal content Low
The /pages/play page contains only 7 words, providing no value to crawlers or users.
What to change: Expand the Play page with meaningful content or remove it if unused.
What's working
- Wayback snapshot preserves site content — The Wayback Machine has a recent snapshot (February 2026) that reveals the site's Shopify storefront, including product pages, FAQ, and subscription details, providing a fallback for content discovery.
- B Corp certification prominently displayed — The site prominently features B Corp certification on the homepage, which is a positive signal for brand trust and sustainability.
- Subscription model well-documented — The subscription page contains detailed information about subscription options, which is useful for AI systems that can access it via Wayback.
- FAQ content is rich and structured — The FAQ page contains well-organized Q&A covering ingredients, sustainability, and shipping, which would be ideal for FAQPage schema.
- Dairy Disloyalty campaign present — The site has a dedicated page for the 'Dairy Disloyalty' campaign, which is a unique brand asset that could be leveraged for AI visibility.
- Community radio subdomain exists — The subdomain minorfigures.fm hosts a community radio project, which could be a unique external signal if properly indexed.
- Standard ecommerce stack in use — The site uses Shopify, Klaviyo, Mandrill, SendGrid, and Google Workspace, which are reliable and well-understood by AI systems.
- Product page has descriptive title — The Barista Oat product page has a descriptive title tag that includes the product name and category, which helps with basic SEO.
Track minorfigures.com across AI search
This is one snapshot. Open the interactive report to inspect evidence, or grade another site free.