AI Site Grade
innermostglobal.com — AI Site Grade
Innermostglobal.com's Cloudflare JS challenge wall blocks every AI crawler tested, along with requests sent under a standard browser user agent, making the site completely invisible to the AI ecosystem.
The site is entirely inaccessible to AI crawlers due to a Cloudflare JS challenge, has zero external signals, and lacks product schema despite a full supplement catalog.
- Findings: 12
- Evidence checks: 33
- Completed: 30 May 2026
Analysis
The Cloudflare JS challenge at innermostglobal.com stops every AI crawler tested, and even requests sent with a standard browser user agent, from seeing a single byte of real content, making the site effectively invisible to the AI ecosystem.
Crawler Access
Every AI crawler tested (GPTBot, ClaudeBot, PerplexityBot, Google-Extended, OAI-SearchBot, Applebot-Extended, Bytespider, anthropic-ai, ChatGPT-User, Perplexity-User) receives a 403 with a Cloudflare JS challenge wall. Even a standard browser UA gets the same 403. The site is hosted on Shopify (IP 23.227.38.32, Shopify CDN) behind Cloudflare's "Managed Challenge" mode. The robots.txt (visible only via Wayback) is the default Shopify template with no AI-bot-specific rules: it does not disallow GPTBot, ClaudeBot, or any other AI crawler, but the Cloudflare layer blocks them before they ever reach it. No llms.txt is reachable: the request returns the same 403 wall, so the check cannot tell whether a file has been published. The sitemap is a standard Shopify index pointing to products, pages, collections, and blogs, but none of these URLs are accessible to crawlers either.
Cold-Knowledge Gap
A frontier LLM queried cold about innermostglobal.com has zero knowledge of the brand — no products, no reputation, no media coverage. The site describes itself as "Science-Backed Health Supplements" with a full product line (The Lean Protein, The Strong Protein, The Health Protein, The Fit Protein, boosters, capsules, blends) and claims press mentions from Cosmopolitan, Metro, and Women's Fitness. Yet the AI model knows nothing about any of this. The gap between the site's self-positioning ("science meets nature," "tailored nutrition for body and mind") and the model's blank slate is total — the site has no AI-accessible footprint whatsoever.
Schema Posture
The archived homepage contains Organization and WebSite JSON-LD schema with name "Innermost," a London address (64 Nile Street, N1 7SR), a contact email (address obfuscated in the capture), and social links to Facebook, Twitter, and Instagram. The schema references a different domain (liveinnermost.com) as the @id and url, suggesting a domain migration or rebranding. No Product schema is present on the homepage despite 15+ products being listed. No FAQPage, HowTo, or BreadcrumbList schema was detected. The blog sitemap shows 300+ articles, but none carry structured data for recipes, health claims, or articles.
External Signals
External search returns zero results for the brand across web search, Reddit, and Trustpilot. The DNS TXT records show Klaviyo, Pinterest, and Amazon SES verification — indicating email marketing and social commerce activity — but no indexed press coverage, reviews, or community discussion was found. The Wayback Machine shows the site has been active since at least 2018 (oldest sitemap dates), with regular updates through early 2026, yet the brand has generated no discoverable external conversation. The schema references liveinnermost.com as the canonical brand domain, but that domain also returns no search results.
Findings
Cloudflare JS challenge blocks all AI crawlers and browsers High
Every AI crawler tested (GPTBot, ClaudeBot, PerplexityBot, Google-Extended, OAI-SearchBot, Applebot-Extended, Bytespider, anthropic-ai, ChatGPT-User, Perplexity-User) receives a 403 with a Cloudflare JS challenge wall. Even a standard browser UA gets the same 403. The site is hosted on Shopify behind Cloudflare's Managed Challenge mode, preventing any crawler from accessing content.
What to change: Configure Cloudflare to allow AI crawler user agents (e.g., GPTBot, ClaudeBot) by disabling the JS challenge for those UAs or adding them to an allowlist.
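One way to re-check access after changing the Cloudflare rules is to request a page under each crawler's user agent and record the status code. The sketch below is a minimal illustration, not the method used for this report: it assumes the bare tokens listed above are sent as the whole User-Agent header (real crawlers send longer strings), and the function names are hypothetical.

```python
from typing import Callable, Dict, List
import urllib.error
import urllib.request

# User-agent tokens named in the report. Sending the bare token is a
# simplification; real crawlers send longer strings containing it.
AI_USER_AGENTS = [
    "GPTBot", "ClaudeBot", "PerplexityBot", "Google-Extended", "OAI-SearchBot",
    "Applebot-Extended", "Bytespider", "anthropic-ai", "ChatGPT-User",
    "Perplexity-User",
]


def fetch_status(url: str, user_agent: str, timeout: float = 10.0) -> int:
    """Return the HTTP status code for a GET sent with the given User-Agent."""
    request = urllib.request.Request(url, headers={"User-Agent": user_agent})
    try:
        with urllib.request.urlopen(request, timeout=timeout) as response:
            return response.status
    except urllib.error.HTTPError as error:
        # 4xx/5xx responses, such as a 403 challenge page, raise HTTPError.
        return error.code


def probe(url: str, fetch: Callable[[str, str], int] = fetch_status) -> Dict[str, int]:
    """Map each user agent to the status code it receives for url."""
    return {agent: fetch(url, agent) for agent in AI_USER_AGENTS}


def blocked_agents(results: Dict[str, int]) -> List[str]:
    """List the agents that did not receive a 200 response."""
    return [agent for agent, status in results.items() if status != 200]
```

Calling `blocked_agents(probe("https://innermostglobal.com/"))` would list the agents still being turned away. A user-agent check of this kind only confirms what a given header receives; it does not show how Cloudflare treats a crawler's verified traffic.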
Robots.txt is inaccessible to crawlers High
The robots.txt file returns a 403 due to the Cloudflare challenge, so crawlers cannot read any directives. The archived version shows only the default Shopify template with no AI-bot-specific rules.
What to change: Ensure robots.txt is served without a JS challenge and add explicit allow rules for AI crawlers.
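As a rough illustration of explicit allow rules, the sketch below builds a robots.txt with one group per AI crawler and checks it with Python's standard `urllib.robotparser`. The crawler subset and the private paths (`/cart`, `/checkout`) are assumptions for the example; the real Shopify template lists more paths, and on Shopify the file would normally be edited through the theme's robots.txt template rather than generated like this.

```python
import urllib.robotparser

# Assumed subset of crawlers and private paths, for illustration only.
AI_CRAWLERS = ["GPTBot", "ClaudeBot", "PerplexityBot", "OAI-SearchBot"]
PRIVATE_PATHS = ["/cart", "/checkout"]


def build_robots_txt(crawlers=AI_CRAWLERS, private_paths=PRIVATE_PATHS) -> str:
    """Build robots.txt text with one group per AI crawler plus a default group."""
    lines = []
    for agent in list(crawlers) + ["*"]:
        lines.append(f"User-agent: {agent}")
        # A crawler that matches its own group ignores the "*" group, so the
        # private paths are repeated in every group. Disallow rules come
        # first so first-match parsers honour them too.
        lines.extend(f"Disallow: {path}" for path in private_paths)
        lines.append("Allow: /")
        lines.append("")
    return "\n".join(lines)


def can_fetch(robots_txt: str, agent: str, url: str) -> bool:
    """Check whether agent may fetch url under the given robots.txt text."""
    parser = urllib.robotparser.RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(agent, url)
```

These rules only take effect once robots.txt itself is served without the challenge; until then no crawler can read them.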
No llms.txt file exists Medium
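A request for llms.txt returns the same 403 challenge page as every other URL, so no llms.txt is reachable by crawlers and the check cannot confirm whether one has been published. This file would help AI crawlers discover the site's content and structure.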
What to change: Create an llms.txt file that lists key pages and provides a brief description of the site for AI crawlers.
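A minimal sketch of what such a file could contain, following the layout proposed for llms.txt (an H1 title, a blockquote summary, then H2 sections of links). The title and tagline come from the report; the section name, URL, and note in the usage example are placeholders, not the site's real paths.

```python
from typing import Dict, List, Tuple

# Each link is (name, url, short note).
Link = Tuple[str, str, str]


def build_llms_txt(title: str, summary: str, sections: Dict[str, List[Link]]) -> str:
    """Render llms.txt text: H1 title, blockquote summary, H2 link lists."""
    lines = [f"# {title}", "", f"> {summary}", ""]
    for heading, links in sections.items():
        lines.append(f"## {heading}")
        lines.append("")
        for name, url, note in links:
            lines.append(f"- [{name}]({url}): {note}")
        lines.append("")
    # End the file with exactly one trailing newline.
    return "\n".join(lines).rstrip() + "\n"
```

For example, `build_llms_txt("Innermost", "Science-Backed Health Supplements", {"Products": [("The Lean Protein", "https://innermostglobal.com/products/example", "placeholder note")]})` yields a short Markdown file that can be served at `/llms.txt`.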
Frontier LLMs have zero knowledge of the brand High
A cold query about innermostglobal.com returns no information about the brand, its products, or its press mentions. The site claims coverage from Cosmopolitan, Metro, and Women's Fitness, but the model queried could not verify or surface any of it.
What to change: Make the site accessible to AI crawlers and publish structured data to help models index and understand the content.
Zero external signals from web search, Reddit, or Trustpilot High
Web searches for the brand, its products, and reviews return zero results. No Reddit discussions, Trustpilot reviews, or other third-party mentions were found, despite the site claiming press coverage.
What to change: Build external signals through PR, influencer partnerships, and review platforms to establish credibility and discoverability.
No Product schema on homepage despite 15+ products High
The archived homepage lists over 15 products (The Lean Protein, The Strong Protein, etc.) but contains no Product JSON-LD schema. Without it, AI crawlers have no machine-readable source for product details, pricing, or availability.
What to change: Add Product schema markup to all product pages, including name, description, price, and availability.
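The fields named above map onto schema.org's Product and Offer types. The sketch below shows one possible shape, built in Python for illustration; on a Shopify store the markup would more likely be emitted by the theme template. The URL, price, and currency used when calling it are placeholders, since the report records no pricing.

```python
import json


def product_jsonld(name: str, description: str, url: str,
                   price: float, currency: str, in_stock: bool = True) -> str:
    """Return Product JSON-LD with a nested Offer, serialized as a string."""
    availability = "InStock" if in_stock else "OutOfStock"
    data = {
        "@context": "https://schema.org",
        "@type": "Product",
        "name": name,
        "description": description,
        "url": url,
        "offers": {
            "@type": "Offer",
            # Price is formatted with two decimals and no currency symbol.
            "price": f"{price:.2f}",
            "priceCurrency": currency,
            "availability": f"https://schema.org/{availability}",
        },
    }
    return json.dumps(data, indent=2)
```

The returned string would be placed inside a `<script type="application/ld+json">` element on the corresponding product page.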
Schema references liveinnermost.com instead of innermostglobal.com Medium
The Organization and WebSite schema use liveinnermost.com as the @id and url, suggesting a domain migration or rebranding. This inconsistency can confuse crawlers about the canonical domain.
What to change: Update the schema to use innermostglobal.com as the canonical domain and set up proper redirects from liveinnermost.com.
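The schema update amounts to swapping the host in every URL-valued field. A hedged sketch of that rewrite over already-parsed JSON-LD follows; the function name is hypothetical, and mapping the `www.` variant onto the bare domain is an assumption about the desired canonical form.

```python
from urllib.parse import urlsplit, urlunsplit


def canonicalize(node, old_host="liveinnermost.com", new_host="innermostglobal.com"):
    """Recursively rewrite URLs on old_host to new_host inside JSON-LD data."""
    if isinstance(node, dict):
        return {key: canonicalize(value, old_host, new_host)
                for key, value in node.items()}
    if isinstance(node, list):
        return [canonicalize(item, old_host, new_host) for item in node]
    if isinstance(node, str):
        parts = urlsplit(node)
        # Only strings whose host is the old domain (with or without "www.")
        # are rewritten; names, emails, and other URLs pass through unchanged.
        if parts.hostname in (old_host, "www." + old_host):
            return urlunsplit(parts._replace(netloc=new_host))
    return node
```

Paths and fragments are preserved, so an `@id` such as `https://liveinnermost.com/#organization` (an illustrative value) becomes `https://innermostglobal.com/#organization`.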
Missing FAQPage, HowTo, and BreadcrumbList schema Medium
The site does not implement FAQPage, HowTo, or BreadcrumbList structured data, which are commonly used by AI crawlers to extract Q&A, instructions, and navigation context.
What to change: Add FAQPage schema for common questions, HowTo schema for product usage, and BreadcrumbList schema for navigation.
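For two of these types, a minimal sketch of the expected structure is shown below: a FAQPage built from question and answer pairs, and a BreadcrumbList built from an ordered trail. HowTo is left out for brevity. The helper names are hypothetical, and any questions, answers, and URLs passed in would need to match what is visible on the page.

```python
from typing import Dict, List, Tuple


def faq_jsonld(pairs: List[Tuple[str, str]]) -> Dict:
    """Build FAQPage JSON-LD from (question, answer) pairs."""
    return {
        "@context": "https://schema.org",
        "@type": "FAQPage",
        "mainEntity": [
            {
                "@type": "Question",
                "name": question,
                "acceptedAnswer": {"@type": "Answer", "text": answer},
            }
            for question, answer in pairs
        ],
    }


def breadcrumb_jsonld(trail: List[Tuple[str, str]]) -> Dict:
    """Build BreadcrumbList JSON-LD from an ordered list of (name, url)."""
    return {
        "@context": "https://schema.org",
        "@type": "BreadcrumbList",
        "itemListElement": [
            # Positions are 1-based and follow the order of the trail.
            {"@type": "ListItem", "position": position, "name": name, "item": url}
            for position, (name, url) in enumerate(trail, start=1)
        ],
    }
```

Both functions return plain dictionaries, so they can be serialized with `json.dumps` and embedded in the page in the same way as the site's existing Organization schema.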
Blog articles lack Article schema Medium
The blog sitemap lists 300+ articles, but no Article or NewsArticle schema was detected. This limits the ability of AI crawlers to surface blog content in knowledge panels or search results.
What to change: Add Article schema to all blog posts, including headline, datePublished, author, and image.
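The four properties listed above could be assembled as in the sketch below. Treating the author as an Organization is an assumption (a Person is equally valid where posts carry a byline), and the headline, date, and image URL supplied by a caller are placeholders.

```python
import json
from datetime import date


def article_jsonld(headline: str, published: date, author: str, image_url: str) -> str:
    """Return Article JSON-LD with headline, datePublished, author, and image."""
    data = {
        "@context": "https://schema.org",
        "@type": "Article",
        "headline": headline,
        # schema.org dates use ISO 8601, which date.isoformat() produces.
        "datePublished": published.isoformat(),
        "author": {"@type": "Organization", "name": author},
        "image": image_url,
    }
    return json.dumps(data, indent=2)
```

With 300+ posts, generating this from the blog's existing metadata in the theme template is likely more practical than adding it by hand.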
Sitemap is blocked by Cloudflare challenge High
The sitemap.xml and its sub-sitemaps return 403 to crawlers, preventing search engines and AI bots from discovering the site's URL structure.
What to change: Ensure sitemap.xml is accessible without a JS challenge and submit it to search engines.
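Once the index is reachable, its sub-sitemaps can be listed with a few lines of standard-library XML parsing, which gives a quick way to confirm that each one also loads. The sketch below assumes the index uses the standard sitemaps.org namespace, as Shopify's does; the function name is hypothetical.

```python
import xml.etree.ElementTree as ET
from typing import List

# Namespace defined by the sitemaps.org protocol.
SITEMAP_NS = {"sm": "http://www.sitemaps.org/schemas/sitemap/0.9"}


def child_sitemaps(index_xml: str) -> List[str]:
    """Return the <loc> URL of every <sitemap> entry in a sitemap index."""
    root = ET.fromstring(index_xml)
    return [
        loc.text.strip()
        for loc in root.findall("sm:sitemap/sm:loc", SITEMAP_NS)
        if loc.text
    ]
```

Each returned URL can then be requested in turn; any that still return 403 point to a challenge rule that has not been relaxed for that path.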
No discoverable social media or community discussion Medium
Searches for the brand on Reddit, Trustpilot, and the general web return zero results. The DNS TXT records show Klaviyo, Pinterest, and Amazon SES verification, and the schema links to Facebook, Twitter, and Instagram profiles, but no social media activity or community discussion surfaced in search.
What to change: Establish and promote social media profiles, engage in relevant communities, and encourage customer reviews.
Claimed press coverage is not verifiable online Medium
The site claims press mentions from Cosmopolitan, Metro, and Women's Fitness, but no search results or external references confirm these mentions.
What to change: Ensure press coverage is indexed online by securing backlinks from the publishers' websites.
What's working
- Organization and WebSite schema present on homepage — The archived homepage includes Organization and WebSite JSON-LD schema with name, address, contact email, and social links, providing basic identity information to crawlers.
- Shopify hosting with standard sitemap structure — The site uses Shopify, which provides a standard sitemap index with sub-sitemaps for products, pages, collections, and blogs, making URL discovery possible if access is granted.
- Regular site updates confirmed via Wayback Machine — The Wayback Machine shows snapshots from at least 2018 through early 2026, indicating the site is actively maintained with fresh content.
- DNS TXT records show email and social platform verification — TXT records include Klaviyo, Pinterest, and Amazon SES verification, indicating active email marketing and social commerce integrations.
- Detailed product descriptions on homepage — The archived homepage contains detailed descriptions for multiple products, including ingredients and benefits, which would be valuable for AI crawlers if accessible.
- Blog with 300+ articles — The blog sitemap lists over 300 articles, providing a substantial content base that could be leveraged for AI visibility if made accessible.
- Social media links included in schema — The Organization schema includes links to Facebook, Twitter, and Instagram, providing crawlers with social profile references.
- Contact information present in schema — The schema includes a London address and email contact, which helps establish business legitimacy for AI crawlers.