AI Site Grade

zoovu.com — AI Site Grade

Zoovu's site looks like a model of AI-crawler hospitality, yet ClaudeBot gets a 429 from Cloudflare while its sibling anthropic-ai passes cleanly: a WAF quirk that cuts one of Anthropic's crawlers off from the site's full content.

Zoovu's site demonstrates strong AI-crawler hospitality with a comprehensive llms.txt and permissive robots.txt, but suffers from a Cloudflare WAF quirk that rate-limits ClaudeBot, a near-empty JS-dependent blog, and a lack of SoftwareApplication schema on product pages, limiting AI visibility.

Findings: 8
Evidence checks: 25
Completed: 30 May 2026

Analysis

Crawler Access

The robots.txt explicitly allows GPTBot, Google-Extended, ClaudeBot, anthropic-ai, PerplexityBot, Bytespider, CCBot, Amazonbot, and facebookexternalhit to crawl everything. The llms.txt at /llms.txt returns a 200 with a comprehensive 70+ URL map spanning platform capabilities, industries, case studies, and integrations, one of the most complete llms.txt files observed. However, compare_bot_access reveals a split: ClaudeBot gets a 429 (rate-limited) from Cloudflare while anthropic-ai passes at 200 with full content. Bytespider gets a 403 (blocked) outright, despite its robots.txt allowance. All other AI bots tested (GPTBot, OAI-SearchBot, ChatGPT-User, Google-Extended, PerplexityBot, Perplexity-User, Applebot-Extended) return 200 with the same ~848KB payload as a browser. The site runs on Cloudflare + WP Engine and serves server-rendered HTML for its main pages, so those pages carry no JS-shell risk for crawlers; the blog is the exception (see Surprising Details).
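The gap between what robots.txt permits and what the server actually returns can be checked mechanically. The following is a minimal sketch, not the tool used for this report: `ROBOTS_TXT` is a hypothetical file modelled on the description above (the real file may differ), `OBSERVED` copies the status codes reported in this section, and `fetch_status` and `find_mismatches` are illustrative names. Real crawlers send longer User-Agent strings than the bare tokens used here.

```python
from urllib import error, request, robotparser

# Hypothetical robots.txt modelled on the report's description; the real file may differ.
ROBOTS_TXT = """\
User-agent: ClaudeBot
Allow: /

User-agent: anthropic-ai
Allow: /

User-agent: Bytespider
Allow: /

User-agent: GPTBot
Allow: /
"""

# Status codes as reported in this section.
OBSERVED = {"ClaudeBot": 429, "anthropic-ai": 200, "Bytespider": 403, "GPTBot": 200}


def fetch_status(url, user_agent, timeout=10.0):
    """Return the HTTP status seen when requesting url with the given User-Agent."""
    req = request.Request(url, headers={"User-Agent": user_agent})
    try:
        with request.urlopen(req, timeout=timeout) as resp:
            return resp.status
    except error.HTTPError as exc:
        return exc.code  # urllib raises on 4xx/5xx; the code is still the answer we want


def find_mismatches(robots_txt, observed, path="/"):
    """List bots that robots.txt allows but the server refuses at the HTTP layer."""
    parser = robotparser.RobotFileParser()
    parser.parse(robots_txt.splitlines())
    mismatches = []
    for bot, status in observed.items():
        if parser.can_fetch(bot, path) and status >= 400:
            mismatches.append((bot, status))
    return mismatches


print(find_mismatches(ROBOTS_TXT, OBSERVED))
# prints [('ClaudeBot', 429), ('Bytespider', 403)]
```

To collect live data instead of the recorded values, `fetch_status` could be called once per bot token and its results passed to `find_mismatches`. A 429 may also depend on request rate, so a single request per bot is only a rough probe.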

Cold-Knowledge Gap

The LLM prior knows Zoovu as a "product discovery and conversational commerce platform" founded in 2012 in Berlin with $50M+ funding from Wavecrest Growth Partners (2021), serving Microsoft, Bosch, and L'Oreal. The actual site tells a different story: the homepage schema lists foundingDate: 2008 and 200-500 employees from a Boston, MA address. The site positions Zoovu as an "AI-native ecommerce revenue engine" unifying search, guided selling, configurators, and data enrichment — a broader platform story than the cold model's narrower "product finder + chatbot" framing. The cold model knows nothing about the Zoe AI shopping assistant, the XGEN AI acquisition (May 2026), or the MCP Server launch (Dec 2025). The site's emphasis on B2B (CPQ, BOM, RFQ) is entirely absent from the model's prior.

Schema Posture

The homepage carries a rich Organization schema with founding date, employee range, address, social profiles, and contact info. The reviews page includes a SoftwareApplication schema with aggregateRating (4.8/5, 15 ratings). However, no product or solution page uses SoftwareApplication, WebApplication, or FAQPage schema. The pricing page, blog posts, and case studies all use only generic WebPage schema. The SoftwareApplication schema on the reviews page is the only structured-data signal that an AI engine could use to classify Zoovu as a software product — and it lives three clicks deep.
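A schema audit like the one above amounts to listing the `@type` values in each page's JSON-LD blocks. The sketch below shows one way to do that with the Python standard library; `JsonLdTypes`, `schema_types`, and the `SAMPLE` markup are illustrative assumptions, not the report's tooling or Zoovu's actual HTML.

```python
import json
from html.parser import HTMLParser


class JsonLdTypes(HTMLParser):
    """Collect @type values from <script type="application/ld+json"> blocks."""

    def __init__(self):
        super().__init__()
        self._in_jsonld = False
        self._buffer = []
        self.types = []

    def handle_starttag(self, tag, attrs):
        if tag == "script" and dict(attrs).get("type") == "application/ld+json":
            self._in_jsonld = True
            self._buffer = []

    def handle_data(self, data):
        if self._in_jsonld:
            self._buffer.append(data)

    def handle_endtag(self, tag):
        if tag == "script" and self._in_jsonld:
            self._in_jsonld = False
            try:
                self._collect(json.loads("".join(self._buffer)))
            except json.JSONDecodeError:
                pass  # skip malformed blocks rather than abort the audit

    def _collect(self, node):
        """Walk nested dicts and lists (e.g. @graph) and record every @type."""
        if isinstance(node, dict):
            value = node.get("@type")
            if isinstance(value, str):
                self.types.append(value)
            elif isinstance(value, list):
                self.types.extend(v for v in value if isinstance(v, str))
            for child in node.values():
                self._collect(child)
        elif isinstance(node, list):
            for item in node:
                self._collect(item)


def schema_types(html):
    """Return all JSON-LD @type values found in an HTML document."""
    parser = JsonLdTypes()
    parser.feed(html)
    parser.close()
    return parser.types


# Hypothetical page carrying only generic WebPage markup.
SAMPLE = """<html><head>
<script type="application/ld+json">
{"@context": "https://schema.org", "@type": "WebPage", "name": "Example product page"}
</script></head><body></body></html>"""

types = schema_types(SAMPLE)
print(types, "SoftwareApplication" in types)
# prints ['WebPage'] False
```

Running this over each product and solution URL would show which pages expose only `WebPage` and which carry a software-specific type.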

External Signals

The newsroom shows a steady cadence of partnerships (commercetools, Knack Systems, Euronics, Shopware, Microsoft Azure Marketplace), analyst recognition (IDC MarketScape 2024), and the XGEN AI acquisition (May 2026). DNS TXT records confirm verification tokens for OpenAI, Anthropic, Apple, Atlassian, and Langdock — indicating active management of AI-platform relationships. No Reddit threads or G2 review pages surfaced in search, suggesting limited organic off-domain conversation.

Surprising Details

The homepage datePublished is 2026-05-04, which the analysis flags as a future date and reads as a staging or placeholder timestamp. That date falls before this report's completion date of 30 May 2026, so the flag should be re-verified before acting on it. The /overview page is set to noindex, nofollow despite being a core platform landing page linked from the llms.txt. The blog at blog.zoovu.com redirects to zoovu.com/blog and renders only 15 words of visible text: a near-empty JS-dependent shell that AI crawlers cannot parse, even though the sitemap lists 60+ blog articles with rich content.
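Two of these checks, the robots meta directive and the visible-word count, can be reproduced on raw HTML without executing JavaScript, which approximates what a non-rendering crawler sees. This is a rough sketch under stated assumptions: `PageAudit`, `audit`, and `SAMPLE` are hypothetical, and the report does not say how its 15-word figure was computed, so this counting rule may not match it exactly.

```python
from html.parser import HTMLParser


class PageAudit(HTMLParser):
    """Record robots meta directives and count words outside non-visible tags."""

    SKIP = {"script", "style", "noscript", "template", "title"}

    def __init__(self):
        super().__init__()
        self._skip_depth = 0
        self.robots = set()
        self.words = 0

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIP:
            self._skip_depth += 1
        elif tag == "meta":
            attr = dict(attrs)
            if (attr.get("name") or "").lower() == "robots":
                content = attr.get("content") or ""
                self.robots.update(
                    part.strip().lower() for part in content.split(",") if part.strip()
                )

    def handle_endtag(self, tag):
        if tag in self.SKIP and self._skip_depth:
            self._skip_depth -= 1

    def handle_data(self, data):
        if not self._skip_depth:
            self.words += len(data.split())


def audit(html):
    """Return (robots directives, visible word count) for an HTML document."""
    parser = PageAudit()
    parser.feed(html)
    parser.close()
    return parser.robots, parser.words


# Hypothetical JS-shell page: almost no text until scripts run.
SAMPLE = """<html><head><title>Blog</title>
<meta name="robots" content="noindex, nofollow">
<script>window.__APP__ = {};</script></head>
<body><div id="root">Loading the blog</div></body></html>"""

robots, words = audit(SAMPLE)
print(sorted(robots), words)
# prints ['nofollow', 'noindex'] 3
```

A very low word count on a page that the sitemap describes as content-rich is the signal of a JS-dependent shell; a `noindex` directive on a page listed in llms.txt is the signal of conflicting instructions.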

Findings

  1. ClaudeBot rate-limited by Cloudflare WAF while anthropic-ai passes (High)

    ClaudeBot receives a 429 (rate-limited) from Cloudflare, preventing full content access, while anthropic-ai (another Anthropic crawler) returns 200 with full content. This inconsistency starves ClaudeBot of the site's content.

    What to change: Review Cloudflare WAF rules to ensure ClaudeBot is not rate-limited, matching the access granted to anthropic-ai.

  2. Bytespider blocked with 403 (Medium)

    Bytespider (ByteDance's crawler) receives a 403 (blocked) response, preventing any content access despite being allowed in robots.txt.

    What to change: Investigate and resolve the 403 for Bytespider to align with robots.txt permissions.

  3. Blog renders as near-empty JS shell for AI crawlers (High)

    The blog at blog.zoovu.com redirects to zoovu.com/blog and renders only 15 words of visible text, indicating a JavaScript-dependent shell that AI crawlers cannot parse. The sitemap lists 60+ blog articles with rich content, but crawlers cannot access them.

    What to change: Implement server-side rendering or static HTML for the blog to ensure AI crawlers can access the full article content.

  4. Core overview page set to noindex, nofollow (High)

    The /overview page, a core platform landing page linked from llms.txt, is set to noindex, nofollow, preventing AI crawlers from indexing it.

    What to change: Remove the noindex, nofollow directive from the /overview page to allow indexing.

  5. Product and solution pages lack SoftwareApplication schema (Medium)

    No product or solution page uses SoftwareApplication, WebApplication, or FAQPage schema. Only the reviews page has SoftwareApplication schema with aggregateRating. This limits AI engines' ability to classify Zoovu as a software product.

    What to change: Add SoftwareApplication or WebApplication schema to product and solution pages, and consider FAQPage schema for relevant content.

  6. Homepage datePublished set to future date (Low)

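    The homepage's datePublished is 2026-05-04. The analysis flags this as a future date, suggesting a staging or placeholder timestamp that could confuse AI crawlers, but the value falls before this report's completion date of 30 May 2026, so the flag should be re-verified.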

    What to change: Update the datePublished to the actual publication date or remove it if not applicable.

  7. Cold LLM knowledge outdated and incomplete (Medium)

    The LLM prior knows Zoovu as a 2012 Berlin startup with $50M+ funding, but the site shows founding in 2008, Boston HQ, and recent developments like XGEN AI acquisition and MCP Server launch. The cold model lacks awareness of B2B capabilities and Zoe AI assistant.

    What to change: Ensure consistent and up-to-date information across the site and external sources to improve LLM knowledge alignment.

  8. Limited organic off-domain conversation (Low)

    No Reddit threads or G2 review pages surfaced in web searches, indicating limited organic discussion about Zoovu on third-party platforms.
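For finding 5, the markup to add is a JSON-LD block per product or solution page. The sketch below generates one; it is illustrative only. The function name, the example URL, and the `applicationCategory` value are assumptions, and the description string reuses the positioning quoted in the analysis above. Rating data is deliberately left out, since it should only appear on pages that actually display the reviews.

```python
import json


def software_application_jsonld(name, url, description, category="BusinessApplication"):
    """Build a <script> tag carrying minimal SoftwareApplication JSON-LD."""
    data = {
        "@context": "https://schema.org",
        "@type": "SoftwareApplication",
        "name": name,
        "url": url,
        "description": description,
        "applicationCategory": category,  # assumed value; pick what fits the product
    }
    # Escape "</" so a stray "</script>" inside a value cannot end the tag early;
    # "\/" is a valid JSON escape for "/".
    body = json.dumps(data, indent=2).replace("</", "<\\/")
    return f'<script type="application/ld+json">\n{body}\n</script>'


# The URL below is a placeholder, not a real Zoovu page.
print(
    software_application_jsonld(
        name="Zoovu",
        url="https://example.com/platform",
        description="AI-native ecommerce revenue engine",
    )
)
```

The output can be pasted into a page template's head; validating it with a structured-data testing tool before rollout is advisable.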

What's working

  • Comprehensive llms.txt with 70+ URL map — The llms.txt file returns a 200 with a comprehensive map of over 70 URLs covering platform capabilities, industries, case studies, and integrations, providing AI crawlers with a clear entry point.
  • Permissive robots.txt allowing major AI bots — The robots.txt explicitly allows GPTBot, Google-Extended, ClaudeBot, anthropic-ai, PerplexityBot, and other AI crawlers to access the entire site.
  • Server-rendered HTML for main pages — Main pages serve server-rendered HTML with ~848KB payload, ensuring AI crawlers can parse content without JavaScript execution.
  • Rich Organization schema on homepage — The homepage includes a detailed Organization schema with founding date, employee range, address, social profiles, and contact info, aiding AI understanding.
  • SoftwareApplication schema with aggregate rating on reviews page — The reviews page includes SoftwareApplication schema with aggregateRating (4.8/5, 15 ratings), providing a strong signal for AI classification.
  • Active newsroom with recent partnerships and acquisitions — The newsroom shows a steady cadence of partnerships and the XGEN AI acquisition, demonstrating ongoing business activity.
  • DNS TXT records for AI platform verification — DNS TXT records confirm verification tokens for OpenAI, Anthropic, Apple, Atlassian, and Langdock, indicating active management of AI-platform relationships.
