AI Site Grade

soarwithus.co — AI Site Grade

Soar With Us is invisible to AI engines: zero structured data, no LLM knowledge, and no external citations despite claiming £600M+ in generated revenue.

The site lacks all structured data, has no AI-bot directives, and has zero external citations, making it invisible to AI engines despite strong content and client results.

Findings
10
Evidence checks
24
Completed
30 May 2026

Analysis

Soar With Us: AI-Visibility Audit

The cold LLM knowledge gap is total — a frontier model queried on "Soar With Us marketing agency" returned zero verifiable facts, no services, no clients, no case studies, no reputation signals. This is a brand claiming £600M+ in generated revenue and 250+ clients that is effectively invisible to AI engines without live retrieval.

Crawler Access

All major AI crawlers — GPTBot, ClaudeBot, PerplexityBot, Google-Extended, OAI-SearchBot, ChatGPT-User, anthropic-ai — receive a full 200 with identical byte payload (255KB) to a browser baseline. Only Bytespider is blocked (403 by Cloudflare). The robots.txt at https://www.soarwithus.co/robots.txt contains a single line (the sitemap URL) and zero AI-bot directives — no disallow, no crawl-delay, no mention of any crawler. The /llms.txt endpoint returns a 404 (Webflow's generic Not Found page). The site runs on Webflow behind Cloudflare (NS: cloudflare.com, A: 198.202.211.1), with HSTS enabled. No JS-rendering risk: all pages return rich HTML content to plain GET requests.

Content & Schema Posture

Zero JSON-LD structured data exists on any page examined — homepage, case studies, about, blog, service pages. No Organization, LocalBusiness, Service, FAQPage, Article, or BreadcrumbList schema is present. The homepage uses a single H1 ("FUELLING 8 & 9 FIGURE E-COMMERCE GROWTH") followed by a flat wall of H2/H3 headings with no semantic hierarchy. The blog contains substantive, long-form content (the personas article runs ~35K characters) but carries no Article schema, no author markup, no publish-date metadata that AI crawlers can parse reliably. The podcasts page has a broken canonical pointing to /case-studies instead of itself. The /email-sms-marketing URL, linked from the global navigation, returns a 404 Not Found.

Cold-Knowledge Gap

The LLM's prior is a blank slate. The site positions itself as a full-funnel DTC growth agency with proprietary "ARC marketing system," a "Creative Growth Engine" process, and named clients including AKT London, Cowshed, Mindful Chef, Lapland UK, Nuovva, and Spacegoods. The case studies page documents 15+ client results with specific metrics (e.g., "CPA down 59%," "revenue up 314%"). None of this data is represented in structured schema. The site also operates a podcast ("D2C Diaries") with 10+ episodes featuring AI, Meta strategy, and DTC growth topics — a content asset that would naturally feed AI knowledge bases but is entirely absent from the model's cold recall.

External Signals

Web searches for "Soar With Us" marketing agency Leeds and "Soar With Us" agency reviews returned zero results on DuckDuckGo. No third-party review sites, no press mentions, no Reddit threads, no Clutch/GoodFirms listings surfaced. The only external footprint is a LinkedIn company page and Instagram/Facebook profiles linked from the site footer. The store subdomain (store.soarwithus.co) is behind a Cloudflare challenge wall (403 with "Verifying your connection..."). The DNS TXT records show HubSpot email and Google Workspace usage, plus a proxy-ssl.webflow.com entry confirming the Webflow hosting stack.

Findings

  1. Zero JSON-LD structured data on any page High

    No Organization, LocalBusiness, Service, FAQPage, Article, or BreadcrumbList schema exists on the homepage, case studies, about, blog, or service pages. This prevents AI crawlers from extracting entity relationships and factual claims.

    What to change: Add JSON-LD structured data for Organization, LocalBusiness, Service, Article, and BreadcrumbList across all relevant pages.

  2. Complete cold-knowledge gap for the brand High

    A frontier LLM queried on 'Soar With Us marketing agency' returned zero verifiable facts — no services, clients, case studies, or reputation signals. The brand is effectively invisible to AI engines without live retrieval.

    What to change: Implement structured data and build external citations (press, reviews, directories) to populate AI knowledge bases.

  3. Robots.txt has no AI-bot directives Medium

    The robots.txt file contains only a sitemap URL and zero rules for any AI crawler (GPTBot, ClaudeBot, PerplexityBot, etc.). While this allows access, it misses the opportunity to guide crawlers to priority content.

    What to change: Add explicit allow/disallow rules for AI crawlers and reference the sitemap with crawl-delay hints.

  4. Missing /llms.txt endpoint Medium

    The /llms.txt endpoint returns a 404. This file is used by AI crawlers to discover key content and context about the site.

    What to change: Create an /llms.txt file listing key pages (homepage, case studies, about, blog) and a brief description of the agency.

  5. Podcasts page has broken canonical URL Medium

    The podcasts page's canonical URL points to /case-studies instead of itself, which can confuse crawlers about the page's identity and dilute indexing signals.

    What to change: Update the canonical tag on the podcasts page to point to its own URL.

  6. Email/SMS marketing page returns 404 Medium

    The /email-sms-marketing page, linked from the global navigation, returns a 404 Not Found. This creates a dead end for users and crawlers.

    What to change: Restore the page or remove the navigation link to avoid 404 errors.

  7. Zero external citations from web searches High

    Searches for the agency name and reviews returned no results on DuckDuckGo. No third-party review sites, press mentions, or directory listings were found, limiting off-site signals for AI knowledge.

    What to change: Build citations on review platforms (Clutch, GoodFirms), industry blogs, and press releases to establish off-site authority.

  8. Store subdomain blocked by Cloudflare challenge Low

    The store.soarwithus.co subdomain returns a 403 with a Cloudflare challenge wall, preventing crawlers from accessing any content there.

    What to change: If the store contains public content, allow crawler access by adjusting Cloudflare settings or adding a crawl-delay rule.

  9. Blog posts lack Article schema and metadata Medium

    Long-form blog content (e.g., the personas article) has no Article schema, author markup, or publish-date metadata, reducing its discoverability and trust signals for AI crawlers.

    What to change: Add Article schema with author, datePublished, and headline to all blog posts.

  10. No BreadcrumbList schema on any page Low

    The site does not implement BreadcrumbList structured data, which helps AI crawlers understand site hierarchy and navigation paths.

    What to change: Add BreadcrumbList schema to all pages to improve navigation understanding.

What's working

  • All major AI crawlers receive full access — GPTBot, ClaudeBot, PerplexityBot, Google-Extended, OAI-SearchBot, ChatGPT-User, and anthropic-ai all receive a 200 response with identical content to a browser, ensuring no access barriers.
  • Rich HTML content without JS rendering dependency — All pages return substantial HTML content to plain GET requests, so AI crawlers can parse text without needing JavaScript rendering.
  • Long-form, substantive blog content — The blog contains in-depth articles (e.g., 35K characters on personas) that provide rich textual material for AI crawlers to index and learn from.
  • Detailed case studies with specific metrics — The case studies page and individual case studies (e.g., AKT London) document concrete results with metrics like CPA reduction and revenue increase, providing strong factual content for AI extraction.
  • Podcast content asset (D2C Diaries) — The site hosts a podcast with 10+ episodes covering AI, Meta strategy, and DTC growth, which could feed AI knowledge bases if properly structured.
  • Sitemap present with 80 URLs — The sitemap.xml is accessible and lists 80 URLs, providing a clear map of site content for crawlers.
  • HSTS enabled for secure connections — The site enforces HTTPS with HSTS, ensuring secure communication and trust signals for crawlers.

Track soarwithus.co across AI search

This is one snapshot. Open the interactive report to inspect evidence, or grade another site free.

Open this AI Site Grade Grade another site Track your brand