AI Site Grade

growtherapy.com — AI Site Grade

Grow Therapy's sitemap and blog are broken, hiding content from AI crawlers despite full bot access.

Grow Therapy allows all major AI crawlers but has a 404 sitemap, missing llms.txt, and a blog where most article URLs return 404, severely limiting content discovery.

Findings: 7
Evidence checks: 28
Completed: 30 May 2026

Analysis

---

Grow Therapy: AI crawlers get full access but find a ghost-town blog and a broken sitemap

The site's most consequential AI-visibility problem is not blocked bots — every major AI crawler gets a 200 with full content — but a sitemap that 404s, an llms.txt that 404s, and a blog section where the majority of listed article URLs return 404 pages, meaning AI engines cannot discover or index the site's own content.

Crawler Access

All major AI crawlers — GPTBot, ClaudeBot, PerplexityBot, Google-Extended, OAI-SearchBot, ChatGPT-User, anthropic-ai, Applebot-Extended — receive a 200 response with the same full HTML payload as a browser (808 KB). Only Bytespider (ByteDance) gets a 403 from CloudFront. The robots.txt uses a catch-all User-agent: * with no AI-specific rules, allowing everything except API and Next.js chunk paths. The site runs on nginx behind AWS CloudFront with Next.js, and pages render server-side — no JS-shell risk for crawlers.

Sitemap and Discovery

The robots.txt references Sitemap: https://growtherapy.com/wp/post-sitemap.xml, which redirects to a 404 (/start/post-sitemap.xml). The root sitemap.xml also 404s. This means no XML sitemap is discoverable by any crawler. The llms.txt returns a 404 (serving the full HTML error page, 648 KB). Without a sitemap or llms.txt, AI crawlers must rely entirely on internal link structure and external backlinks to find pages — and the blog listing page links to article URLs that largely return 404.

Cold-Knowledge Gap

The LLM knows Grow Therapy as a venture-funded (Sequoia, $100M+) online therapy platform founded in 2020 by Jake Cooper and Alan Ni, accepting insurance including Medicaid, with some mixed reviews on billing. The site itself never mentions Sequoia Capital, the $150M funding round, or the founders by name on any fetched page. The "About" page describes mission and values but omits founding story, funding, or scale metrics. The "How it Works" page does cite 10M+ therapy sessions delivered, 26,000+ providers, 125+ insurance plans, and a 4.7 TrustPilot score — but none of these appear in structured schema.

Schema Posture

The homepage and all subpages carry a WebPage + WebSite + Organization schema graph with sameAs links to social profiles. The therapist state pages (e.g., California) add SearchResultsPage schema with embedded Person entries for individual providers — strong for local SEO. However, no page uses FAQPage schema despite the homepage and "How it Works" page containing FAQ sections with visible Q&A content. No Product, SoftwareApplication, or MedicalBusiness schema is present, which would help AI engines classify the platform.

Broken Blog Content

The blog listing page (/blog/) displays article titles including "Does Grow Therapy work? Here's what real patient data shows," "The 12 best online therapy platforms of 2026," and "Grow Therapy introduces AI coach with clinician oversight" — but all three of these URLs return 404. Of 8 blog posts sampled from the listing, 5 returned 404. The only working blog posts found were /blog/psychology-today-listing/ and a few others. This means AI crawlers following links from the blog index encounter a wall of 404s, wasting crawl budget and losing the content that would substantiate the brand's authority claims.

Findings

XML sitemap returns 404, blocking crawler discovery High
The robots.txt references a sitemap at /wp/post-sitemap.xml which redirects to a 404. The root sitemap.xml also 404s. No XML sitemap is discoverable, forcing crawlers to rely on internal links and backlinks.
What to change: Generate a valid XML sitemap covering all important pages and update the robots.txt reference to the correct URL.
llms.txt returns 404, missing AI discovery file Medium
The llms.txt file at the root returns a 404, serving a full HTML error page. This file helps AI crawlers discover key content and is absent.
What to change: Create an llms.txt file listing the most important pages for AI crawlers.
Majority of blog article URLs return 404 High
The blog listing page links to articles such as 'Does Grow Therapy work?', 'The 12 best online therapy platforms of 2026', and 'Grow Therapy introduces AI coach' — all return 404. Of 8 sampled posts, 5 are broken. Crawlers following these links encounter dead ends.
What to change: Restore the broken blog posts or remove their links from the blog listing to avoid crawl errors.
FAQ sections lack FAQPage structured data Medium
The homepage and 'How it Works' page contain visible FAQ sections with questions and answers, but no FAQPage schema is present. This prevents AI engines from extracting Q&A content for rich results.
What to change: Add FAQPage schema markup to pages with FAQ content.
About page omits funding, founders, and scale metrics Medium
The About page describes mission and values but does not mention Sequoia Capital, the $150M funding round, founders Jake Cooper and Alan Ni, or key metrics like 10M+ sessions delivered. This information is known to LLMs but not present on the site.
What to change: Add a brief history, funding details, and scale metrics to the About page.
No Product or SoftwareApplication schema for the platform Low
The site uses WebPage, WebSite, and Organization schema but lacks Product or SoftwareApplication schema, which would help AI engines classify the platform as a software product.
What to change: Add SoftwareApplication schema to describe the platform.
Bytespider (ByteDance) is blocked by CloudFront Low
The Bytespider crawler receives a 403 response, preventing it from indexing the site. While not a major AI crawler, this blocks potential visibility in ByteDance's AI products.
What to change: Allow Bytespider access if desired.

What's working

All major AI crawlers receive full HTML content — GPTBot, ClaudeBot, PerplexityBot, Google-Extended, and others get a 200 response with the same full HTML as a browser. No AI-specific blocking in robots.txt.
Pages are server-side rendered with no JS-shell risk — The site uses Next.js with server-side rendering, so crawlers receive full HTML content without needing JavaScript execution.
Organization schema with sameAs links is present on all pages — Every page includes WebPage, WebSite, and Organization schema with sameAs links to social profiles, helping AI engines understand the entity.
Therapist state pages use SearchResultsPage and Person schema — Pages like /therapists/california include SearchResultsPage schema with embedded Person entries for individual providers, strong for local SEO and AI understanding.
How it Works page cites 10M+ sessions, 26K+ providers, 125+ insurance plans — The page includes compelling scale metrics that can be used by AI engines to describe the platform, though not in structured data.

Track growtherapy.com across AI search

This is one snapshot. Open the interactive report to inspect evidence, or grade another site free.

Open this AI Site Grade Grade another site Track your brand

Analysis

Grow Therapy: AI crawlers get full access but find a ghost-town blog and a broken sitemap

Crawler Access

Sitemap and Discovery

Cold-Knowledge Gap

Schema Posture

Broken Blog Content

Findings

XML sitemap returns 404, blocking crawler discovery High

llms.txt returns 404, missing AI discovery file Medium

Majority of blog article URLs return 404 High

FAQ sections lack FAQPage structured data Medium

About page omits funding, founders, and scale metrics Medium

No Product or SoftwareApplication schema for the platform Low

Bytespider (ByteDance) is blocked by CloudFront Low

What's working

Track growtherapy.com across AI search