AI Site Grade
watershed.com — AI Site Grade
Watershed's site is fully open to AI crawlers, but cold LLM knowledge is stuck in 2023: the company's pivot to an AI sustainability platform, its AI agents, and its expanded customer base are invisible to LLMs.
- Findings: 7
- Evidence checks: 23
- Completed: 30 May 2026
Analysis
Watershed's site is fully open to every AI crawler, but the cold-knowledge gap shows LLMs still describing the 2023 company while the site presents the 2026 one.
Crawler Access
Every major AI bot tested (GPTBot, ClaudeBot, PerplexityBot, Google-Extended, OAI-SearchBot, ChatGPT-User, Bytespider, Applebot-Extended, anthropic-ai) receives a 200 response with the same 585KB payload a browser gets. No UA-based blocking exists. The robots.txt uses a single wildcard group (Allow: /) that disallows only /cdn-cgi/, /api/, and /preview/. No AI-specific directives are present. The site runs on Vercel (Next.js SSR), so all crawlers get server-rendered HTML with ~840 words of visible text on the homepage and there is no JS-shell risk. However, /llms.txt returns a 404 (Next.js error page), and the trust.watershed.com subdomain is a JS-rendered shell that returns zero extracted text to a plain GET.
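The robots.txt behaviour described above can be reproduced offline. The following is a minimal sketch, not the audit's actual tooling: the rule list is reconstructed from the description (the live file's exact wording and order are assumptions), and /api/status is a hypothetical path used only to exercise a disallowed prefix. Python's standard-library parser applies the first matching rule, so the Disallow lines are placed before Allow: / here.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the report's description; the live file may differ.
# Python's parser is first-match, so Disallow rules come before "Allow: /".
ROBOTS_LINES = [
    "User-agent: *",
    "Disallow: /cdn-cgi/",
    "Disallow: /api/",
    "Disallow: /preview/",
    "Allow: /",
]

AI_BOTS = [
    "GPTBot", "ClaudeBot", "PerplexityBot", "Google-Extended",
    "OAI-SearchBot", "ChatGPT-User", "Bytespider",
    "Applebot-Extended", "anthropic-ai",
]


def access_matrix(lines, bots, paths):
    """Return {bot: {path: allowed}} according to the given robots.txt lines."""
    parser = RobotFileParser()
    parser.parse(lines)
    return {bot: {p: parser.can_fetch(bot, p) for p in paths} for bot in bots}


# "/api/status" is a hypothetical path under a disallowed prefix.
matrix = access_matrix(ROBOTS_LINES, AI_BOTS, ["/", "/platform", "/api/status"])
for bot, result in matrix.items():
    print(bot, result)
```

Because no group names a specific bot, every user agent falls through to the wildcard group and gets the same answer. This only models robots.txt policy; it does not check the HTTP status or payload a bot actually receives.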
Cold-Knowledge Gap
The LLM's prior knowledge of Watershed is stale by roughly three years. It describes the company as a carbon accounting platform co-founded by former Stripe employees, listing Stripe, Shopify, and Airbnb as notable clients and mentioning a 2023 carbon-offset accuracy controversy. The actual site tells a different story: Watershed now brands itself as "the sustainability AI platform" with AI agents for data cleaning, analysis, and report drafting. The customer page lists 25+ case studies (Canva, e.l.f. Beauty, Medtronic, Burton, Delivery Hero, Figma, Pinterest, Carlyle, CDP) — none of which are Stripe, Shopify, or Airbnb. The homepage claims 90+ Fortune 500 companies as customers. The cold model knows nothing about the AI agent product line, the Watershed AI Fellowship, the Cornerstone Sustainability Data Initiative, or the Verdantix Green Quadrant leadership designation.
Schema Posture
The site has strong, consistent JSON-LD schema across all major pages. Every page carries Organization, WebSite (with SearchAction), BreadcrumbList, and SoftwareApplication types. The SoftwareApplication schema includes applicationCategory: "BusinessApplication", applicationSubCategory: "Sustainability & Carbon Management", audience, award, and countryOfOrigin. The /platform and /solutions/csrd pages additionally include FAQPage schema with detailed Q&A entries (7-10 questions each). The /platform/sustainability-ai page also has FAQPage with 10 questions covering AI technologies, testing, security, and hallucination safeguards. No Product schema is used for individual solution offerings, and the blog posts lack Article or BlogPosting schema.
External Signals
The site has an openai-domain-verification TXT record (dv-cd3kKKCmmkEs4qQCSlt87yfn), which shows the domain has been verified with OpenAI; the record by itself does not control or confirm crawler access. Google Workspace handles email. The DNS resolves to a single IP (216.150.1.1) via Google Cloud DNS. Web searches for reviews, Reddit discussions, and G2 listings returned zero results. This suggests the brand has minimal third-party review presence on public platforms, which may limit the breadth of AI training data about its actual product quality.
Findings
Cold LLM knowledge is three years out of date (High)
LLM prior knowledge describes Watershed as a carbon accounting platform with Stripe, Shopify, and Airbnb as clients, and references a 2023 controversy. The actual site now brands itself as a sustainability AI platform with AI agents, 90+ Fortune 500 customers, and no mention of those legacy clients.
What to change: Publish an /llms.txt file and a structured knowledge base (e.g., a dedicated AI-facing page) that explicitly states the current product positioning, customer list, and key milestones, so that crawlers feeding retrieval and future training runs find the current facts.
Missing /llms.txt file (Medium)
The site returns a 404 for /llms.txt, missing an opportunity to provide AI crawlers with a curated, up-to-date summary of the company and its offerings.
What to change: Create an /llms.txt file with a concise overview of Watershed, its AI platform, customer base, and key differentiators.
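As an illustration of what such a file might contain, here is a minimal sketch that renders an llms.txt body in the layout used by the community llms.txt proposal (an H1 name, a blockquote summary, then H2 sections of links). The helper name render_llms_txt is hypothetical, the link notes are placeholders for Watershed to write, and only paths mentioned in this report are used.

```python
def render_llms_txt(name, summary, sections):
    """Render an llms.txt body: H1 title, blockquote summary, H2 link sections.

    sections maps a heading to a list of (title, url, note) tuples.
    """
    lines = [f"# {name}", "", f"> {summary}", ""]
    for heading, links in sections.items():
        lines.append(f"## {heading}")
        lines.append("")
        for title, url, note in links:
            lines.append(f"- [{title}]({url}): {note}")
        lines.append("")
    return "\n".join(lines).rstrip() + "\n"


BASE = "https://watershed.com"

# Notes are placeholders; the real descriptions should come from the site owner.
draft = render_llms_txt(
    "Watershed",
    "The sustainability AI platform.",
    {
        "Platform": [
            ("Platform", f"{BASE}/platform", "PLACEHOLDER: platform overview"),
            ("Sustainability AI", f"{BASE}/platform/sustainability-ai",
             "PLACEHOLDER: AI agents overview"),
        ],
        "Solutions": [
            ("CSRD", f"{BASE}/solutions/csrd", "PLACEHOLDER: CSRD solution"),
        ],
    },
)
print(draft)
```

The output would be served as plain text at /llms.txt. Whether a given AI crawler reads the file is not guaranteed, so it complements rather than replaces accurate on-page content.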
Trust subdomain renders as a JS shell (High)
The trust.watershed.com subdomain returns a 200 status but zero extracted text, indicating a JavaScript-rendered shell that AI crawlers cannot parse.
What to change: Ensure the trust subdomain serves server-rendered HTML with meaningful content, or add a static fallback for crawlers.
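One way to verify a fix is to measure how much visible text the raw HTML carries before any JavaScript runs. The sketch below is an assumption about how such a check could be done, not the method used in this audit; both HTML samples are invented stand-ins, and the set of skipped tags is a design choice.

```python
from html.parser import HTMLParser


class VisibleText(HTMLParser):
    """Collect text outside tags whose content is not rendered as body text."""

    # title is skipped so that only body text counts toward the total.
    SKIP = {"script", "style", "template", "title"}

    def __init__(self):
        super().__init__()
        self._skip_depth = 0
        self.chunks = []

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIP:
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in self.SKIP and self._skip_depth:
            self._skip_depth -= 1

    def handle_data(self, data):
        if not self._skip_depth and data.strip():
            self.chunks.append(data.strip())


def visible_word_count(html):
    """Count whitespace-separated words a non-JS client could read."""
    parser = VisibleText()
    parser.feed(html)
    parser.close()
    return len(" ".join(parser.chunks).split())


# Invented samples: a client-rendered shell and a server-rendered page.
SHELL = ('<html><head><title>Trust</title><script src="/app.js"></script>'
         '</head><body><div id="root"></div></body></html>')
RENDERED = ('<html><body><main><h1>Security overview</h1>'
            '<p>Policies and reports are listed here.</p></main></body></html>')

print(visible_word_count(SHELL))     # 0
print(visible_word_count(RENDERED))  # 8
```

A count of zero on the raw response is the signature of a JS shell; any server-rendered or static fallback content would raise it above zero.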
Blog posts lack Article or BlogPosting schema (Medium)
Blog pages do not include Article or BlogPosting structured data, reducing their visibility in AI-powered search and knowledge panels.
What to change: Add Article or BlogPosting JSON-LD schema to all blog posts, including headline, datePublished, author, and image.
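A minimal sketch of generating that markup follows. The function name and every value passed to it are placeholders (example.com URLs, an invented author and date); in a Next.js site the same object would more likely be emitted from the page template, which is not shown here.

```python
import json


def blog_posting_jsonld(headline, date_published, author_name, image_url, url):
    """Build a <script> tag holding BlogPosting JSON-LD for one post."""
    data = {
        "@context": "https://schema.org",
        "@type": "BlogPosting",
        "headline": headline,
        "datePublished": date_published,  # ISO 8601 date
        "author": {"@type": "Person", "name": author_name},
        "image": image_url,
        "mainEntityOfPage": url,
    }
    # Escape "<" so a value containing "</script>" cannot close the tag early.
    payload = json.dumps(data, indent=2).replace("<", "\\u003c")
    return '<script type="application/ld+json">' + payload + "</script>"


# All values below are placeholders for illustration only.
snippet = blog_posting_jsonld(
    headline="Example post title",
    date_published="2026-01-15",
    author_name="Jane Doe",
    image_url="https://example.com/cover.png",
    url="https://example.com/blog/example-post",
)
print(snippet)
```

The four properties named in the recommendation (headline, datePublished, author, image) are all present; publisher and dateModified could be added in the same way.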
No Product schema for solution offerings (Low)
Individual solution pages (e.g., /platform, /solutions/csrd) lack Product schema, which could help AI systems understand and recommend specific offerings.
What to change: Add Product schema to solution pages with name, description, offers, and category.
Minimal third-party review presence (Medium)
Web searches for reviews, Reddit discussions, and G2 listings returned zero results, limiting the breadth of AI training data about product quality.
What to change: Encourage customers to leave reviews on G2, Capterra, and other platforms; engage in relevant Reddit communities.
Robots.txt lacks AI-specific directives (Low)
The robots.txt uses a single wildcard rule and does not explicitly allow or disallow AI crawlers, though all tested bots currently have access.
What to change: Add explicit directives for AI crawlers (e.g., GPTBot, ClaudeBot) to ensure continued access and signal intent.
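One detail matters when adding named groups: a crawler that matches its own group ignores the wildcard group, so the existing Disallow rules have to be repeated in each named group or those paths become open to that bot. The sketch below builds such a file and checks it with Python's standard-library parser; the helper name and the choice of GPTBot and ClaudeBot are illustrative, and the rule order follows that parser's first-match behaviour.

```python
from urllib.robotparser import RobotFileParser

# Paths the report says are currently disallowed for all user agents.
DISALLOWED = ["/cdn-cgi/", "/api/", "/preview/"]


def build_robots(named_bots):
    """Emit one group per named bot plus a wildcard group, same rules in each."""
    lines = []
    for agent in [*named_bots, "*"]:
        lines.append(f"User-agent: {agent}")
        lines += [f"Disallow: {path}" for path in DISALLOWED]
        lines.append("Allow: /")
        lines.append("")  # blank line ends the group
    return "\n".join(lines)


robots_txt = build_robots(["GPTBot", "ClaudeBot"])
print(robots_txt)

# Sanity check: named bots keep access to pages but not to disallowed prefixes.
parser = RobotFileParser()
parser.parse(robots_txt.splitlines())
print(parser.can_fetch("GPTBot", "/platform"))     # True
print(parser.can_fetch("GPTBot", "/api/anything")) # False
```

Explicit groups document intent but do not change what a compliant crawler may fetch today, since the wildcard group already allows the same paths.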
What's working
- All major AI crawlers have unrestricted access: Every tested AI bot receives a 200 response with full server-rendered HTML, identical to what a browser receives. No UA-based blocking exists.
- Consistent JSON-LD schema across major pages: All major pages include Organization, WebSite, BreadcrumbList, and SoftwareApplication schema. FAQPage schema is present on key product pages.
- OpenAI domain verification record present: An openai-domain-verification TXT record shows the domain has been verified with OpenAI.
- Server-rendered HTML avoids JS-shell issues: The site runs on Vercel with Next.js SSR, delivering meaningful HTML content to all crawlers without requiring JavaScript execution.
- Sitemap covers 80 URLs with index: The sitemap is accessible and contains 80 URLs, providing good coverage for crawlers to discover content.