AI Site Grade
carnegielearning.com — AI Site Grade
Carnegie Learning's site is fully accessible to AI crawlers but lacks any structured data, missing schema, no /llms.txt, and a cold-knowledge gap around its CLEAR product rebrand.
Carnegie Learning's site is fully accessible to AI crawlers but lacks any structured data, missing schema, no /llms.txt, and a cold-knowledge gap around its CLEAR product rebrand.
- Findings
- 8
- Evidence checks
- 20
- Completed
- 30 May 2026
Analysis
AI crawlers see the full site — but find almost nothing machine-readable
Every AI bot tested — GPTBot, ClaudeBot, PerplexityBot, Google-Extended, OAI-SearchBot, Bytespider, Applebot-Extended — receives a 200 with identical byte payload (108,921 bytes) as a browser visit. Cloudflare serves the site without any UA-based blocking. The robots.txt contains no AI-bot directives whatsoever; the wildcard User-agent: * only blocks HubSpot preview paths and a review page. No AI crawler is throttled, disallowed, or redirected. Yet the site is structurally invisible to AI engines in the ways that matter.
Schema Vacuum
Across the homepage, MATHia product page, Lenses on Literature page, research page, and careers page — zero JSON-LD schemas of any kind. The only structured data found anywhere on the domain is a single VideoObject on the blog listing page (for a German-language pancake video). No Organization, Product, Course, FAQPage, HowTo, or WebSite schema exists. The homepage has no @type declarations at all. An AI crawler landing on carnegielearning.com gets rich visible text but no machine-readable entity graph to anchor the brand's identity, products, or claims.
Cold-Knowledge Gap
The cold LLM knows Carnegie Learning as a CMU-spinoff math software company (MATHia, Fast ForWord, ClearFluency) founded in 1998, acquired by CIP Capital in 2023. The actual site has been aggressively rebranded around the "CLEAR" family — CLEAR Math, CLEAR Literacy, CLEAR Languages, CLEAR Services — a naming architecture the cold model does not reference at all. The site positions itself as a full K-12 curriculum provider spanning math, ELA, world languages, and professional services, serving 5.5 million students. The cold model's prior is narrower (math-focused, supplemental software). The site's major new products — Lenses on Literature (all-green EdReports 2026), ClearMath Elementary (all-green EdReports 2026), ClearTalk (AI language tool, 2025 award) — are entirely absent from the model's knowledge. The 2023 PE acquisition is mentioned by the model but nowhere on the site (the press page, about page, and careers page omit it entirely).
Missing AI Infrastructure
/llms.txt returns a 404 (HubSpot 404 page). No AI content map exists. The sitemap is a standard HubSpot-generated index with 735+ URLs, but no AI-specific sitemap or content prioritization. The blog publishes future-dated posts (May 4, 2026, Apr 20, 2026) — a temporal anomaly that may confuse recency-sensitive AI retrievers. The homepage relies on a video element (Your browser does not support the video tag fallback text visible) and a JavaScript-driven solution-finder widget, though the core text renders server-side.
External Signals
The press room lists recent wins: EdReports all-green for Lenses on Literature (Mar 2026) and ClearMath Elementary (Dec 2025), a federal EIR grant for adolescent literacy (Mar 2026), Florida state approval for world language titles (Jan 2026), and California adoption of K-8 math (Nov 2025). The company claims $90M+ in grant funding from Gates Foundation, Walton Family Foundation, and U.S. Department of Education. DNS records reveal a heavy SaaS stack: HubSpot (primary CMS/email), AWS (CloudFront/Route53), Mimecast, Salesforce, Marketo, OneTrust, FormAssembly, and Adobe. No negative external signals surfaced in search; the brand's online reputation is clean but thin — few independent reviews or Reddit discussions appear in current search results.
Findings
Zero JSON-LD structured data across the site High
No Organization, Product, Course, FAQPage, or WebSite schema found on any page. The only structured data is a VideoObject on the blog listing page for an unrelated video. AI crawlers receive rich text but no machine-readable entity graph.
What to change: Add JSON-LD schemas for Organization, Product (MATHia, ClearMath, Lenses on Literature), Course, and WebSite on relevant pages. Use the homepage to declare the brand entity with logo, description, and social profiles.
No /llms.txt file for AI content discovery High
The /llms.txt endpoint returns a 404 (HubSpot default). There is no AI-specific content map or guidance for language models to discover key pages.
What to change: Create an /llms.txt file listing the most important pages (homepage, product pages, research, press releases) with brief descriptions to guide AI crawlers.
Cold LLM knowledge does not reflect CLEAR product rebrand High
The cold model knows Carnegie Learning as a CMU-spinoff math software company (MATHia, Fast ForWord) but the site now promotes the CLEAR family (CLEAR Math, CLEAR Literacy, CLEAR Languages, CLEAR Services). New products like Lenses on Literature and ClearMath Elementary are absent from the model's knowledge.
What to change: Publish structured data and authoritative content (e.g., Wikipedia page, press releases with schema) to update the model's knowledge of the CLEAR product line and recent EdReports accreditations.
Robots.txt lacks any AI-bot directives Medium
The robots.txt file has no rules for GPTBot, ClaudeBot, PerplexityBot, or other AI crawlers. The wildcard rule only blocks HubSpot preview paths and a review page. While this allows access, it misses the opportunity to guide AI crawlers to important content.
What to change: Add explicit directives for AI bots, such as allowing all and optionally pointing to the /llms.txt file via a comment or crawl-delay.
Blog contains future-dated posts that may confuse AI retrievers Medium
The blog lists posts with dates in 2026 (e.g., May 4, 2026, Apr 20, 2026). Recency-sensitive AI retrievers may treat these as current or misattribute timeliness.
What to change: Remove or correct future dates on blog posts, or use a 'last updated' field that reflects actual publication date.
2023 PE acquisition not mentioned on the site Medium
The cold model knows about the CIP Capital acquisition in 2023, but the site's press page, about page, and careers page omit this information entirely. This creates a discrepancy between model knowledge and site content.
What to change: Add a brief mention of the acquisition on the About page or Press page to align site content with external knowledge.
Limited independent reviews and third-party coverage Medium
Web searches for Carnegie Learning reviews, Reddit discussions, and recent news returned zero results. The brand's online reputation is clean but thin, reducing external signals that AI models might use for validation.
What to change: Encourage customer reviews on third-party platforms (G2, EdSurge, Reddit) and publish case studies to generate more external mentions.
No AI-specific sitemap or content prioritization Low
The sitemap is a standard HubSpot-generated index with 735+ URLs, but there is no separate sitemap for AI crawlers highlighting key pages like product pages, research, or press releases.
What to change: Create a dedicated sitemap for AI crawlers (e.g., /sitemap-ai.xml) that lists the most important pages with appropriate priority and change frequency.
What's working
- All AI crawlers receive full site content without blocking — Every major AI bot tested receives a 200 response with identical content as a browser visit. No UA-based blocking, throttling, or redirects. The site is fully open to AI crawlers.
- Key pages contain substantial readable text for AI extraction — Homepage, product pages, research page, and blog posts have 500-1500 words of descriptive text that AI crawlers can parse for content understanding.
- No negative external signals or spam associations — Web searches found no negative reviews, complaints, or spammy backlinks. The brand's online reputation is clean, which avoids penalties in AI model training data.
- Recent EdReports all-green ratings for key products — Lenses on Literature (Mar 2026) and ClearMath Elementary (Dec 2025) received all-green EdReports ratings, providing strong third-party validation that can be leveraged in structured data and AI content.
- Reliable infrastructure with Cloudflare CDN and HubSpot CMS — DNS records show CloudFront/Route53 for CDN, HubSpot for CMS, and enterprise tools like Salesforce and Marketo. This ensures fast, reliable delivery of content to AI crawlers.
Track carnegielearning.com across AI search
This is one snapshot. Open the interactive report to inspect evidence, or grade another site free.