AI Site Grade
jerry.ai — AI Site Grade
Jerry.ai's robots.txt uses an experimental non-standard directive while failing to name any AI bot user-agent, yet all major AI crawlers receive full HTML content with no restrictions.
Jerry.ai has strong AI crawler access and rich content but suffers from a non-standard robots.txt, missing schema types, and a cold-knowledge gap between its funding narrative and actual product branding.
- Findings
- 8
- Evidence checks
- 24
- Completed
- 30 May 2026
Analysis
---
Jerry.ai: AI-Visibility Audit
The site's robots.txt contains a non-standard Content-Signal: ai-train=yes, search=yes, ai-input=yes directive — an experimental header not recognized by any major AI crawler — while simultaneously failing to name a single AI bot user-agent, leaving all AI crawlers to fall through to the catch-all * rule with no AI-specific restrictions.
Crawler Access
Every major AI crawler — GPTBot, ClaudeBot, PerplexityBot, Google-Extended, OAI-SearchBot, anthropic-ai, Bytespider, Applebot-Extended, ChatGPT-User, Perplexity-User — receives a 200 with full HTML content identical to browser delivery. No UA-based blocking, no Cloudflare challenge, no JS shell. The site runs on AWS CloudFront behind WordPress, serving server-rendered pages with 43 KB+ of visible text per page. AI crawlers can ingest the full content corpus without JavaScript rendering risk. The robots.txt disallows /wp-admin/, /login, /admin, and /cms — standard WordPress hygiene — but imposes no crawl-rate limits or AI-bot exclusions.
llms.txt and Sitemap Posture
An /llms.txt exists and returns 200 with 337 KB of content, auto-generated by Rank Math SEO. It lists every post and page with descriptions, functioning as a comprehensive AI content map. The sitemap index contains 8 sub-sitemaps covering posts, pages, car insurance, car repair, newsroom, reviews, categories, and local pages — approximately 900+ indexed URLs. This is an unusually complete AI-friendly content surface.
Cold-Knowledge Gap
The LLM's prior knowledge describes Jerry as an "AI-powered insurance comparison platform" with "over $100 million in funding from Goodwater Capital and Bow Capital." The actual site never describes itself as "AI-powered" — the homepage positions Jerry as an "insurance & car care" app with three branded products: PriceProtect (insurance comparison), GarageGuard (repair/maintenance), and DriveShield (driver safety). The cold model knows about the funding narrative but not the product-line branding. The site also reveals a secondary domain — getjerry.com — listed in sameAs schema, which the cold knowledge does not reference.
Schema Posture
The site deploys rich JSON-LD schema across every page: Organization, ContactPoint, PostalAddress, FAQPage, WebSite, WebPage, BreadcrumbList, and Article. The Organization block includes founders (Art Agrawal, Lina Zhang, Musawir Shah), founding date (2017), legal name (Jerry Services, Inc.), and makesOffer for FinancialProduct (car, home, motorcycle insurance) and Service (GarageGuard). The FAQPage schema is duplicated across the homepage, /priceprotect/, /garageguard/, and /car-insurance/ with overlapping questions. No Product or SoftwareApplication schema is used despite the app being the core delivery mechanism. No Review schema marks up the customer testimonials despite multiple review blocks on the homepage and /reviews/.
Content and Answer Signals
The homepage and key pages are rich in AI-friendly answer signals: FAQ sections (schema-encoded), comparison language ("side-by-side", "compare 100+ insurers"), definition patterns, and step-by-step numbered lists (the "How Jerry Works" flow). The /car-insurance/ page contains a coverage-type comparison table (BI, PD, collision, comprehensive, etc.) with plain-English explanations. Content is authored by named writers with credentials (Ben Moore, ex-NerdWallet; Stephanie Colestock, CFEI), and reviewed by licensed agents — a trust signal AI engines can extract. Blog posts carry future-dated publication timestamps (e.g., "May 26, 2026"), suggesting dynamic or scheduled content freshness.
External Signals
The site links to Trustpilot (trustpilot.com/review/jerry.ai), BBB (bbb.org/.../jerry-1216-718137), and CNBC Select (cnbc.com/select/jerry-car-insurance-comparison-shopping-review/) from the "Is Jerry Legit?" page. A Reddit community (reddit.com/r/jerryinsurance/) exists. The DNS TXT records include SendGrid and Google SPF, plus Google and Facebook site verification tokens. The site references YCombinator (founded 2017, YC Top Companies list) and lists awards from Comparably, Forbes/Statista, and LinkedIn Top Startups — all of which are crawlable signals that shape AI model priors about the brand's credibility.
Findings
Non-standard robots.txt directive not recognized by AI crawlers Medium
The robots.txt contains a custom 'Content-Signal: ai-train=yes, search=yes, ai-input=yes' directive that no major AI crawler understands, while failing to name any AI bot user-agent, leaving all AI crawlers to fall through to the catch-all rule.
What to change: Replace the non-standard directive with standard user-agent rules for AI bots (e.g., GPTBot, ClaudeBot) and consider adding crawl-delay or disallow rules as needed.
No AI bot user-agents defined in robots.txt Medium
The robots.txt does not explicitly name any AI crawler user-agents, meaning all AI bots are governed only by the catch-all '*' rule with no AI-specific restrictions or allowances.
What to change: Add explicit user-agent directives for major AI crawlers (e.g., GPTBot, ClaudeBot, Google-Extended) to control access and crawling behavior.
Cold LLM knowledge does not reflect current product branding Medium
The LLM's prior knowledge describes Jerry as an 'AI-powered insurance comparison platform' with funding details, but the site itself never uses 'AI-powered' and instead brands three products: PriceProtect, GarageGuard, and DriveShield. The cold model is unaware of this product-line structure.
What to change: Explicitly describe the product-line branding (PriceProtect, GarageGuard, DriveShield) in prominent page copy and structured data to align AI model priors with current offerings.
No SoftwareApplication schema despite app-based delivery Medium
The site uses Organization, WebSite, and FAQPage schema but lacks SoftwareApplication schema, even though the core product is a mobile app. This limits AI understanding of the app's functionality and platform availability.
What to change: Add SoftwareApplication schema to the homepage and product pages, specifying applicationCategory, operatingSystem, and offers.
Customer testimonials lack Review schema markup Medium
Multiple review blocks appear on the homepage and /reviews/ page, but none are marked up with Review schema. This prevents AI crawlers from extracting structured review data.
What to change: Add Review schema markup to customer testimonials, including itemReviewed, reviewRating, author, and datePublished.
Duplicate FAQPage schema across multiple pages Low
The same FAQPage schema with overlapping questions appears on the homepage, /priceprotect/, /garageguard/, and /car-insurance/ pages, which may confuse AI crawlers and dilute the value of each page's unique content.
What to change: Ensure each page's FAQPage schema contains unique questions relevant to that page's content, avoiding duplication.
Blog posts show future-dated publication timestamps Low
Some blog posts have publication dates in the future (e.g., 'May 26, 2026'), which may be dynamic or scheduled content. This can confuse AI crawlers about content freshness and credibility.
What to change: Ensure publication dates reflect actual publish dates; avoid future dates unless content is explicitly scheduled and marked as such.
Secondary domain getjerry.com not referenced in cold knowledge Low
The site lists getjerry.com in sameAs schema, but the cold LLM knowledge does not reference this domain, potentially fragmenting brand identity across AI models.
What to change: Ensure consistent brand representation across domains and consider canonicalization or explicit cross-referencing in structured data.
What's working
- All major AI crawlers receive full HTML content — Every tested AI crawler (GPTBot, ClaudeBot, PerplexityBot, etc.) receives a 200 response with full server-rendered HTML, no JavaScript shell, and no blocking. This ensures AI crawlers can ingest the complete content corpus.
- Comprehensive llms.txt with 337 KB of content — An /llms.txt file exists and returns 337 KB of auto-generated content listing all posts and pages with descriptions, serving as a complete AI content map.
- Rich JSON-LD schema deployed across all pages — Every page includes Organization, ContactPoint, PostalAddress, FAQPage, WebSite, WebPage, BreadcrumbList, and Article schema, providing structured context for AI crawlers.
- AI-friendly content signals with FAQs, comparisons, and step-by-step lists — Key pages contain FAQ sections with schema, comparison language, definition patterns, and numbered step-by-step flows, all of which are strong answer signals for AI crawlers.
- Content authored by named writers with credentials — Articles are written by named authors with relevant backgrounds (e.g., Ben Moore, ex-NerdWallet) and reviewed by licensed agents, providing trust signals that AI engines can extract.
- External trust signals from Trustpilot, BBB, CNBC, and Reddit — The site links to Trustpilot, BBB, CNBC Select, and a Reddit community, providing crawlable third-party validation that shapes AI model priors about brand credibility.
- Comprehensive sitemap index with 8 sub-sitemaps — The sitemap index contains 8 sub-sitemaps covering posts, pages, car insurance, car repair, newsroom, reviews, categories, and local pages, totaling approximately 900+ indexed URLs.
- Awards and recognition from Comparably, Forbes, and LinkedIn are crawlable — The site mentions awards from Comparably, Forbes/Statista, and LinkedIn Top Startups, which are crawlable signals that enhance brand credibility in AI models.
Track jerry.ai across AI search
This is one snapshot. Open the interactive report to inspect evidence, or grade another site free.