AI Site Grade
answerdigital.com — AI Site Grade
Answer Digital's site is fully crawlable by AI bots but invisible to search engines, with zero external citations and generic schema that fails to surface its strongest content.
Answer Digital's site is technically accessible to all AI crawlers but lacks sitemap, llms.txt, and any search engine presence, while its rich case studies and employee ownership story are buried under generic schema and absent from LLM prior knowledge.
- Findings
- 10
- Evidence checks
- 24
- Completed
- 30 May 2026
Analysis
AI crawlers see the full site, but no search engine can find it
Every AI bot tested — GPTBot, ClaudeBot, PerplexityBot, Google-Extended, OAI-SearchBot, Bytespider, Applebot-Extended — receives a 200 with the identical 403KB HTML payload that a browser gets. Cloudflare sits in front but does not UA-block any crawler. The robots.txt exists (200, 1,248 bytes) but contains zero user-agent directives — only a legal preamble about EU copyright content signals (search/ai-input/ai-train). No AI bot is explicitly allowed or disallowed; the file is effectively inert for crawler governance.
Sitemap and llms.txt are missing; schema is generic
/sitemap.xml returns 404. /llms.txt returns 404 (a Gatsby JS error page). The site runs on Gatsby 5.16.0 (static-site generator) hosted on Cloudflare, so all pages are pre-rendered HTML — no JS-rendering risk for crawlers. However, the JSON-LD schema is minimal and identical on every page: only WebSite, WebPage (with a wildcard @id of https://answerdigital.com/*/#webpage), and Organization (address, logo, URL). No Article, FAQPage, HowTo, Product, Service, BreadcrumbList, or LocalBusiness schema exists anywhere. Case studies, industry pages, and insight articles all carry the same three generic schema types. The wildcard @id (/*/#webpage) is technically invalid and may confuse schema parsers.
Cold-knowledge gap: the model knows a different company
The LLM's cold knowledge describes Answer Digital as a 1997-founded consultancy with a "Data Platform as a Service" offering, Microsoft Gold Partner status, and acquisition growth (buying a data consultancy in 2021). The actual site says none of this. The site presents Answer as employee-owned since 2020 (an Employee Ownership Trust), founded by Gary Parlett, with a focus on regulated-sector AI, data integration, and engineering delivery — primarily NHS healthcare, financial services, and utilities. The founding year (1997), DPaaS product name, Microsoft Gold Partner status, and acquisition history are absent from the live site. The site's strongest differentiator — employee ownership — is entirely missing from the model's prior knowledge.
External signals are near-zero in search
DuckDuckGo returns zero results for answerdigital.com, "Answer Digital" NHS, "Answer Digital" Leeds, and "Answer Digital" employee owned. No external reviews, Reddit threads, or press articles surfaced. The careers subdomain (careers.answerdigital.com) references a Glassdoor page but no Glassdoor data was findable via search. The Wayback Machine has no snapshot of the homepage. The site has strong case study content (21 detailed case studies across healthcare, finance, PE, public sector) and a blog with recent articles (May 2025 employee ownership piece, National Digital Twin contract), but none of this content is discoverable through general web search — a severe external-signal deficit that limits the citation surface AI engines can draw from.
Content quality is high but structurally invisible to AI
The homepage and case studies contain rich, specific content: named clients (NHS, Yorkshire Water, NVIDIA, King's College Hospital, OneLondon, Fintel IQ), quantified outcomes (GBP 90bn assets migrated, 10.6 million patients, 7.6% NHS vacancy rate, 29% radiologist shortfall), and named frameworks (MONAI Deploy, AIDE, OMOP, NDTP). The AIDE case study won the UKIT Award for AI Project of the Year 2024. Yet none of this is surfaced through FAQ schema, HowTo schema, or any structured data beyond the generic Organization block. The site has no FAQ pages, no comparison tables, and no definition-style content that AI answer engines preferentially cite.
Findings
Zero search engine results for the domain or brand High
DuckDuckGo returns zero results for the domain, brand name, and key queries like 'Answer Digital NHS' or 'Answer Digital employee owned'. No external reviews, press articles, or social mentions are indexed. This severe external-signal deficit limits the citation surface AI engines can draw from.
What to change: Build a backlink strategy, publish press releases, and claim business listings on platforms like Crunchbase, Clutch, and G2 to generate indexed external references.
Sitemap.xml returns 404, blocking search discovery High
The sitemap at /sitemap.xml returns a 404 error. Without a sitemap, search engines and AI crawlers have no structured way to discover all pages, reducing crawl efficiency and content indexing.
What to change: Generate and submit an XML sitemap covering all important pages (case studies, insights, industry pages) to Google Search Console and Bing Webmaster Tools.
llms.txt file missing, no AI guidance provided Medium
The /llms.txt endpoint returns a 404 (Gatsby error page). This file is used by AI tools to understand site structure and content; its absence means AI crawlers get no curated guidance.
What to change: Create an llms.txt file that lists key pages, a brief site description, and suggested entry points for AI crawlers.
Generic JSON-LD schema on every page, no specialized types High
All pages use only WebSite, WebPage, and Organization schema with identical minimal properties. No Article, FAQPage, HowTo, Service, or BreadcrumbList schema exists. Case studies and insights lack structured data to surface their content in AI answer engines.
What to change: Add Article schema to blog posts, FAQPage schema to any Q&A content, and Service schema for each service offering. Use BreadcrumbList on all pages.
Wildcard @id in schema markup is technically invalid Medium
The WebPage schema uses a wildcard @id of 'https://answerdigital.com/*/#webpage', which is not a valid URL and may confuse schema parsers or cause validation errors.
What to change: Replace the wildcard @id with the actual page URL for each page, or use a placeholder like 'https://answerdigital.com/#webpage'.
LLM cold knowledge contradicts site content on key facts High
The LLM's prior knowledge describes Answer Digital as founded in 1997 with a 'Data Platform as a Service' product and Microsoft Gold Partner status, none of which appear on the live site. The site emphasizes employee ownership since 2020 and regulated-sector AI, which is absent from the model's knowledge.
What to change: Ensure the homepage and about page explicitly state founding year, employee ownership, and key differentiators in clear text. Publish a company fact sheet or about page with structured data.
Robots.txt contains no user-agent directives Low
The robots.txt file exists but has zero rules for any crawler. It only includes a legal preamble about EU copyright signals. This means no AI bot is explicitly allowed or disallowed, leaving governance ambiguous.
What to change: Add explicit directives for AI crawlers (e.g., allow all) and include a sitemap reference.
No FAQ or definition-style content for AI answer engines Medium
The site lacks FAQ pages, comparison tables, or glossary-style definitions that AI answer engines preferentially cite. Rich case study content exists but is not structured for easy extraction.
What to change: Create FAQ pages for common questions about services, employee ownership, and industries served. Use FAQPage schema.
No Wayback Machine snapshot of the homepage Low
The Wayback Machine has no archived snapshot of the homepage, indicating low historical web presence and limited external linking.
What to change: Encourage external sites to link to answerdigital.com; submit the site to the Wayback Machine manually.
Careers subdomain not indexed in search Medium
The careers subdomain (careers.answerdigital.com) is not appearing in search results, and no Glassdoor data was found despite a reference on the site.
What to change: Ensure the careers subdomain is included in the sitemap and has proper internal linking from the main site. Claim the Glassdoor profile.
What's working
- All AI crawlers receive full HTML content — Every tested AI bot (GPTBot, ClaudeBot, PerplexityBot, etc.) receives a 200 response with the same 403KB HTML as a browser. Cloudflare does not block any crawler, and robots.txt has no disallow rules.
- Gatsby static site serves pre-rendered HTML — The site is built with Gatsby 5.16.0, a static-site generator, so all pages are pre-rendered HTML. There is no JavaScript rendering risk for crawlers.
- Detailed case studies with quantified outcomes — The site contains 21 detailed case studies with named clients (NHS, NVIDIA, Yorkshire Water) and specific metrics (GBP 90bn assets, 10.6 million patients). The AIDE case study won UKIT Award for AI Project of the Year 2024.
- Employee ownership story clearly communicated — The site prominently features employee ownership (Employee Ownership Trust since 2020) on the homepage and in an insight article. This is a strong differentiator that, if surfaced, could improve AI citations.
- Organization schema with address and logo — Every page includes Organization schema with address, logo, and URL, providing basic entity information to search engines and AI.
- Contact page with clear information — The contact page provides address, phone, and email, making it easy for AI to extract business contact details.
Track answerdigital.com across AI search
This is one snapshot. Open the interactive report to inspect evidence, or grade another site free.