AI Site Grade

phase2.io — AI Site Grade

Phase2.io's canonical URL loops back to itself, creating a redirect chain that confuses AI crawlers.

Phase2.io has strong AI crawler access and domain verification, but its canonical loop, stale Drupal content, missing product schema, and a cold-knowledge gap undermine its AI visibility.

Findings
9
Evidence checks
27
Completed
30 May 2026

Analysis

The canonical homepage URL redirects to itself — a self-referencing canonical pointing to a slug that 301s back to root

The homepage at https://phase2.io declares a canonical URL of https://phase2.io/phase2-is-your-technology-partner-for-ai-that-works — but that URL itself issues a 301 redirect back to https://phase2.io/. This creates a canonical loop: the page says its canonical is a URL that redirects to itself. AI crawlers and search engines following the canonical will hit a redirect, not the content.

Crawler Access

All major AI crawlers — GPTBot, ClaudeBot, PerplexityBot, Google-Extended, OAI-SearchBot, ChatGPT-User, anthropic-ai, Bytespider, Applebot-Extended — receive a full 200 response with identical byte size (56,399 bytes) to a browser baseline. No UA-based blocking exists. The site runs on nginx behind Pantheon (via x-pantheon-styx-hostname header) with Varnish caching. The robots.txt contains zero AI-bot-specific directives — only a generic User-agent: * rule blocking Drupal admin paths. No llms.txt exists (404). The X-Generator header reveals Drupal 11.

Cold-Knowledge Gap

The LLM's prior knowledge describes Phase2 as a Drupal-centric digital agency serving government, higher ed, and non-profits, with a flagship Drupal distribution and a 2021 Waud Capital acquisition. The actual site positions itself as "your technology partner for AI that works" with a primary focus on healthcare and pharma — AI solutions, ProfileIQ, Pharma iCX, Health iCX. The Drupal heritage, government clients, and acquisition are entirely absent from the current site messaging. The cold knowledge is approximately 5-7 years stale, describing a company that no longer matches the site's self-presentation.

Schema Posture

Every page uses a minimal @graph structure containing Article (or BlogPosting/WebSite) and WebPage with BreadcrumbList. No Organization schema with logo, social profiles, or founding date exists on the homepage — the WebSite schema has a publisher of type Organization but it contains only name and url. No Product or SoftwareApplication schema exists for ProfileIQ, the AI-powered product. No FAQPage schema anywhere despite the site using question-based headings. No LocalBusiness or Service schema for the agency's offerings.

Content Archaeology

The sitemap contains 551 URLs, the vast majority of which are stale Drupal blog posts from 2014-2018 — posts about Drupal 7 module development, DrupalCon 2017, and AngularJS code reuse. These outnumber current AI/healthcare content by roughly 10:1. The blog section (/blog/) mixes 2017 Drupal tutorials alongside 2025 AI announcements with no content-type differentiation. The /insights page shows recent AI-focused articles (SXSW 2026, enterprise AI) but the sitemap still surfaces the old blog URLs with equal weight.

External Signals

DNS TXT records confirm OpenAI domain verification (openai-domain-verification=dv-B6gqbzHBEH01C09eSt9mr1X9) and Anthropic domain verification (anthropic-domain-verification-cdwn05=...), indicating the site has proactively registered with both major AI platforms for crawling. No external press or review results surfaced in search for the current AI/healthcare positioning — the brand's external footprint remains dominated by its legacy Drupal identity.

Findings

  1. Homepage canonical URL creates redirect loop High

    The homepage declares a canonical URL that 301-redirects back to itself, causing AI crawlers to follow a redirect instead of directly indexing the content.

    What to change: Set the homepage canonical to 'https://phase2.io' and remove the redirect from the slug URL.

  2. No llms.txt file for AI crawlers Medium

    The site returns a 404 for /llms.txt, missing an opportunity to provide structured guidance to AI crawlers about which pages to index and how to interpret content.

    What to change: Create an llms.txt file that lists key pages (homepage, offerings, AI in Practice) and provides a brief site summary.

  3. Stale Drupal blog posts outnumber current AI content 10:1 High

    The sitemap contains 551 URLs, mostly Drupal blog posts from 2014-2018, which overwhelm the small number of recent AI/healthcare articles. This dilutes the site's topical authority for AI and healthcare queries.

    What to change: Remove or noindex outdated blog posts, or restructure the sitemap to prioritize current AI and healthcare content.

  4. Homepage lacks Organization schema with key details Medium

    The WebSite schema's publisher Organization only includes name and URL, missing logo, social profiles, founding date, and other signals that help AI systems understand the entity behind the site.

    What to change: Add a complete Organization schema on the homepage with logo, sameAs URLs, founding date, and description.

  5. ProfileIQ product page has no SoftwareApplication schema High

    The ProfileIQ page describes an AI-powered product but lacks structured data to help AI crawlers understand it as a software product, reducing its visibility in AI-generated answers.

    What to change: Add SoftwareApplication schema to the ProfileIQ page with name, description, applicationCategory, and offers.

  6. LLM prior knowledge describes a 5-7 year old version of Phase2 High

    AI models recall Phase2 as a Drupal agency for government and non-profits, while the site now positions itself as an AI partner for healthcare. This mismatch means AI systems may answer queries with outdated information.

    What to change: Publish authoritative content (case studies, whitepapers) about the healthcare AI focus and ensure it is well-linked and cited to update AI training data.

  7. Question-based headings lack FAQPage schema Medium

    Several pages use question-style headings (e.g., 'What is ProfileIQ?') but do not mark them up with FAQPage schema, missing an opportunity to appear in AI-generated answer snippets.

    What to change: Add FAQPage schema to pages with question-answer content, wrapping each Q&A pair.

  8. No external press or reviews for current AI positioning Medium

    Web searches for Phase2's AI and healthcare focus return zero results, meaning the brand's external footprint does not reinforce its new messaging, making it harder for AI systems to validate claims.

    What to change: Earn coverage in healthcare and AI industry publications, and publish case studies with client logos to build external authority.

  9. Blog and insights sections mix old and new content without differentiation Low

    The /blog/ path contains both 2017 Drupal tutorials and 2025 AI announcements, while /insights/ has recent AI articles. This lack of content-type separation confuses AI crawlers about the site's primary expertise.

    What to change: Create separate content hubs (e.g., /resources/ai) and use category taxonomy to signal topical focus to crawlers.

What's working

  • All major AI crawlers allowed with no UA blocking — GPTBot, ClaudeBot, PerplexityBot, and others receive full 200 responses identical to browser access, ensuring AI systems can index all public content.
  • Domain verified with OpenAI and Anthropic for crawling — DNS TXT records confirm domain verification for both OpenAI and Anthropic, indicating proactive registration to allow and prioritize crawling by these AI platforms.
  • Robots.txt has no AI-bot blocking directives — The robots.txt only blocks Drupal admin paths for all user agents, with no disallow rules for any AI crawler, ensuring full access.
  • Basic schema markup present on all pages — Every page includes a @graph structure with Article/BlogPosting/WebSite and WebPage with BreadcrumbList, providing a foundation for AI understanding.
  • Dedicated AI and healthcare content pages exist — Pages like /ai-in-practice, /profileiq, and blog posts about Pharma iCX provide substantive content about the company's AI and healthcare focus.
  • Site runs on Drupal 11 with nginx and Varnish caching — The use of Drupal 11, nginx, and Varnish indicates a modern, performant infrastructure that can handle crawler traffic efficiently.

Track phase2.io across AI search

This is one snapshot. Open the interactive report to inspect evidence, or grade another site free.

Open this AI Site Grade Grade another site Track your brand