How often is there perfect consensus across models? | Trakkr Research

Rarely. Only 4.0% of prompts produced unanimous agreement across all 8 models in the study.

Methodology: Built from 797,644 valid comparisons across 44,088 reports and 8 models, covering 6,439,133 model responses in the observed window.

Direct Answer

Rarely. Only 4.0% of prompts produced unanimous agreement across all 8 models in the study.

What this means

Relying on a single model for visibility measurement creates blind spots. Operators must measure across multiple engines to accurately assess brand presence and allocate optimization resources.

Evidence table

Metric	Value	Why it matters
Perfect agreement	4.0%	Only a small share of prompts produce unanimous outcomes.
Models analyzed	8	OpenAI, Anthropic, Gemini, Grok, Deepseek, Meta, Perplexity, and Google AI Overviews.
Valid comparisons	797,644	Cross-model recommendation comparisons in the study.

Frequently Asked Questions

Which models were included in the consensus analysis?

The study analyzed 8 models including OpenAI, Anthropic, Gemini, Grok, Deepseek, Meta, Perplexity, and Google AI Overviews.

How many comparisons were used to determine the consensus rate?

The 4.0% perfect agreement rate is based on 797,644 valid cross-model recommendation comparisons.

What to do next

Continue through the same study cluster.

how much do models disagree on brand recommendations - Related answer page
which query types produce the most consensus - Related answer page
only four percent of prompts produce perfect consensus - Related fact page
cross model consensus tracker - Related tracker page

Data & Sources

Same Question, Different AI, Different Answers - Flagship study behind this page
Page JSON - Machine-readable companion file