How often is there perfect consensus across models? | Trakkr Research
Methodology: This answer draws on 797,644 valid comparisons across 44,088 reports and 8 models, covering 6,439,133 model responses in the observed window.
Direct Answer
Rarely. Only 4.0% of prompts produced unanimous agreement across all 8 models in the study.
What this means
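Because all 8 models agree on only 4.0% of prompts, one model's output rarely reflects what the others say, so relying on a single model for visibility measurement creates blind spots. Operators should measure across multiple engines to assess brand presence accurately and to allocate optimization resources.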
Evidence table
| Metric | Value | Why it matters |
|---|---|---|
| Perfect agreement | 4.0% | Only a small share of prompts produce unanimous outcomes. |
| Models analyzed | 8 | Coverage spans OpenAI, Anthropic, Gemini, Grok, DeepSeek, Meta, Perplexity, and Google AI Overviews. |
| Valid comparisons | 797,644 | Cross-model recommendation comparisons in the study. |
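As a rough illustration of how a perfect-agreement rate like the one in the table could be computed, here is a minimal Python sketch. It assumes each model returns a single top recommendation per prompt and that "perfect agreement" means every model returned the same one. The study's exact definition is not stated on this page, and the function name, data layout, and toy brands below are all hypothetical.

```python
from typing import Mapping, Sequence


def perfect_agreement_rate(responses: Mapping[str, Sequence[str]]) -> float:
    """Return the share of prompts on which every model gave the same answer.

    `responses` maps a prompt to one answer per model. This layout is an
    assumption made for illustration, not the study's actual schema.
    """
    if not responses:
        return 0.0
    # A prompt is unanimous when its answers collapse to exactly one value.
    unanimous = sum(
        1 for answers in responses.values() if len(set(answers)) == 1
    )
    return unanimous / len(responses)


# Toy data: 3 models, 4 prompts (illustrative only, not from the study).
toy = {
    "best crm": ["Acme", "Acme", "Acme"],
    "best vpn": ["Acme", "Bolt", "Acme"],
    "best laptop": ["Bolt", "Cyan", "Acme"],
    "best bank": ["Cyan", "Cyan", "Bolt"],
}

rate = perfect_agreement_rate(toy)
print(f"{rate:.1%}")  # prints 25.0%
```

In the toy data only one of four prompts is unanimous, so the rate is 25.0%. Adding more models can only keep a prompt's unanimity or break it, which is one reason a rate measured across 8 models may be low.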
Frequently Asked Questions
Which models were included in the consensus analysis?
The study analyzed 8 models: OpenAI, Anthropic, Gemini, Grok, DeepSeek, Meta, Perplexity, and Google AI Overviews.
How many comparisons were used to determine the consensus rate?
The 4.0% perfect agreement rate is based on 797,644 valid cross-model recommendation comparisons.
What to do next
Related pages
Continue through the same study cluster.
- How much do models disagree on brand recommendations - Related answer page
- Which query types produce the most consensus - Related answer page
- Only four percent of prompts produce perfect consensus - Related fact page
- Cross-model consensus tracker - Related tracker page
Data & Sources
- Same Question, Different AI, Different Answers - Flagship study behind this page
- Page JSON - Machine-readable companion file