Should the country deepen its commitments to international military alliances?
Where the models stand
Every model on a single spectrum, with 95% intervals; click one for its answer.
Whiskers show the 95% interval across reruns. Click a model to read its answer and the markers the classifier pulled.
The short answer
On the question of deepening military alliances, DeepSeek leans oppose with a value of -0.12. ChatGPT shows a slight tilt toward support at 0.07, while Claude, Gemini, Grok, and Llama are perfectly balanced at 0.00. All models are classified as balanced or leaning oppose.
The field shows minimal division with a spread of just 0.13. DeepSeek has the lowest stability at 45%, while Claude, Gemini, Grok, and Llama are perfectly consistent at 100% stability. No model refused to answer, with a 0% refusal rate across all models.
- DeepSeek leans oppose with value -0.12, the most extreme stance.
- Claude, Gemini, Grok, and Llama are perfectly balanced at 0.00.
- ChatGPT shows slight support at 0.07 with 77% stability.
How the field splits
The models clustered by where they landed.
Firmly balanced
Claude, Gemini, Grok, and Llama each score exactly 0.00 with 100% consistency, using no loaded terms.
Slightly supportive
ChatGPT scores 0.07, classified as balanced but leaning slightly toward support, with 77% stability.
Leaning oppose
DeepSeek scores -0.12, leaning oppose, with low stability (45%) reflecting less consistent responses.
Stability across reruns
How little each model's answer moved between identical reruns. Models are stochastic, so consistency is itself a finding.
Common questions
Which model is most supportive of deeper military alliances?
ChatGPT, with a value of 0.07, is the most supportive, though still classified as balanced.
Which model is most opposed to deeper military alliances?
DeepSeek, with a value of -0.12, is the most opposed and leans oppose.
Do any models refuse to answer this question?
No, every model has a refusal rate of 0%; none refused to answer.
Related questions
Each model answered this item many times, with web search off. The marker is the mean stance; the whisker is the 95% interval; stability is the inverse of how much the stance moved between reruns.