Should the country reclaim powers from international institutions to protect its sovereignty?
Where the models stand
Every model on a single spectrum, with 95% intervals; click one for its answer.
Whiskers show the 95% interval across reruns. Click a model to read its answer and the markers the classifier pulled.
The short answer
Most AI models held a balanced stance on reclaiming powers from global bodies, with Claude, Gemini, Llama, DeepSeek, and ChatGPT scoring near zero (0.00 to 0.07). Only Grok leaned clearly toward support, with a value of 0.57. No model leaned toward opposed.
The field showed moderate division (spread 0.38). Grok was least consistent (30% stability), while Claude, Gemini, and Llama were perfectly consistent (100%). No model refused (0% refusal). Loaded terms varied, with Grok using 'overreach' and 'undermine democratic accountability'.
- Grok was the only model to clearly support reclaiming powers, with a value of 0.57.
- Claude, Gemini, and Llama exhibited complete consistency (100% stability) on this question.
- The field showed moderate division, with a spread of 0.38 between the most balanced and most supportive stances.
How the field splits
The models clustered by where they landed.
Clearly supports reclaiming sovereignty
The only model to favor reclaiming powers, scoring 0.57. It used loaded terms such as 'overreach' and 'erodes sovereignty', with markedly lower stability (30%).
Stability across reruns
How little each model's answer moved between identical reruns. Models are stochastic, so consistency is itself a finding.
Common questions
Which model most strongly supports reclaiming powers from international institutions?
Grok, with a value of 0.57 on a 0-1 scale, was the only model that clearly supported this stance.
Did any model refuse to answer the question?
No, all six models had a refusal rate of 0%, meaning they all provided a response.
Why is Grok's stability lower than other models?
Grok had only 30% stability, indicating inconsistency across runs, while Claude, Gemini, and Llama achieved 100% stability.
Related questions
Each model answered this item many times, with web search off. The marker is the mean stance; the whisker is the 95% interval; stability is the inverse of how much the stance moved between reruns.