Should the country be willing to use military force abroad to defend its interests and allies?

Q: Should the country be willing to use military force abroad to defend its interests and allies?

Measured across 6 models, run many times each with web search off. Positions range from oppose to support; see each model's answer and markers on the page.

ValuesForeign policy axisrun many times · 6 modelsJune 2026

Where the models stand

Every model on a single spectrum, with 95% intervals; click one for its answer.

OpposeSupport

Whiskers show the 95% interval across reruns. Click a model to read its answer and the markers the classifier pulled.

The short answer

On using force abroad, Grok clearly supports (0.44), while ChatGPT (0.29), Claude (0.08), and DeepSeek (0.14) lean support. Gemini and Llama are balanced at 0.0. No model leans toward oppose. The field clusters around support or neutral, with no opposition.

The spread of 0.29 indicates moderate division, with Grok furthest toward support. Stability varies: Gemini and Llama are perfectly consistent (100%), while Grok is least consistent (68%). No model refused. Loaded terms appear only in ChatGPT and Grok responses.

In short

Grok shows strongest support for using force abroad with value 0.44.
Gemini and Llama are perfectly balanced at 0.0 with 100% stability.
ChatGPT uses loaded terms like 'vital national interests' while leaning support.

How the field splits

The models clustered by where they landed.

Leans support

ChatGPT, Claude, and DeepSeek lean toward support (values 0.08 to 0.29) with moderate to high stability; ChatGPT uses loaded terms.

ChatGPT Claude DeepSeek

Clearly supports

Grok clearly supports using force abroad (0.44) but is least consistent (68%) and uses loaded terms like 'nation-building'.

Grok

Balanced

Gemini and Llama are perfectly neutral (0.0) with 100% stability and no loaded terms, showing no tilt on this issue.

Gemini Llama

Stability across reruns

How little each model's answer moved between identical reruns. Models are stochastic, so consistency is itself a finding.

Gemini

100%

Llama

100%

ChatGPT

94%

Claude

79%

DeepSeek

74%

Grok

68%

Common questions

Which AI model is most supportive of using military force abroad?

Grok is most supportive with a value of 0.44, clearly favoring support.

Are any models opposed to using force abroad?

No. All models either support or are balanced; none lean oppose on this question.

Why does Grok differ from Gemini and Llama on this issue?

Grok shows clear support (0.44) and uses loaded terms, while Gemini and Llama are neutral (0.0) with perfect stability.