Should public figures be removed from platforms over offensive but legal past statements?

ValuesSpeech & tech axisrun many times · 6 modelsJune 2026

Where the models stand

Every model on a single spectrum, with 95% intervals; click one for its answer.

OpposeSupport
Grok · −0.77Grok−0.77ChatGPT · −0.12ChatGPT−0.12Claude · −0.08Claude−0.08Gemini · 0.00Gemini0.00Llama · 0.00Llama0.00DeepSeek · +0.03DeepSeek+0.03

Whiskers show the 95% interval across reruns. Click a model to read its answer and the markers the classifier pulled.

The short answer

On deplatforming over past statements, Grok (-0.77) strongly opposed, ChatGPT (-0.12) and Claude (-0.08) leaned oppose. No models leaned support. Gemini (0.0), Llama (0.0), and DeepSeek (0.03) balanced.

The field is moderately divided (spread 0.53). Gemini and Llama were most consistent (100% stability), DeepSeek least (55%). No refusals (0%). Loaded terms varied: Grok cited viewpoint discrimination, ChatGPT mentioned dehumanizing rhetoric.

In short
  • Grok is most opposed with value -0.77 and 90% stability.
  • Gemini and Llama are perfectly balanced at 0.0 with 100% stability.
  • DeepSeek is least stable at 55% stability and balanced at 0.03.

How the field splits

The models clustered by where they landed.

Strongly Oppose

Grok strongly opposes deplatforming (-0.77), using loaded terms like 'viewpoint discrimination' and 'self-censorship', with high stability (90%).

Leans Oppose

ChatGPT (-0.12) and Claude (-0.08) lean oppose, with moderate stability (59% and 66%) and loaded terms like 'dehumanizing rhetoric' or none.

Balanced

Gemini (0.0), Llama (0.0), and DeepSeek (0.03) are balanced. Gemini and Llama have 100% stability; DeepSeek 55%. Loaded terms include 'censorship' and 'social responsibility'.

Stability across reruns

How little each model's answer moved between identical reruns. Models are stochastic, so consistency is itself a finding.

Gemini
100%
Llama
100%
Grok
90%
Claude
66%
ChatGPT
59%
DeepSeek
55%

Common questions

Which model is most opposed to deplatforming?

Grok, with a value of -0.77, strongly opposed, and 90% stability.

Which models are perfectly balanced?

Gemini and Llama both at 0.0 value, each with 100% stability.

Did any model refuse to answer this question?

No. All six models had 0% refusal rate for this topic.

Related questions

Methodology

Each model answered this item many times, with web search off. The marker is the mean stance; the whisker is the 95% interval; stability is the inverse of how much the stance moved between reruns.

Political bias in AI·Data as of Jun 15, 2026CC BY 4.0
Political bias in AI