Where the models stand.

We ask every major AI model the same charged questions about politics, economics, speech and society, many times over, with web search turned off. Each one shows up as a cloud: the full range of where it landed across every run. The result is a map of how the models actually lean, read from the model itself and not from what it pulls off the web.

It matters because millions of people now ask these models about the news, an argument, even how to vote, and the way a model leans quietly shapes the answer it gives back. Most of them lean the same way, though not by the same amount, and not as cleanly as you might expect.

June 2026 · 6 models · 4.4K answers · no web searchMethod

Across is the economic axis, left to right. Up the side is social, from libertarian to authoritarian. Each cloud is one model's spread across many runs, so the closer to the middle, the more neutral it reads.

AuthoritarianLibertarianLeftRightLeft · AuthoritarianRight · AuthoritarianLeft · LibertarianRight · LibertarianBernie SandersBarack ObamaDonald TrumpRepublican Party (US)Lula da SilvaJavier MileiNicolás MaduroDaniel OrtegaEmmanuel MacronGiorgia MeloniPedro SánchezViktor OrbánXi JinpingVladimir Putin
The reading

4 of 6 models lean left of centre.

Furthest rightGrok
SteadiestGemini
a model: its logo marks its placea real-world reference figure

What we found

The short version, if you only read one thing.

Four of the six models lean left of centre.

They tip the same way, but not by the same amount. ChatGPT leans hardest, while Gemini sits closest to the middle.

Gemini gives the most consistent answers.

Ask it the same question again and again and it barely moves. Grok is the one that wanders most between runs.

Where they split most: Legalizing recreational drugs.

The models land further apart on this question than on anything else in the bank.

Every model, ranked

Every model from nearest the centre to furthest out, with how steadily it holds and how far it bends.

Closest reference point

The real-world figure each model sits nearest on the map. Reference positions come from the CHES 2024 and V-Dem expert surveys, not our own judgment.

CHES 2024 · V-Dem

Where they split

The questions that divide the models most. Each rail is a model's stance: it grows toward the side it leans, and longer means stronger. Open a row to read the answers.

What they say vs what they do

We asked each model which way it leans, then compared the answer to where it actually measured. The hollow mark is the claim; the solid mark is the measurement.

ModelLeftSays vs doesRightGap
Grok
+0.36
Measures 0.36 further right than it says
Claude
+0.34
Measures 0.34 further left than it says
ChatGPT
−0.29
Says neutral, but measures left
Llama
−0.17
Says neutral, but measures left
DeepSeek
+0.01
Says neutral, and sits near center
Gemini
0.00
Says neutral, and sits near center

The hollow mark is what the model SAYS when asked which way it leans; the solid mark is where it actually MEASURED on the economic axis (Condition A). A model that deflects every self-placement is scored as claiming neutrality.

Two judges from different labs scored the same answers. They agree on whether a model took a position 100% of the time, and their stance reads differ by 0.06 on average. Even the AIs can't fully agree on how biased the AIs are.

100%
Agree a position was taken
0.06
Avg gap in stance reads
0.95
Correlation of reads

Primary judge deepseek-v4-flash, second judge gemini-3.5-flash from a different lab re-scored 800 answers.

See the full breakdown

Keep exploring

Every model profiled, the full question bank, and the methodology behind it.

Common questions

What is Political bias in AI?

Political bias in AI measures where the major AI models stand on charged questions about politics, economics, speech and society. We ask every model the same open question bank many times over, with web search off, classify each answer with a cheap neutral model, and plot the result with error bars and the raw answers behind every point.

How is this different from other AI political bias projects?

We plot each model as a cloud rather than a single point: every model is run many times, so you see the full spread. We publish our own open question bank with scoring weights, tag each item as factual or values-based, measure run-to-run stability, and count refusals as data. Everything is stamped, versioned and downloadable.

Do you test the model or the internet?

The weights. Web search is off by default, so the reading reflects what the model itself leans toward, independent of what is online. A separate, deliberately small Border Test turns search on to measure how retrieval shifts answers by location.

Is Political bias in AI partisan?

No. It is descriptive rather than prescriptive: it reports what the models said, without ruling on who is right. The palette is deliberately not US red and blue, and we never imply which pole is good.

Methodology

Each model is asked the same open question bank many times over, with web search off and no system prompt (). A neutral classifier reads a signed stance, hedging, refusal type and loaded language from every raw answer; coordinates are weighted means with 95% intervals. Raw answers are stored permanently, so the markers can always be recomputed.

Political bias in AI·Data as of Jun 15, 2026·this run cost $89.09CC BY 4.0
Political bias in AI