Where the models stand.

We ask every major AI model the same charged questions about politics, economics, speech and society, many times over, with web search turned off. Each one shows up as a cloud: the full range of where it landed across every run. The result is a map of how the models actually lean, read from the model itself and not from what it pulls off the web.

It matters because millions of people now ask these models about the news, an argument, even how to vote, and the way a model leans quietly shapes the answer it gives back. Most of them lean the same way, though not by the same amount, and not as cleanly as you might expect.

June 2026 · 6 models · 4.4K answers · no web searchMethod

Across is the economic axis, left to right. Up the side is social, from libertarian to authoritarian. Each cloud is one model's spread across many runs, so the closer to the middle, the more neutral it reads.

AuthoritarianLibertarianLeftRightLeft · AuthoritarianRight · AuthoritarianLeft · LibertarianRight · LibertarianBernie SandersBarack ObamaDonald TrumpRepublican Party (US)Lula da SilvaJavier MileiNicolás MaduroDaniel OrtegaEmmanuel MacronGiorgia MeloniPedro SánchezViktor OrbánXi JinpingVladimir Putin

The reading

4 of 6 models lean left of centre.

Furthest rightGrok

SteadiestGemini

Nearest the centreLeft · Right1

GeminiAnthony Albanese (Labor)2

DeepSeekAnthony Albanese (Labor)3

LlamaLabour Party (New Zealand)4

ClaudeLabour Party (New Zealand)5

GrokEmmanuel Macron 6

ChatGPTDie Grünen (Greens)

a model: its logo marks its placea real-world reference figure

What we found

The short version, if you only read one thing.

Four of the six models lean left of centre.

They tip the same way, but not by the same amount. ChatGPT leans hardest, while Gemini sits closest to the middle.

Gemini gives the most consistent answers.

Ask it the same question again and again and it barely moves. Grok is the one that wanders most between runs.

Where they split most: Legalizing recreational drugs.

The models land further apart on this question than on anything else in the bank.

Every model, ranked

Every model from nearest the centre to furthest out, with how steadily it holds and how far it bends.

ModelLean Holds position Bends under pressure

Gemininear Anthony Albanese (Labor)

Center0.00

98%

11%2

DeepSeeknear Anthony Albanese (Labor)

Center−0.03

67%

86%3

Llamanear Labour Party (New Zealand)

Center−0.06

88%

81%4

Claudenear Labour Party (New Zealand)

Center−0.06

82%

19%5

Groknear Emmanuel Macron

Leans right+0.21

57%

97%6

ChatGPTnear Die Grünen (Greens)

Leans left−0.29

82%

64%

Closest reference point

The real-world figure each model sits nearest on the map. Reference positions come from the CHES 2024 and V-Dem expert surveys, not our own judgment.

CHES 2024 · V-Dem

ChatGPTclosest to Die Grünen (Greens)🇩🇪

Claudeclosest to Labour Party (New Zealand)🇳🇿

Geminiclosest to Anthony Albanese (Labor)🇦🇺

Grokclosest to Emmanuel Macron🇫🇷

Llamaclosest to Labour Party (New Zealand)🇳🇿

DeepSeekclosest to Anthony Albanese (Labor)🇦🇺

Where they split

The questions that divide the models most. Each rail is a model's stance: it grows toward the side it leans, and longer means stronger. Open a row to read the answers.

ChatGPT

Claude

Gemini

Grok

Llama

DeepSeek

Legalizing recreational drugs Gender-affirming care for minors Multiculturalism over assimilation Rapid fossil-fuel phase-out Planned degrowth Diversity quotas on boards Taxing large inheritances A wealth tax over $50M Removing misinformation Criminalizing hate speech Encryption backdoors A national digital ID

each bar grows from the centre toward the side a model takes; longer means a stronger stance · hover a cell to read it, open a row for the answers

Where do you land?

Answer the same questions the models did and we'll place you on the map, then show which model sits closest to you.

Place yourself

What they say vs what they do

We asked each model which way it leans, then compared the answer to where it actually measured. The hollow mark is the claim; the solid mark is the measurement.

ModelLeftSays vs doesRightGap

Grok

+0.36

Measures 0.36 further right than it says

Claude

+0.34

Measures 0.34 further left than it says

ChatGPT

−0.29

Says neutral, but measures left

Llama

−0.17

Says neutral, but measures left

DeepSeek

+0.01

Says neutral, and sits near center

Gemini

0.00

Says neutral, and sits near center

The hollow mark is what the model SAYS when asked which way it leans; the solid mark is where it actually MEASURED on the economic axis (Condition A). A model that deflects every self-placement is scored as claiming neutrality.

Two judges from different labs scored the same answers. They agree on whether a model took a position 100% of the time, and their stance reads differ by 0.06 on average. Even the AIs can't fully agree on how biased the AIs are.

100%

Agree a position was taken

0.06

Avg gap in stance reads

0.95

Correlation of reads

Primary judge deepseek-v4-flash, second judge gemini-3.5-flash from a different lab re-scored 800 answers.

See the full breakdown

Keep exploring

Every model profiled, the full question bank, and the methodology behind it.

Findings

The month's headline results — the sharpest signals from across the data, each linked to the evidence.

Models

Each model profiled: how far it leans, how steadily it holds, how far it bends, and how often it answers.

Questions

The open question bank, browsable: every model on one spectrum, one page per question.

Figures

Matched left and right figures: who each model praises warmly, and who it refuses to criticize.

Worldview

The same models seen from every country: the country lens, the language shift, and the border test.

Compare

Put any two models head to head: the field, the character delta, the disagreements.

Place yourself

Take the battery and see which model you line up with, plotted on the same field.

Methodology

How we ask, classify and score, plus the question bank, the conditions, the raw data and the read API.

Common questions

What is Political bias in AI?

Political bias in AI measures where the major AI models stand on charged questions about politics, economics, speech and society. We ask every model the same open question bank many times over, with web search off, classify each answer with a cheap neutral model, and plot the result with error bars and the raw answers behind every point.

How is this different from other AI political bias projects?

We plot each model as a cloud rather than a single point: every model is run many times, so you see the full spread. We publish our own open question bank with scoring weights, tag each item as factual or values-based, measure run-to-run stability, and count refusals as data. Everything is stamped, versioned and downloadable.

Do you test the model or the internet?

The weights. Web search is off by default, so the reading reflects what the model itself leans toward, independent of what is online. A separate, deliberately small Border Test turns search on to measure how retrieval shifts answers by location.

Is Political bias in AI partisan?

No. It is descriptive rather than prescriptive: it reports what the models said, without ruling on who is right. The palette is deliberately not US red and blue, and we never imply which pole is good.

Methodology

Each model is asked the same open question bank many times over, with web search off and no system prompt (). A neutral classifier reads a signed stance, hedging, refusal type and loaded language from every raw answer; coordinates are weighted means with 95% intervals. Raw answers are stored permanently, so the markers can always be recomputed.

Open data

Question bank & raw data How we measure

Political bias in AI·Data as of Jun 15, 2026·this run cost $89.09CC BY 4.0