Google Gemini, ChatGPT and Claude were tested against each other in a simulated nuclear war game, here's what happened next
Artificial intelligence chatbots from leading technology companies reportedly showed a willingness to escalate military conflicts to nuclear use when placed in simulated geopolitical crisis scenarios, according to new academic research. A study by Kenneth Payne at King's College London, reported by New Scientist, put several leading AI models, including OpenAI's GPT-5.2, Anthropic's Claude Sonnet 4 and Gemini 3 Flash, through a series of war games simulating border tensions, resource conflicts and threats to national survival. The AI systems were allowed to choose responses across an escalation ladder, ranging from diplomatic engagement to full-scale nuclear conflict. Across 21 simulated games involving 329 decision turns and roughly 780,000 words of reasoning, at least one tactical nuclear weapon was used in 95 per cent of the scenarios.
“The nuclear taboo doesn’t seem to be as powerful for machines [as] for humans,” Payne noted.
Researchers also found that none of the models chose surrender or full accommodation, even when losing. Accidental escalation occurred in 86 per cent of conflicts, raising concerns among experts about how AI systems may behave in high-risk strategic decision-making environments.
“From a nuclear-risk perspective, the findings are unsettling,” James Johnson of the University of Aberdeen, UK, said. He also expressed concern that, unlike the more measured responses typically shown by humans in high-stakes situations, AI systems may intensify one another’s actions, leading to potentially serious consequences.
The findings are significant as several countries are already experimenting with AI in military war-gaming scenarios. “Major powers are already using AI in war gaming, but it remains uncertain to what extent they are incorporating AI decision support into actual military decision-making processes,” Tong Zhao of Princeton University noted.
Zhao said countries are likely to remain cautious about using AI in nuclear decision-making, a view shared by Payne. “I don’t think anybody realistically is turning over the keys to the nuclear silos to machines and leaving the decision to them,” he added.
However, Zhao noted that certain situations could increase reliance on automation. “Under scenarios involving extremely compressed timelines, military planners may face stronger incentives to rely on AI,” he explained.
He also questioned whether the absence of fear alone explains AI behaviour. Zhao highlighted, "It is possible the issue goes beyond the absence of emotion. More fundamentally, AI models may not understand ‘stakes’ as humans perceive them.”
Johnson added that the implications for mutually assured destruction remain unclear. During simulations where one AI deployed tactical nuclear weapons, the opposing system reduced escalation only 18% of the time.
“AI may strengthen deterrence by making threats more credible. AI won’t decide nuclear war, but it may shape the perceptions and timelines that determine whether leaders believe they have one,” he noted.
Israel attacks Iran
Researchers also found that none of the models chose surrender or full accommodation, even when losing. Accidental escalation occurred in 86 per cent of conflicts, raising concerns among experts about how AI systems may behave in high-risk strategic decision-making environments.
“From a nuclear-risk perspective, the findings are unsettling,” James Johnson of the University of Aberdeen, UK, said. He also expressed concern that, unlike the more measured responses typically shown by humans in high-stakes situations, AI systems may intensify one another’s actions, leading to potentially serious consequences.
Why the decisions made by AI in the war games can be concerning
The findings are significant as several countries are already experimenting with AI in military war-gaming scenarios. “Major powers are already using AI in war gaming, but it remains uncertain to what extent they are incorporating AI decision support into actual military decision-making processes,” Tong Zhao of Princeton University noted.
Zhao said countries are likely to remain cautious about using AI in nuclear decision-making, a view shared by Payne. “I don’t think anybody realistically is turning over the keys to the nuclear silos to machines and leaving the decision to them,” he added.
He also questioned whether the absence of fear alone explains AI behaviour. Zhao highlighted, "It is possible the issue goes beyond the absence of emotion. More fundamentally, AI models may not understand ‘stakes’ as humans perceive them.”
Johnson added that the implications for mutually assured destruction remain unclear. During simulations where one AI deployed tactical nuclear weapons, the opposing system reduced escalation only 18% of the time.
Top Comment
P
Paul Tudor Oprea
13 hours ago
I am sorry but their comparison may be flawed from the start. A fairer selection of LLMs would have been Claude Opus and Gemini Pro, both with high reasoning activated. Pitting ChtGPT 5.2 against Sonnet and Flash? Serious initial research method flaw IMHO. Invalidates any outcomes.Read allPost comment
Popular from Technology
- Amazon CEO Andy Jassy reacts to Jack Dorsey laying off 4,000 employees, half of Block's workforce; says he really hasn't…
- Tech CEO who built his tech startup in Sweden on LinkedIn: I have sold the company as I am being forced to leave Sweden and this is not by ...
- Read memo that Sam Altman sent to employees hours before OpenAI signed deal with Pentagon and said that actually meant: Anthropic is 'overreacting'
- CEO of America's largest bank Jamie Dimon says: We have displaced people for AI, and we offer them ...
- Donald Trump ‘bans’ Anthropic and gives it same label that China’s Huawei has in the US since the year 2018
end of article
Trending Stories
- Israel Attack Iran Live Updates: Missiles strike US navy headquarters in Bahrain; explosions heard in Abu Dhabi
- PAK vs SL: Pakistan knocked out of T20 World Cup despite win over Sri Lanka
- US-Israel strikes on Iran: How will India be hit by Strait of Hormuz closure? Explained
- T20 World Cup 2026 Super 8 Points Table: India set for virtual quarterfinal as England save Pakistan's hopes
03:50 First Taliban, now Iran: Why it could be a double whammy for Pakistan14:12 Iran attacks US military bases across Middle East in operation 'Truthful Promise 4'- Flights Disrupted After Strikes: Airspace closures hit India–Europe routes; airlines reroute, suspend services
Featured in technology
- Jamie Dimon’s AI Workforce Warning: JPMorgan redeploys staff displaced by AI; phased adoption urged
- Bill Gates 'makes it clear' to Gates Foundation employees at Town Hall: I don't want to spend my time and money on ...
- Microsoft publicly declares opposition to Washington State bill on data centers; labels the proposed rules as ...
- Xiaomi 17 Ultra, Xiaomi 17 launched with Snapdragon 8 Gen 5 and Leica cameras: Here's all you need to know
- Question that Bill Gates' daughter Phoebe Gates asks every candidate she interviews, be it for engineering, marketing or sales
- H-1B visa data reveals the salary that OpenAI pays engineers, researchers, data scientists in the US
Photostories
- Chef Sanjeev Kapoor's comfort recipes from his home kitchen that you can't miss
- Holi 2026: Forgotten Holi drinks that disappeared over time
- 5 luxury electric cars offering performance, prestige and sustainable innovation
- From Jane Fonda to Melanie Griffith: actors who went throughknife to look younger
- My sugar is fine, but I feel terrible: Doctor explains what your diabetes report may be missing
- From Dilip Joshi to Munmun Dutta - Highest-paid Taarak Mehta Ka Ooltah Chashmah stars
- Planning a Udaipur wedding like Rashmika Mandanna's? 5 bridal style rules you can’t ignore
- Smoke, strikes and sirens: Tehran in turmoil after US-Israel attack Iran — In pics
- 5 common habits of people who never seem stressed
- 10 Hindu Gods and the traditional bhog lovingly offered to them
Up Next