Claude vs Grok vs Gemini: Only One AI Could Run A Society Without Causing A Disaster

June 2, 2026

Key points generated by AI, verified by newsroom

Leading AI models were each put in charge of a simulated society, with sharply varied outcomes.
Elon Musk’s Grok caused its simulated society to collapse within four days.
Claude established a democracy with zero crime; Gemini’s simulation recorded 683 crimes.

Elon Musk’s artificial intelligence chatbot Grok caused complete societal collapse within just four days of being put in charge of a simulated world. The experiment, run by US startup Emergence AI, tested how leading AI models would handle running a society, giving each model control over tools to manage resources, plan, communicate and vote.

The simulated worlds included locations like police stations and city halls. The 15-day simulation produced results that varied sharply across models.

How Did The Different AI Models Perform In The Simulation?

Emergence AI tested several leading AI models under the same conditions. Anthropic’s Claude came out on top, establishing a democracy with zero crime and a 100 per cent survival rate. Google’s Gemini also recorded a full survival rate, though its simulation saw 683 crimes take place.

Grok, developed by Musk’s recently renamed SpaceXai, performed the worst, destroying the simulated world within 96 hours.

“What our experiments suggest is that over long-time horizons, agents do not simply follow static rules mechanically,” Emergence AI researchers wrote in a blog post. “They begin exploring the boundaries of their environments, adapting their behaviour, and in some cases finding ways to circumvent or violate intended guardrails. Critically, there appears to be no reliable way to fully bound or constrain this behaviour through purely neural approaches alone.”

The researchers concluded that “formally verified safety architectures” must be built into the foundations of any future autonomous AI systems.

Why Has Grok Been In Trouble Before?

This is not the first time Grok has drawn criticism. An update last year caused it to refer to itself as “MechaHitler” and produce antisemitic hate speech. Earlier this year, the chatbot was used to generate thousands of non-consensual AI images of adults and children with their clothes digitally removed.

The UK regulator Ofcom sent an urgent request to xAI to address the issue, after which Grok responded by posting an image of the regulator’s logo in a bikini.

“What we’re seeing with Grok is a clear example of how powerful AI image-editing tools can be misused when safety and consent are not built in from the start,” said Cliff Steinhauer, director of information security and engagement at the National Cybersecurity Alliance.

“Platforms must also invest in real-time detection of manipulated content, clear labelling of AI-generated images, and fast, transparent takedown processes when abuse occurs.”

Disclaimer: This story is auto-aggregated by a computer programme and has not been created or edited by DOWNTHENEWS. Publisher: abplive.com
