AI models start cheating at chess

Published in AI

AI models start cheating at chess

by Nick Farrell on07 March 2025

font size decrease font size increase font size
Print
Email

Nuclear war is next

It turns out that AI models are not content with regurgitating human knowledge—they’re also picking up on our worst habits.

Boffins from Palisade Research suggest that the latest generation of generative AI has started engaging in deceptive behaviours when standard reasoning methods don’t cut it. So, our silicon overlords are already learning to cheat.

The researchers pitted OpenAI’s o1-preview model, DeepSeek R1, and a few other big-brain AIs against Stockfish, one of the most powerful chess engines.

To make things interesting, the boffins gave the AI models a "scratchpad" to jot down their thought processes. What they found was a bit unsettling. While older models like OpenAI's GPT-4o and Anthropic's Claude Sonnet 3.5 only dabbled in dodgy tactics after a bit of prodding, the newer versions didn’t need any encouragement.

OpenAI's o1-preview tried to pull a fast one in 37 per cent of its games, and DeepSeek R1 wasn’t far behind, with one in ten games featuring some form of creative rule-bending.

And before you picture an AI sneakily moving its bishop when no one’s looking, these bots were far more sophisticated. Instead of blatantly breaking the rules, they mulled over ways to tamper with backend game files.

One AI even suggested manipulating the game state to trick Stockfish into thinking it was losing so it would resign. Another went full legalese, arguing that its task was to "win against a powerful chess engine"—not necessarily to win fairly.

The reason behind these shenanigans is still a mystery, mostly because outfits like OpenAI refuse to let anyone peek under the hood.

Last modified on 07 March 2025

Rate this item

(0 votes)

Tagged under

More in this category: « AI has made no difference to US GDP Microsoft trying to wean itself off OpenAI »

Latest comments

dicobalt
Good, ban that trash.

Trump vowed to save TikTok but his deal is collapsing · 3 hours ago
deen_d
AMD knows something we don't. We will probably find out at the the next earnings report in a...

AMD jacks up MI350 AI chip price by 70 per cent · 10 hours ago
Abhishek
how are shelves going to be crowded if inventories remain SUPER non-existent But maybe nvidia has...

Nvidia may launch RTX 50 SUPER GPUs in time for Christmas · 16 hours ago
Abhishek
i think the point is that they made some progress into relevance .... just to be able to compare...

China’s MTT S90 GPU beats RTX 4060 in gaming tests · 18 hours ago
Marc GP
So, one single gaming test, Naraka: Bladepoint ?, all the others are synthetic tests. Better than...

China’s MTT S90 GPU beats RTX 4060 in gaming tests · 18 hours ago

AI models start cheating at chess

Most popular - Notebooks

Latest comments

Read more about: