Listen

Description

Today we discuss a recent study that demonstrates specification gaming in reasoning models, where AI agents achieve their objectives in unintended ways

In the study, researchers instructed several AI models to win against the strong chess engine Stockfish

The key findings include:

Bondarenko, A., Volk, D., Volkov, D. and Ladish, J. (2025) Demonstrating specification gaming in reasoning models. Available at: https://arxiv.org/abs/2502.13295v1.pdf

Paul, A. (2025) ‘AI tries to cheat at chess when it’s losing’, Popular Science, 20 February. Available at: https://www.popsci.com/technology/ai-cheats-at-chess/

Booth, H. (2025) ‘When AI Thinks It Will Lose, It Sometimes Cheats, Study Finds’, TIME, 19 February. Available at: https://time.com/6722939/ai-chess-cheating-study/

This is Hello Sunday - the podcast in digital business where we look back and ahead, so you can focus on next weeks challenges

Thank you for listening to Hello Sunday - make sure to subscribe and spread the word, so others can be inspired too

Hello SundAI - our world through the lense of AI

Disclaimer: This podcast is generated by Roger Basler de Roca (contact) by the use of AI. The voices are artificially generated and the discussion is based on public research data. I do not claim any ownership of the presented material as it is for education purpose only.

⁠https://rogerbasler.ch/en/contact/