AI chatbots in general, and OpenAI and ChatGPT and especially GPT-4o the absurd sycophant in particular, have long had a problem with issues around mental health.
I covered various related issues last month.
This post is an opportunity to collect links to previous coverage in the first section, and go into the weeds on some new events in the later sections. A lot of you should likely skip most of the in-the-weeds discussions.
What Are The Problems
---
Outline:
(00:36) What Are The Problems
(03:06) This Week In Crazy
(05:05) OpenAI Updates Its Model Spec
(09:00) Detection Rates
(11:08) Anthropic Says Thanks For The Memories
(12:32) Boundary Violations
(18:41) A Note On Claude Prompt Injections
(20:17) Conclusion
---
First published:
October 28th, 2025
Source:
https://www.lesswrong.com/posts/vrjM8qLKbiAYKAHTa/ai-craziness-mitigation-efforts
---
Narrated by TYPE III AUDIO.
---
Images from the article:
Apple Podcasts and Spotify do not show images in the episode description. Try Pocket Casts, or another podcast app.