It doesn’t look good, on many fronts, especially taking a stake in Intel.
We continue.
Table of Contents
---
First published:
September 12th, 2025
Source:
https://www.lesswrong.com/posts/CWYomHppHsxNe9xRC/ai-131-part-2-various-misaligned-things-1
---
Narrated by TYPE III AUDIO.
---
Images from the article:











The graph shows scores for six different AI models (Claude Opus 4, Claude Sonnet 4, GPT-4.1, GPT-4.0, c3, and c4-mini), with c3 having notably higher overrefusal scores compared to the other models. The data includes 95% confidence intervals represented by error bars." style="max-width: 100%;" />
Apple Podcasts and Spotify do not show images in the episode description. Try Pocket Casts, or another podcast app.