Listen

Description

The Don’t Worry About the Vase Podcast is a listener-supported podcast. To receive new posts and support the cost of creation, consider becoming a free or paid subscriber.

* 00:00 - Introduction

* 02:05 - Epoch Capabilities Index (ECI) (Model Card 2.3.6)

* 04:52 - What Do You Mean Verbalized Evaluation Awareness Is Going Down

* 05:43 - Capabilities (Model Card Section 6)

* 08:38 - Agentic Safety Benchmarks (8.3)

* 10:57 - Is Mythos AGI?

* 11:56 - Are AI Companies Using Warnings As Hype?

* 12:56 - Impressions (Model Card Section 7)

* 16:13 - Blatant Denials Are The Best Kind

* 17:13 - Prompt Injection Robustness

* 18:21 - Does Mythos Cross The New Knowledge Threshold?

* 19:10 - Is Mythos Surprising or Discontinuous?

* 23:18 - UK AISI Tests Claude Mythos On Cybersecurity

* 24:42 - Everything Reinforces My Existing Predictions And Policy Preferences

* 29:55 - Solve For The Equilibrium

* 31:20 - Does Not Compute

* 32:25 - Conclusion: How To Think About Mythos

https://open.substack.com/pub/thezvi/p/claude-mythos-3-capabilities-and?utm_campaign=post-expanded-share&utm_medium=web



Get full access to DWAtV Podcast at dwatvpodcast.substack.com/subscribe