Ep. 44 — Deep Dive: The cost of intelligence
Date: 2026-05-09 (Saturday)
In this episode:
A weekend deep dive on AI pricing in 2026 — why the same week saw GPT-5.5 double its API price and DeepSeek V4 land at one-seventeenth of frontier cost, and what builders should do about a market that is splitting in two directions at once. Part 1 of a two-part weekend: tomorrow we follow with the margin trap.
Three angles:
1. Set the stage — frontier pricing is going up while commodity pricing is going down, both in the same news cycle
2. Core tension — premium and commodity tiers are different markets with different buyers; productivity surplus is going to shareholders, not workers
3. Builder implications — stop tokenmaxxing, route deliberately, and build the routing layer before you pick a model
Stories referenced:
- GPT-5.5 API price doubled to $5/$30 per million tokens — real-world cost up 49-92%
https://openrouter.ai/announcements/gpt55-cost-analysis
- DeepSeek V4 at 17x cheaper — builder runs ten-day local-vs-cloud measurement
https://reddit.com/r/LocalLLaMA/comments/1t4s6g2/deepseek_v4_being_17x_cheaper_got_me_to_actually/
- DeepSeek closes round at $45B valuation, up from $20B
https://techcrunch.com/2026/05/06/deepseek-could-hit-45b-valuation-from-its-first-investment-round/
- Moonshot AI raises $2B at $20B valuation, $200M ARR, Kimi K2.6 #2 on OpenRouter
https://techcrunch.com/2026/05/07/chinas-moonshot-ai-raises-2b-at-20b-valuation-as-demand-for-open-source-ai-skyrockets/
- Cloudflare lays off 1,100 (20% of company) on AI productivity, record $639.8M revenue
https://techcrunch.com/2026/05/08/cloudflare-says-ai-made-1100-jobs-obsolete-even-as-revenue-hit-a-record-high/
- Tokenmaxxing fails — Jellyfish data on diminishing returns from heavy AI use
https://www.businessinsider.com/ai-tokenmaxxing-fails-as-productivity-strategy-jellyfish-2026-5
- Anthropic doubles Claude Code per-developer cost estimate from $6 to $13 per active day
https://www.businessinsider.com/anthropic-doubles-claude-code-per-developer-cost
- Anthropic surpasses OpenAI: $39B ARR vs $25B, implied $1T+ valuation on secondary
Tomorrow: Deep Dive Part 2 — the margin trap. What happens to AI companies when inference is nearly free, the race to zero, and where defensible value is moving in 2026.