Listen

Description

A podcast discussing how to optimally scale test-time compute for Large Language Models (LLMs), focusing on improving both verifiers and the model's response distribution.