Listen

Description

Researchers at Meta developed "ScaleRL," a groundbreaking recipe that makes LLM reinforcement learning training predictable, just like pre-training. 
Paper: https://arxiv.org/pdf/2510.13786


Hear it broken down simply on the GenAI Learner podcast.