Seventy3:借助NotebookLM的能力进行论文解读,专注人工智能、大模型、机器人算法方向,让大家跟着AI一起进步。
进群添加小助手微信:seventy3_podcast
备注:小宇宙
今天的主题是:
Open-Reasoner-Zero: An Open Source Approach to Scaling Up Reinforcement Learning on the Base Model
Summary
Open-Reasoner-Zero (ORZ) is introduced as an open-source project focused on large-scale reinforcement learning for reasoning in large language models. The authors demonstrate that a simple...去小宇宙查看完整单集简介
前往小宇宙评论区与主播互动