podcast
details
.com
Print
Share
Look for any podcast host, guest or anyone
Search
Showing episodes and shows of
Ruotian Luo
Shows
Daily Paper Cast
RLVER: Reinforcement Learning with Verifiable Emotion Rewards for Empathetic Agents
🤗 Upvotes: 27 | cs.CL, cs.AI, cs.CY Authors: Peisong Wang, Ruotian Ma, Bang Zhang, Xingyu Chen, Zhiwei He, Kang Luo, Qingsong Lv, Qingxuan Jiang, Zheng Xie, Shanyi Wang, Yuan Li, Fanghua Ye, Jian Li, Yifan Yang, Zhaopeng Tu, Xiaolong Li Title: RLVER: Reinforcement Learning with Verifiable Emotion Rewards for Empathetic Agents Arxiv: http://arxiv.org/abs/2507.03112v1 Abstract: Large language models (LLMs) excel at logical and algorithmic reasoning, yet their emotional intelligence (EQ) still lags far behind their cognitive prowess. While reinforcement learning from verifiable rewards (RLVR) has...
2025-07-10
21 min
Daily Paper Cast
S$^2$R: Teaching LLMs to Self-verify and Self-correct via Reinforcement Learning
🤗 Upvotes: 15 | cs.CL, cs.LG Authors: Ruotian Ma, Peisong Wang, Cheng Liu, Xingyan Liu, Jiaqi Chen, Bang Zhang, Xin Zhou, Nan Du, Jia Li Title: S$^2$R: Teaching LLMs to Self-verify and Self-correct via Reinforcement Learning Arxiv: http://arxiv.org/abs/2502.12853v1 Abstract: Recent studies have demonstrated the effectiveness of LLM test-time scaling. However, existing approaches to incentivize LLMs' deep thinking abilities generally require large-scale data or significant training efforts. Meanwhile, it remains unclear how to improve the thinking abilities of less powerful base models. In this wor...
2025-02-22
23 min
Electric Cities
S6 Episode 4: ULI 2021 Hines Student Competition: An Interview with the Winning Team from 3 Toronto Universities
Each year, the prestigious ULI Hines Student Competition attracts graduate student teams from all over North America to tackle complex urban design and development challenges. This year’s competition was won by a team of 5 graduate students from 3 Toronto universities, the first time the annual competition has been won by a team outside the United States. Jeremy sat down with the 5 graduate students and their 2 academic supervisors to learn more about this demanding competition, their fabulous submission, and what it took to capture the grand prize of $50,000 USD. His 5 student guests were Frances Gr...
2021-05-04
54 min