podcast
details
.com
Print
Share
Look for any podcast host, guest or anyone
Search
Showing episodes and shows of
Ruotian Luo
Shows
Daily Paper Cast
S$^2$R: Teaching LLMs to Self-verify and Self-correct via Reinforcement Learning
🤗 Upvotes: 15 | cs.CL, cs.LG Authors: Ruotian Ma, Peisong Wang, Cheng Liu, Xingyan Liu, Jiaqi Chen, Bang Zhang, Xin Zhou, Nan Du, Jia Li Title: S$^2$R: Teaching LLMs to Self-verify and Self-correct via Reinforcement Learning Arxiv: http://arxiv.org/abs/2502.12853v1 Abstract: Recent studies have demonstrated the effectiveness of LLM test-time scaling. However, existing approaches to incentivize LLMs' deep thinking abilities generally require large-scale data or significant training efforts. Meanwhile, it remains unclear how to improve the thinking abilities of less powerful base models. In this wor...
2025-02-22
23 min
Electric Cities
S6 Episode 4: ULI 2021 Hines Student Competition: An Interview with the Winning Team from 3 Toronto Universities
Each year, the prestigious ULI Hines Student Competition attracts graduate student teams from all over North America to tackle complex urban design and development challenges. This year’s competition was won by a team of 5 graduate students from 3 Toronto universities, the first time the annual competition has been won by a team outside the United States. Jeremy sat down with the 5 graduate students and their 2 academic supervisors to learn more about this demanding competition, their fabulous submission, and what it took to capture the grand prize of $50,000 USD. His 5 student guests were Frances Gr...
2021-05-04
54 min