podcast
details
.com
Print
Share
Look for any podcast host, guest or anyone
Search
Showing episodes and shows of
Hongyuan Mei
Shows
Daily Paper Cast
FlowRL: Matching Reward Distributions for LLM Reasoning
🤗 Upvotes: 68 | cs.LG, cs.AI, cs.CL Authors: Xuekai Zhu, Daixuan Cheng, Dinghuai Zhang, Hengli Li, Kaiyan Zhang, Che Jiang, Youbang Sun, Ermo Hua, Yuxin Zuo, Xingtai Lv, Qizheng Zhang, Lin Chen, Fanghao Shao, Bo Xue, Yunchong Song, Zhenjie Yang, Ganqu Cui, Ning Ding, Jianfeng Gao, Xiaodong Liu, Bowen Zhou, Hongyuan Mei, Zhouhan Lin Title: FlowRL: Matching Reward Distributions for LLM Reasoning Arxiv: http://arxiv.org/abs/2509.15207v1 Abstract: We propose FlowRL: matching the full reward distribution via flow balancing instead of maximizing rewards in large language mod...
2025-09-20
19 min
論文らじお
LLMによる仮説生成:AIはひらめくか?
📄 本日の論文 タイトル:Hypothesis Generation with Large Language Models 著者:Yangqiaoyu Zhou, Haokun Liu, Tejes Srivastava, Hongyuan Mei, Chenhao Tan 公開:2024年4月5日(v1), 2024年12月18日改訂(v3)(arXiv:2404.04326) 分野:Artificial Intelligence 論文リンク:https://doi.org/10.48550/arXiv.2404.04326
2025-05-08
06 min