Description

The 9 papers in this episode are:

[00:24] 🧠 Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1.5B

[00:59] 🤝 Adaptive Multi-Agent Response Refinement in Conversational Systems

[01:30] 🧩 Wasm: A Pipeline for Constructing Structured Arabic Interleaved Multimodal Corpora

[02:17] ⚡ KLASS: KL-Guided Fast Inference in Masked Diffusion Models

[02:53] 🖥 Grounding Computer Use Agents on Human Demonstrations

[03:37] 🎥 VideoSSR: Video Self-Supervised Reinforcement Learning

[04:19] 🚪 The Path Not Taken: RLVR Provably Learns Off the Principals

[05:14] 🔗 BiCA: Effective Biomedical Dense Retrieval with Citation-Aware Hard Negatives

[05:56] 🤹 Walking the Tightrope of LLMs for Software Development: A Practitioners' Perspective

[Follow Us]

You can also find us on the following platforms for more information beyond the podcast content:

Xiaohongshu (小红书): AI速递