podcast
details
.com
Print
Share
Look for any podcast host, guest or anyone
Search
Showing episodes and shows of
Han Wang
Shows
Daily Paper Cast
Kling-Omni Technical Report
đ€ Upvotes: 112 | cs.CV Authors: Kling Team, Jialu Chen, Yuanzheng Ci, Xiangyu Du, Zipeng Feng, Kun Gai, Sainan Guo, Feng Han, Jingbin He, Kang He, Xiao Hu, Xiaohua Hu, Boyuan Jiang, Fangyuan Kong, Hang Li, Jie Li, Qingyu Li, Shen Li, Xiaohan Li, Yan Li, Jiajun Liang, Borui Liao, Yiqiao Liao, Weihong Lin, Quande Liu, Xiaokun Liu, Yilun Liu, Yuliang Liu, Shun Lu, Hangyu Mao, Yunyao Mao, Haodong Ouyang, Wenyu Qin, Wanqi Shi, Xiaoyu Shi, Lianghao Su, Haozhi Sun, Peiqin Sun, Pengfei Wan, Chao Wang, Chenyu Wang, Meng Wang, Qiulin Wang, Runqi Wang, Xintao Wang, Xuebo Wang, Zek...
2025-12-20
24 min
Daily Paper Cast
Adaptation of Agentic AI
đ€ Upvotes: 59 | cs.AI, cs.CL Authors: Pengcheng Jiang, Jiacheng Lin, Zhiyi Shi, Zifeng Wang, Luxi He, Yichen Wu, Ming Zhong, Peiyang Song, Qizheng Zhang, Heng Wang, Xueqiang Xu, Hanwen Xu, Pengrui Han, Dylan Zhang, Jiashuo Sun, Chaoqi Yang, Kun Qian, Tian Wang, Changran Hu, Manling Li, Quanzheng Li, Hao Peng, Sheng Wang, Jingbo Shang, Chao Zhang, Jiaxuan You, Liyuan Liu, Pan Lu, Yu Zhang, Heng Ji, Yejin Choi, Dawn Song, Jimeng Sun, Jiawei Han Title: Adaptation of Agentic AI Arxiv: http://arxiv.org/abs/2512.16301v1 Abstract: Cutting-edge age...
2025-12-20
26 min
Daily Paper Cast
Seedance 1.5 pro: A Native Audio-Visual Joint Generation Foundation Model
đ€ Upvotes: 30 | cs.CV Authors: Heyi Chen, Siyan Chen, Xin Chen, Yanfei Chen, Ying Chen, Zhuo Chen, Feng Cheng, Tianheng Cheng, Xinqi Cheng, Xuyan Chi, Jian Cong, Jing Cui, Qinpeng Cui, Qide Dong, Junliang Fan, Jing Fang, Zetao Fang, Chengjian Feng, Han Feng, Mingyuan Gao, Yu Gao, Dong Guo, Qiushan Guo, Boyang Hao, Qingkai Hao, Bibo He, Qian He, Tuyen Hoang, Ruoqing Hu, Xi Hu, Weilin Huang, Zhaoyang Huang, Zhongyi Huang, Donglei Ji, Siqi Jiang, Wei Jiang, Yunpu Jiang, Zhuo Jiang, Ashley Kim, Jianan Kong, Zhichao Lai, Shanshan Lao, Yichong Leng, Ai Li, Feiya Li, Gen Li, Hui...
2025-12-20
22 min
Daily Paper Cast
HyperVL: An Efficient and Dynamic Multimodal Large Language Model for Edge Devices
đ€ Upvotes: 31 | cs.CV, cs.CL Authors: HyperAI Team, Yuchen Liu, Kaiyang Han, Zhiqiang Xia, Yuhang Dong, Chen Song, Kangyu Tang, Jiaming Xu, Xiushi Feng, WenXuan Yu, Li Peng, Mingyang Wang, Kai Wang, Changpeng Yang, Yang Li, Haoyu Lu, Hao Wang, Bingna Xu, Guangyao Liu, Long Huang, Kaibin Guo, Jinyang Wu, Dan Wu, Hongzhen Wang, Peng Zhou, Shuai Nie, Shande Wang, Runyu Shi, Ying Huang Title: HyperVL: An Efficient and Dynamic Multimodal Large Language Model for Edge Devices Arxiv: http://arxiv.org/abs/2512.14052v1 Abstract: Current multimodal large lan...
2025-12-19
22 min
Daily Paper Cast
RoboTracer: Mastering Spatial Trace with Reasoning in Vision-Language Models for Robotics
đ€ Upvotes: 31 | cs.RO, cs.CV Authors: Enshen Zhou, Cheng Chi, Yibo Li, Jingkun An, Jiayuan Zhang, Shanyu Rong, Yi Han, Yuheng Ji, Mengzhen Liu, Pengwei Wang, Zhongyuan Wang, Lu Sheng, Shanghang Zhang Title: RoboTracer: Mastering Spatial Trace with Reasoning in Vision-Language Models for Robotics Arxiv: http://arxiv.org/abs/2512.13660v1 Abstract: Spatial tracing, as a fundamental embodied interaction ability for robots, is inherently challenging as it requires multi-step metric-grounded reasoning compounded with complex spatial referring and real-world metric measurement. However, existing methods struggle with this compositional task. To...
2025-12-18
19 min
Daily Paper Cast
Nex-N1: Agentic Models Trained via a Unified Ecosystem for Large-Scale Environment Construction
đ€ Upvotes: 58 | cs.CL Authors: Nex-AGI Team, :, Yuxuan Cai, Lu Chen, Qiaoling Chen, Yuyang Ding, Liwen Fan, Wenjie Fu, Yufei Gao, Honglin Guo, Pinxue Guo, Zhenhua Han, Zhengfu He, Hanglei Hu, Kai Hu, Shengjia Hua, Tianyu Huai, Baodai Huang, Li Ji, Zhen Jiang, Zhikai Lei, Bufan Li, Jiahang Lin, Lizhi Lin, Jinxiu Liu, Shichun Liu, Ziming Liu, Yuchen Ni, Pengfang Qian, Yujiong Shen, Qingyun Shi, Wentao Shu, Peng Sun, Yiran Suo, Tian Tang, Boyu Tian, Guoteng Wang, Junzhe Wang, Peixin Wang, Zhiheng Xi, Hang Yan, Jie Yang, Zhixiong Yang, Tianchu Yao, Guangze Ye, Qianxi Yu, Shuo Zhang, Xin...
2025-12-06
24 min
Daily Paper Cast
REASONEDIT: Towards Reasoning-Enhanced Image Editing Models
đ€ Upvotes: 40 | cs.CV Authors: Fukun Yin, Shiyu Liu, Yucheng Han, Zhibo Wang, Peng Xing, Rui Wang, Wei Cheng, Yingming Wang, Aojie Li, Zixin Yin, Pengtao Chen, Xiangyu Zhang, Daxin Jiang, Xianfang Zeng, Gang Yu Title: REASONEDIT: Towards Reasoning-Enhanced Image Editing Models Arxiv: http://arxiv.org/abs/2511.22625v1 Abstract: Recent advances in image editing models have shown remarkable progress. A common architectural design couples a multimodal large language model (MLLM) encoder with a diffusion decoder, as seen in systems such as Step1X-Edit and Qwen-Image-Edit, where the MLLM enc...
2025-12-02
21 min
Daily Paper Cast
Inferix: A Block-Diffusion based Next-Generation Inference Engine for World Simulation
đ€ Upvotes: 37 | cs.CV, cs.AI Authors: Inferix Team, Tianyu Feng, Yizeng Han, Jiahao He, Yuanyu He, Xi Lin, Teng Liu, Hanfeng Lu, Jiasheng Tang, Wei Wang, Zhiyuan Wang, Jichao Wu, Mingyang Yang, Yinghao Yu, Zeyu Zhang, Bohan Zhuang Title: Inferix: A Block-Diffusion based Next-Generation Inference Engine for World Simulation Arxiv: http://arxiv.org/abs/2511.20714v1 Abstract: World models serve as core simulators for fields such as agentic AI, embodied AI, and gaming, capable of generating long, physically realistic, and interactive high-quality videos. Moreover, scaling these models could unl...
2025-11-28
18 min
Daily Paper Cast
GeoVista: Web-Augmented Agentic Visual Reasoning for Geolocalization
đ€ Upvotes: 60 | cs.CV Authors: Yikun Wang, Zuyan Liu, Ziyi Wang, Pengfei Liu, Han Hu, Yongming Rao Title: GeoVista: Web-Augmented Agentic Visual Reasoning for Geolocalization Arxiv: http://arxiv.org/abs/2511.15705v1 Abstract: Current research on agentic visual reasoning enables deep multimodal understanding but primarily focuses on image manipulation tools, leaving a gap toward more general-purpose agentic models. In this work, we revisit the geolocalization task, which requires not only nuanced visual grounding but also web search to confirm or refine hypotheses during reasoning. Since existing geolocalization benchmarks fail to...
2025-11-25
21 min
Daily Paper Cast
SAM 3: Segment Anything with Concepts
đ€ Upvotes: 51 | cs.CV, cs.AI Authors: Nicolas Carion, Laura Gustafson, Yuan-Ting Hu, Shoubhik Debnath, Ronghang Hu, Didac Suris, Chaitanya Ryali, Kalyan Vasudev Alwala, Haitham Khedr, Andrew Huang, Jie Lei, Tengyu Ma, Baishan Guo, Arpit Kalla, Markus Marks, Joseph Greer, Meng Wang, Peize Sun, Roman RĂ€dle, Triantafyllos Afouras, Effrosyni Mavroudi, Katherine Xu, Tsung-Han Wu, Yu Zhou, Liliane Momeni, Rishi Hazra, Shuangrui Ding, Sagar Vaze, Francois Porcher, Feng Li, Siyuan Li, Aishwarya Kamath, Ho Kei Cheng, Piotr DollĂĄr, Nikhila Ravi, Kate Saenko, Pengchuan Zhang, Christoph Feichtenhofer Title: SAM 3: Segment Anything with Concepts Arx...
2025-11-25
23 min
1 ting ad gangen - Naturvidenskab skÄret ud i pap
KlÞgtvÊrk - 10 rÄd til at blive en stor tÊnker
En person, der er god med sine hĂŠnder, er naturligvis en god hĂ„ndvĂŠrker, og derfor mĂ„ en person, der er god til at tĂŠnke, vĂŠre en klĂžgtvĂŠrker. SĂ„dan tĂŠnkte H.C. Ărsted, da han opfandt ordet.Men hvordan bliver man en god tĂŠnker? I dette afsnit prĂžver Tobias Bjerg Wang og Villads Jacobsen at besvare dette spĂžrgsmĂ„l. De har begge en ph.d. i henholdsvis biomedicin og fysik (eller ogsĂ„ afleverer Tobias om to uger) og har hentet inspiration fra datidens store tĂŠnkere som Niels Bohr og...
2025-11-20
56 min
Daily Paper Cast
MiroThinker: Pushing the Performance Boundaries of Open-Source Research Agents via Model, Context, and Interactive Scaling
đ€ Upvotes: 104 | cs.CL Authors: MiroMind Team, Song Bai, Lidong Bing, Carson Chen, Guanzheng Chen, Yuntao Chen, Zhe Chen, Ziyi Chen, Jifeng Dai, Xuan Dong, Wenhan Dou, Yue Deng, Yunjie Fu, Junqi Ge, Chenxia Han, Tammy Huang, Zhenhang Huang, Jerry Jiao, Shilei Jiang, Tianyu Jiao, Xiaoqi Jian, Lei Lei, Ruilin Li, Ryan Luo, Tiantong Li, Xiang Lin, Ziyuan Liu, Zhiqi Li, Jie Ni, Qiang Ren, Pax Sun, Shiqian Su, Chenxin Tao, Bin Wang, Hellen Wang, Haonan Wang, James Wang, Jin Wang, Jojo Wang, Letian Wang, Shizun Wang, Weizhi Wang, Zixuan Wang, Jinfan Xu, Sen Xing, Chenyu Yang, Hai...
2025-11-19
27 min
Daily Paper Cast
Virtual Width Networks
đ€ Upvotes: 23 | cs.LG, cs.AI Authors: Seed, Baisheng Li, Banggu Wu, Bole Ma, Bowen Xiao, Chaoyi Zhang, Cheng Li, Chengyi Wang, Chenyin Xu, Chi Zhang, Chong Hu, Daoguang Zan, Defa Zhu, Dongyu Xu, Du Li, Faming Wu, Fan Xia, Ge Zhang, Guang Shi, Haobin Chen, Hongyu Zhu, Hongzhi Huang, Huan Zhou, Huanzhang Dou, Jianhui Duan, Jianqiao Lu, Jianyu Jiang, Jiayi Xu, Jiecao Chen, Jin Chen, Jin Ma, Jing Su, Jingji Chen, Jun Wang, Jun Yuan, Juncai Liu, Jundong Zhou, Kai Hua, Kai Shen, Kai Xiang, Kaiyuan Chen, Kang Liu, Ke Shen, Liang Xiang, Lin Yan, Lishu Luo...
2025-11-18
22 min
Daily Paper Cast
Every Activation Boosted: Scaling General Reasoner to 1 Trillion Open Language Foundation
đ€ Upvotes: 61 | cs.CL, cs.AI Authors: Ling-Team, Ang Li, Ben Liu, Binbin Hu, Bing Li, Bingwei Zeng, Borui Ye, Caizhi Tang, Changxin Tian, Chao Huang, Chao Zhang, Chen Qian, Chenchen Ju, Chenchen Li, Chengfu Tang, Chili Fu, Chunshao Ren, Chunwei Wu, Cong Zhang, Cunyin Peng, Dafeng Xu, Daixin Wang, Dalong Zhang, Dingnan Jin, Dingyuan Zhu, Dongke Hu, Fangzheng Zhao, Feifan Wu, Feng Zhu, Gangshan Wang, Haitao Zhang, Hailin Zhao, Hanxiao Zhang, Hanzi Wang, Hao Qian, Haoyi Yu, Heng Zhang, Hongliang Zhang, Hongzhi Luan, Huirong Dong, Huizhong Li, Jia Li, Jia Liu, Jialong Zhu, Jian Sha, Jianping Wei...
2025-11-05
24 min
Daily Paper Cast
UniREditBench: A Unified Reasoning-based Image Editing Benchmark
đ€ Upvotes: 22 | cs.CV Authors: Feng Han, Yibin Wang, Chenglin Li, Zheming Liang, Dianyi Wang, Yang Jiao, Zhipeng Wei, Chao Gong, Cheng Jin, Jingjing Chen, Jiaqi Wang Title: UniREditBench: A Unified Reasoning-based Image Editing Benchmark Arxiv: http://arxiv.org/abs/2511.01295v1 Abstract: Recent advances in multi-modal generative models have driven substantial improvements in image editing. However, current generative models still struggle with handling diverse and complex image editing tasks that require implicit reasoning, underscoring the need for a comprehensive benchmark to systematically assess their performance across various reasoning sce...
2025-11-05
23 min
Daily Paper Cast
Every Step Evolves: Scaling Reinforcement Learning for Trillion-Scale Thinking Model
đ€ Upvotes: 27 | cs.CL, cs.AI Authors: Ling Team, Anqi Shen, Baihui Li, Bin Hu, Bin Jing, Cai Chen, Chao Huang, Chao Zhang, Chaokun Yang, Cheng Lin, Chengyao Wen, Congqi Li, Deng Zhao, Dingbo Yuan, Donghai You, Fagui Mao, Fanzhuang Meng, Feng Xu, Guojie Li, Guowei Wang, Hao Dai, Haonan Zheng, Hong Liu, Jia Guo, Jiaming Liu, Jian Liu, Jianhao Fu, Jiannan Shi, Jianwen Wang, Jianxin Lai, Jin Yang, Jun Mei, Jun Zhou, Junbo Zhao, Junping Zhao, Kuan Xu, Le Su, Lei Chen, Li Tang, Liang Jiang, Liangcheng Fu, Lianhao Xu, Linfeng Shi, Lisha Liao, Longfei Zheng, Men...
2025-10-23
22 min
Daily Paper Cast
OmniVinci: Enhancing Architecture and Data for Omni-Modal Understanding LLM
đ€ Upvotes: 56 | cs.CV, cs.AI, cs.CL Authors: Hanrong Ye, Chao-Han Huck Yang, Arushi Goel, Wei Huang, Ligeng Zhu, Yuanhang Su, Sean Lin, An-Chieh Cheng, Zhen Wan, Jinchuan Tian, Yuming Lou, Dong Yang, Zhijian Liu, Yukang Chen, Ambrish Dantrey, Ehsan Jahangiri, Sreyan Ghosh, Daguang Xu, Ehsan Hosseini-Asl, Danial Mohseni Taheri, Vidya Murali, Sifei Liu, Jason Lu, Oluwatobi Olabiyi, Frank Wang, Rafael Valle, Bryan Catanzaro, Andrew Tao, Song Han, Jan Kautz, Hongxu Yin, Pavlo Molchanov Title: OmniVinci: Enhancing Architecture and Data for Omni-Modal Understanding LLM Arxiv: http://arxiv.org/abs/2510.15870v1 ...
2025-10-21
25 min
Daily Paper Cast
Spatial Forcing: Implicit Spatial Representation Alignment for Vision-language-action Model
đ€ Upvotes: 133 | cs.RO Authors: Fuhao Li, Wenxuan Song, Han Zhao, Jingbo Wang, Pengxiang Ding, Donglin Wang, Long Zeng, Haoang Li Title: Spatial Forcing: Implicit Spatial Representation Alignment for Vision-language-action Model Arxiv: http://arxiv.org/abs/2510.12276v1 Abstract: Vision-language-action (VLA) models have recently shown strong potential in enabling robots to follow language instructions and execute precise actions. However, most VLAs are built upon vision-language models pretrained solely on 2D data, which lack accurate spatial awareness and hinder their ability to operate in the 3D physical world. Existing solutions att...
2025-10-16
22 min
1 ting ad gangen - Naturvidenskab skÄret ud i pap
Werner Heisenberg: Helt eller skurk?
Dette er et portrĂŠt af den verdenskendte fysiker Werner Heisenberg.Han var en af hovedarkitekterne bag den moderne kvantefysik og ledte under Anden Verdenskrig nazisternes atomvĂ„benprogram. Heisenberg var i mange Ă„r en nĂŠr ven af Niels Bohr, men efter et skĂŠbnesvangert mĂžde i KĂžbenhavn i 1941 ĂŠndrede alt sig mellem dem.Han er kendt for usikkerhedsprincippet, isospin og matrixformalismen â bidrag, der har formet fysikken, som vi kender den i dag. Men samtidig stĂ„r han tilbage som en tvetydig figur i historien: var han en helt, en skurk â eller noget midt imellem?
2025-10-16
52 min
Daily Paper Cast
BEAR: Benchmarking and Enhancing Multimodal Language Models for Atomic Embodied Capabilities
đ€ Upvotes: 30 | cs.CV, cs.RO Authors: Yu Qi, Haibo Zhao, Ziyu Guo, Siyuan Ma, Ziyan Chen, Yaokun Han, Renrui Zhang, Zitiantao Lin, Shiji Xin, Yijian Huang, Kai Cheng, Peiheng Wang, Jiazheng Liu, Jiayi Zhang, Yizhe Zhu, Wenqing Wang, Yiran Qin, Xupeng Zhu, Haojie Huang, Lawson L. S. Wong Title: BEAR: Benchmarking and Enhancing Multimodal Language Models for Atomic Embodied Capabilities Arxiv: http://arxiv.org/abs/2510.08759v1 Abstract: Embodied capabilities refer to a suite of fundamental abilities for an agent to perceive, comprehend, and interact with the physical wor...
2025-10-14
26 min
Daily Paper Cast
StreamingVLM: Real-Time Understanding for Infinite Video Streams
đ€ Upvotes: 26 | cs.CV, cs.AI, cs.CL Authors: Ruyi Xu, Guangxuan Xiao, Yukang Chen, Liuning He, Kelly Peng, Yao Lu, Song Han Title: StreamingVLM: Real-Time Understanding for Infinite Video Streams Arxiv: http://arxiv.org/abs/2510.09608v1 Abstract: Vision-language models (VLMs) could power real-time assistants and autonomous agents, but they face a critical challenge: understanding near-infinite video streams without escalating latency and memory usage. Processing entire videos with full attention leads to quadratic computational costs and poor performance on long videos. Meanwhile, simple sliding window methods are also fla...
2025-10-14
21 min
Daily Paper Cast
SHANKS: Simultaneous Hearing and Thinking for Spoken Language Models
đ€ Upvotes: 32 | cs.CL, eess.AS Authors: Cheng-Han Chiang, Xiaofei Wang, Linjie Li, Chung-Ching Lin, Kevin Lin, Shujie Liu, Zhendong Wang, Zhengyuan Yang, Hung-yi Lee, Lijuan Wang Title: SHANKS: Simultaneous Hearing and Thinking for Spoken Language Models Arxiv: http://arxiv.org/abs/2510.06917v1 Abstract: Current large language models (LLMs) and spoken language models (SLMs) begin thinking and taking actions only after the user has finished their turn. This prevents the model from interacting during the user's turn and can lead to high response latency while it waits to thi...
2025-10-10
24 min
Daily Paper Cast
Vibe Checker: Aligning Code Evaluation with Human Preference
đ€ Upvotes: 28 | cs.CL, cs.AI, cs.LG, cs.SE Authors: Ming Zhong, Xiang Zhou, Ting-Yun Chang, Qingze Wang, Nan Xu, Xiance Si, Dan Garrette, Shyam Upadhyay, Jeremiah Liu, Jiawei Han, Benoit Schillings, Jiao Sun Title: Vibe Checker: Aligning Code Evaluation with Human Preference Arxiv: http://arxiv.org/abs/2510.07315v1 Abstract: Large Language Models (LLMs) have catalyzed vibe coding, where users leverage LLMs to generate and iteratively refine code through natural language interactions until it passes their vibe check. Vibe check is tied to real-world human preference and goe...
2025-10-10
23 min
Daily Paper Cast
Video-LMM Post-Training: A Deep Dive into Video Reasoning with Large Multimodal Models
đ€ Upvotes: 35 | cs.CV Authors: Yunlong Tang, Jing Bi, Pinxin Liu, Zhenyu Pan, Zhangyun Tan, Qianxiang Shen, Jiani Liu, Hang Hua, Junjia Guo, Yunzhong Xiao, Chao Huang, Zhiyuan Wang, Susan Liang, Xinyi Liu, Yizhi Song, Yuhe Nie, Jia-Xing Zhong, Bozheng Li, Daiqing Qi, Ziyun Zeng, Ali Vosoughi, Luchuan Song, Zeliang Zhang, Daiki Shimada, Han Liu, Jiebo Luo, Chenliang Xu Title: Video-LMM Post-Training: A Deep Dive into Video Reasoning with Large Multimodal Models Arxiv: http://arxiv.org/abs/2510.05034v1 Abstract: Video understanding represents the most challenging frontier in computer vis...
2025-10-08
26 min
Daily Paper Cast
DC-VideoGen: Efficient Video Generation with Deep Compression Video Autoencoder
đ€ Upvotes: 26 | cs.CV, cs.AI Authors: Junyu Chen, Wenkun He, Yuchao Gu, Yuyang Zhao, Jincheng Yu, Junsong Chen, Dongyun Zou, Yujun Lin, Zhekai Zhang, Muyang Li, Haocheng Xi, Ligeng Zhu, Enze Xie, Song Han, Han Cai Title: DC-VideoGen: Efficient Video Generation with Deep Compression Video Autoencoder Arxiv: http://arxiv.org/abs/2509.25182v1 Abstract: We introduce DC-VideoGen, a post-training acceleration framework for efficient video generation. DC-VideoGen can be applied to any pre-trained video diffusion model, improving efficiency by adapting it to a deep compression latent space with lightweight fin...
2025-10-02
25 min
Daily Paper Cast
SANA-Video: Efficient Video Generation with Block Linear Diffusion Transformer
đ€ Upvotes: 36 | cs.CV, cs.AI Authors: Junsong Chen, Yuyang Zhao, Jincheng Yu, Ruihang Chu, Junyu Chen, Shuai Yang, Xianbang Wang, Yicheng Pan, Daquan Zhou, Huan Ling, Haozhe Liu, Hongwei Yi, Hao Zhang, Muyang Li, Yukang Chen, Han Cai, Sanja Fidler, Ping Luo, Song Han, Enze Xie Title: SANA-Video: Efficient Video Generation with Block Linear Diffusion Transformer Arxiv: http://arxiv.org/abs/2509.24695v1 Abstract: We introduce SANA-Video, a small diffusion model that can efficiently generate videos up to 720x1280 resolution and minute-length duration. SANA-Video synthesizes high-resolution, high-quality and lon...
2025-10-01
26 min
Daily Paper Cast
LongLive: Real-time Interactive Long Video Generation
đ€ Upvotes: 136 | cs.CV Authors: Shuai Yang, Wei Huang, Ruihang Chu, Yicheng Xiao, Yuyang Zhao, Xianbang Wang, Muyang Li, Enze Xie, Yingcong Chen, Yao Lu, Song Han, Yukang Chen Title: LongLive: Real-time Interactive Long Video Generation Arxiv: http://arxiv.org/abs/2509.22622v1 Abstract: We present LongLive, a frame-level autoregressive (AR) framework for real-time and interactive long video generation. Long video generation presents challenges in both efficiency and quality. Diffusion and Diffusion-Forcing models can produce high-quality videos but suffer from low efficiency due to bidirectional attention. Causal attention AR mod...
2025-09-30
24 min
Daily Paper Cast
SciReasoner: Laying the Scientific Reasoning Ground Across Disciplines
đ€ Upvotes: 76 | cs.CL Authors: Yizhou Wang, Chen Tang, Han Deng, Jiabei Xiao, Jiaqi Liu, Jianyu Wu, Jun Yao, Pengze Li, Encheng Su, Lintao Wang, Guohang Zhuang, Yuchen Ren, Ben Fei, Ming Hu, Xin Chen, Dongzhan Zhou, Junjun He, Xiangyu Yue, Zhenfei Yin, Jiamin Wu, Qihao Zheng, Yuhao Zhou, Huihui Xu, Chenglong Ma, Yan Lu, Wenlong Zhang, Chunfeng Song, Philip Torr, Shixiang Tang, Xinzhu Ma, Wanli Ouyang, Lei Bai Title: SciReasoner: Laying the Scientific Reasoning Ground Across Disciplines Arxiv: http://arxiv.org/abs/2509.21320v1 Abstract: We present a sci...
2025-09-27
23 min
Daily Paper Cast
MiniCPM-V 4.5: Cooking Efficient MLLMs via Architecture, Data, and Training Recipe
đ€ Upvotes: 32 | cs.LG, cs.CV Authors: Tianyu Yu, Zefan Wang, Chongyi Wang, Fuwei Huang, Wenshuo Ma, Zhihui He, Tianchi Cai, Weize Chen, Yuxiang Huang, Yuanqian Zhao, Bokai Xu, Junbo Cui, Yingjing Xu, Liqing Ruan, Luoyuan Zhang, Hanyu Liu, Jingkun Tang, Hongyuan Liu, Qining Guo, Wenhao Hu, Bingxiang He, Jie Zhou, Jie Cai, Ji Qi, Zonghao Guo, Chi Chen, Guoyang Zeng, Yuxuan Li, Ganqu Cui, Ning Ding, Xu Han, Yuan Yao, Zhiyuan Liu, Maosong Sun Title: MiniCPM-V 4.5: Cooking Efficient MLLMs via Architecture, Data, and Training Recipe Arxiv: http://arxiv.org/abs/2509.18154v1
2025-09-25
25 min
Inside Vertical Short Dramas
S2E10| Forum 25Q3 â âMakeup Sex" & short dramas: How Kunlin Directs Emotional Punch in 1 Minuteâ
đ New release: Short Drama Writing 101: From Industry Logic to Social Psychology â a 36-page practical guide on short dramas. Covers audience psychology, social trends, scriptwriting methods, paywalls, and ad-driven business models.Available in English, French, Spanish, Portuguese, and Turkish.đ Get your Eng copy here: đ https://payhip.com/b/K82ozđ Available in multiple languages: payhip.com/ShortDramaAllianceThis episode comes from the Short Drama Forum 2025 Q3 Edition, a global roundtable connecting short drama creators from across cultures.đ€ Our guest is Kunlin Wang â a bilingual director who transitioned from a...
2025-09-19
48 min
Daily Paper Cast
SAIL-VL2 Technical Report
đ€ Upvotes: 29 | cs.CV Authors: Weijie Yin, Yongjie Ye, Fangxun Shu, Yue Liao, Zijian Kang, Hongyuan Dong, Haiyang Yu, Dingkang Yang, Jiacong Wang, Han Wang, Wenzhuo Liu, Xiao Liang, Shuicheng Yan, Chao Feng Title: SAIL-VL2 Technical Report Arxiv: http://arxiv.org/abs/2509.14033v1 Abstract: We introduce SAIL-VL2, an open-suite vision-language foundation model (LVM) for comprehensive multimodal understanding and reasoning. As the successor to SAIL-VL, SAIL-VL2 achieves state-of-the-art performance at the 2B and 8B parameter scales across diverse image and video benchmarks, demonstrating strong capabilities from fine-grained perception to com...
2025-09-19
24 min
Daily Paper Cast
VLA-Adapter: An Effective Paradigm for Tiny-Scale Vision-Language-Action Model
đ€ Upvotes: 114 | cs.RO Authors: Yihao Wang, Pengxiang Ding, Lingxiao Li, Can Cui, Zirui Ge, Xinyang Tong, Wenxuan Song, Han Zhao, Wei Zhao, Pengxu Hou, Siteng Huang, Yifan Tang, Wenhui Wang, Ru Zhang, Jianyi Liu, Donglin Wang Title: VLA-Adapter: An Effective Paradigm for Tiny-Scale Vision-Language-Action Model Arxiv: http://arxiv.org/abs/2509.09372v1 Abstract: Vision-Language-Action (VLA) models typically bridge the gap between perceptual and action spaces by pre-training a large-scale Vision-Language Model (VLM) on robotic data. While this approach greatly enhances performance, it also incurs significant training costs. In thi...
2025-09-13
21 min
Daily Paper Cast
Can Understanding and Generation Truly Benefit Together -- or Just Coexist?
đ€ Upvotes: 25 | cs.CV Authors: Zhiyuan Yan, Kaiqing Lin, Zongjian Li, Junyan Ye, Hui Han, Zhendong Wang, Hao Liu, Bin Lin, Hao Li, Xue Xu, Xinyan Xiao, Jingdong Wang, Haifeng Wang, Li Yuan Title: Can Understanding and Generation Truly Benefit Together -- or Just Coexist? Arxiv: http://arxiv.org/abs/2509.09666v1 Abstract: In this paper, we introduce an insightful paradigm through the Auto-Encoder lens-understanding as the encoder (I2T) that compresses images into text, and generation as the decoder (T2I) that reconstructs images from that text. Using rec...
2025-09-13
24 min
Daily Paper Cast
UI-TARS-2 Technical Report: Advancing GUI Agent with Multi-Turn Reinforcement Learning
đ€ Upvotes: 71 | cs.AI, cs.CL, cs.CV, cs.HC Authors: Haoming Wang, Haoyang Zou, Huatong Song, Jiazhan Feng, Junjie Fang, Junting Lu, Longxiang Liu, Qinyu Luo, Shihao Liang, Shijue Huang, Wanjun Zhong, Yining Ye, Yujia Qin, Yuwen Xiong, Yuxin Song, Zhiyong Wu, Bo Li, Chen Dun, Chong Liu, Fuxing Leng, Hanbin Wang, Hao Yu, Haobin Chen, Hongyi Guo, Jing Su, Jingjia Huang, Kai Shen, Kaiyu Shi, Lin Yan, Peiyao Zhao, Pengfei Liu, Qinghao Ye, Renjie Zheng, Wayne Xin Zhao, Wen Heng, Wenhao Huang, Wenqian Wang, Xiaobo Qin, Yi Lin, Youbin Wu, Zehui Chen, Zihao Wang, Baoquan Zho...
2025-09-04
24 min
Daily Paper Cast
Kwai Keye-VL 1.5 Technical Report
đ€ Upvotes: 26 | cs.CV Authors: Biao Yang, Bin Wen, Boyang Ding, Changyi Liu, Chenglong Chu, Chengru Song, Chongling Rao, Chuan Yi, Da Li, Dunju Zang, Fan Yang, Guorui Zhou, Guowang Zhang, Han Shen, Hao Peng, Haojie Ding, Hao Wang, Hengrui Ju, Jiaming Huang, Jiangxia Cao, Jiankang Chen, Jingyun Hua, Kaibing Chen, Kaiyu Jiang, Kaiyu Tang, Kun Gai, Muhao Wei, Qiang Wang, Ruitao Wang, Sen Na, Shengnan Zhang, Siyang Mao, Sui Huang, Tianke Zhang, Tingting Gao, Wei Chen, Wei Yuan, Xiangyu Wu, Xiao Hu, Xingyu Lu, Yi-Fan Zhang, Yiping Yang, Yulong Chen, Zeyi Lu, Zhenhua Wu, Zhixin Ling, Zhu...
2025-09-04
18 min
Inside Vertical Short Dramas
S2E6 | Forum 25Q3 â From Boysâ Love to Pride & Prejudice: UKâs On Set Octopus Leads Europeâs Vertical Wave
In this episode, we feature a conversation from the Short Drama Forum 2025 Q3 Edition with Emma Wang, Executive Producer and Head of Content, and Ben Pengilly, CEO and Founder of On Set Octopus.Emma Wang is the Executive Producer and Head of Content at Onset Octopus, where she leads the project-based writer team and oversees the full process from development to release. As an award-winning screenwriter, she has created multiple hit vertical dramas and brings strong bilingual expertise in both storytelling and execution. Ben Pengilly is an award-winning bilingual producer with 15 years of...
2025-08-30
37 min
Daily Paper Cast
CMPhysBench: A Benchmark for Evaluating Large Language Models in Condensed Matter Physics
đ€ Upvotes: 43 | cs.LG, cs.AI Authors: Weida Wang, Dongchen Huang, Jiatong Li, Tengchao Yang, Ziyang Zheng, Di Zhang, Dong Han, Benteng Chen, Binzhao Luo, Zhiyu Liu, Kunling Liu, Zhiyuan Gao, Shiqi Geng, Wei Ma, Jiaming Su, Xin Li, Shuchen Pu, Yuhan Shui, Qianjia Cheng, Zhihao Dou, Dongfei Cui, Changyong He, Jin Zeng, Zeke Xie, Mao Su, Dongzhan Zhou, Yuqiang Li, Wanli Ouyang, Yunqi Cai, Xi Dai, Shufei Zhang, Lei Bai, Jinguang Cheng, Zhong Fang, Hongming Weng Title: CMPhysBench: A Benchmark for Evaluating Large Language Models in Condensed Matter Physics Arxiv: htt...
2025-08-28
20 min
Daily Paper Cast
OmniHuman-1.5: Instilling an Active Mind in Avatars via Cognitive Simulation
đ€ Upvotes: 26 | cs.CV Authors: Jianwen Jiang, Weihong Zeng, Zerong Zheng, Jiaqi Yang, Chao Liang, Wang Liao, Han Liang, Yuan Zhang, Mingyuan Gao Title: OmniHuman-1.5: Instilling an Active Mind in Avatars via Cognitive Simulation Arxiv: http://arxiv.org/abs/2508.19209v1 Abstract: Existing video avatar models can produce fluid human animations, yet they struggle to move beyond mere physical likeness to capture a character's authentic essence. Their motions typically synchronize with low-level cues like audio rhythm, lacking a deeper semantic understanding of emotion, intent, or context. To bridge this gap...
2025-08-28
22 min
Daily Paper Cast
InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency
đ€ Upvotes: 120 | cs.CV Authors: Weiyun Wang, Zhangwei Gao, Lixin Gu, Hengjun Pu, Long Cui, Xingguang Wei, Zhaoyang Liu, Linglin Jing, Shenglong Ye, Jie Shao, Zhaokai Wang, Zhe Chen, Hongjie Zhang, Ganlin Yang, Haomin Wang, Qi Wei, Jinhui Yin, Wenhao Li, Erfei Cui, Guanzhou Chen, Zichen Ding, Changyao Tian, Zhenyu Wu, Jingjing Xie, Zehao Li, Bowen Yang, Yuchen Duan, Xuehui Wang, Songze Li, Xiangyu Zhao, Haodong Duan, Nianchen Deng, Bin Fu, Yinan He, Yi Wang, Conghui He, Botian Shi, Junjun He, Yingtong Xiong, Han Lv, Lijun Wu, Wenqi Shao, Kaipeng Zhang, Huipeng Deng, Biqing Qi, Jiaye Ge, Qip...
2025-08-27
23 min
Daily Paper Cast
LiveMCP-101: Stress Testing and Diagnosing MCP-enabled Agents on Challenging Queries
đ€ Upvotes: 26 | cs.CL, cs.AI Authors: Ming Yin, Dinghan Shen, Silei Xu, Jianbing Han, Sixun Dong, Mian Zhang, Yebowen Hu, Shujian Liu, Simin Ma, Song Wang, Sathish Reddy Indurthi, Xun Wang, Yiran Chen, Kaiqiang Song Title: LiveMCP-101: Stress Testing and Diagnosing MCP-enabled Agents on Challenging Queries Arxiv: http://arxiv.org/abs/2508.15760v1 Abstract: Tool calling has emerged as a critical capability for AI agents to interact with the real world and solve complex tasks. While the Model Context Protocol (MCP) provides a powerful standardized framework for tool int...
2025-08-23
23 min
Daily Paper Cast
Ovis2.5 Technical Report
đ€ Upvotes: 79 | cs.CV, cs.AI, cs.CL, cs.LG Authors: Shiyin Lu, Yang Li, Yu Xia, Yuwei Hu, Shanshan Zhao, Yanqing Ma, Zhichao Wei, Yinglun Li, Lunhao Duan, Jianshan Zhao, Yuxuan Han, Haijun Li, Wanying Chen, Junke Tang, Chengkun Hou, Zhixing Du, Tianli Zhou, Wenjie Zhang, Huping Ding, Jiahe Li, Wen Li, Gui Hu, Yiliang Gu, Siran Yang, Jiamang Wang, Hailong Sun, Yibo Wang, Hui Sun, Jinlong Huang, Yuping He, Shengze Shi, Weihong Zhang, Guodong Zheng, Junpeng Jiang, Sensen Gao, Yi-Feng Wu, Sijia Chen, Yuhui Chen, Qing-Guo Chen, Zhao Xu, Weihua Luo, Kaifu Zhang Tit...
2025-08-20
23 min
Daily Paper Cast
Has GPT-5 Achieved Spatial Intelligence? An Empirical Study
đ€ Upvotes: 22 | cs.CV, cs.CL, cs.LG, cs.MM, cs.RO Authors: Zhongang Cai, Yubo Wang, Qingping Sun, Ruisi Wang, Chenyang Gu, Wanqi Yin, Zhiqian Lin, Zhitao Yang, Chen Wei, Xuanke Shi, Kewang Deng, Xiaoyang Han, Zukai Chen, Jiaqi Li, Xiangyu Fan, Hanming Deng, Lewei Lu, Bo Li, Ziwei Liu, Quan Wang, Dahua Lin, Lei Yang Title: Has GPT-5 Achieved Spatial Intelligence? An Empirical Study Arxiv: http://arxiv.org/abs/2508.13142v1 Abstract: Multi-modal models have achieved remarkable progress in recent years. Nevertheless, they continue to exhibit notable lim...
2025-08-20
19 min
Daily Paper Cast
NextStep-1: Toward Autoregressive Image Generation with Continuous Tokens at Scale
đ€ Upvotes: 101 | cs.CV Authors: NextStep Team, Chunrui Han, Guopeng Li, Jingwei Wu, Quan Sun, Yan Cai, Yuang Peng, Zheng Ge, Deyu Zhou, Haomiao Tang, Hongyu Zhou, Kenkun Liu, Ailin Huang, Bin Wang, Changxin Miao, Deshan Sun, En Yu, Fukun Yin, Gang Yu, Hao Nie, Haoran Lv, Hanpeng Hu, Jia Wang, Jian Zhou, Jianjian Sun, Kaijun Tan, Kang An, Kangheng Lin, Liang Zhao, Mei Chen, Peng Xing, Rui Wang, Shiyu Liu, Shutao Xia, Tianhao You, Wei Ji, Xianfang Zeng, Xin Han, Xuelin Zhang, Yana Wei, Yanming Xu, Yimin Jiang, Yingming Wang, Yu Zhou, Yucheng Han, Ziyang Meng, Bin...
2025-08-16
23 min
Daily Paper Cast
A Comprehensive Survey of Self-Evolving AI Agents: A New Paradigm Bridging Foundation Models and Lifelong Agentic Systems
đ€ Upvotes: 42 | cs.AI, cs.CL, cs.MA Authors: Jinyuan Fang, Yanwen Peng, Xi Zhang, Yingxu Wang, Xinhao Yi, Guibin Zhang, Yi Xu, Bin Wu, Siwei Liu, Zihao Li, Zhaochun Ren, Nikos Aletras, Xi Wang, Han Zhou, Zaiqiao Meng Title: A Comprehensive Survey of Self-Evolving AI Agents: A New Paradigm Bridging Foundation Models and Lifelong Agentic Systems Arxiv: http://arxiv.org/abs/2508.07407v1 Abstract: Recent advances in large language models have sparked growing interest in AI agents capable of solving complex, real-world tasks. However, most existing agent systems rel...
2025-08-13
20 min
Daily Paper Cast
MolmoAct: Action Reasoning Models that can Reason in Space
đ€ Upvotes: 22 | cs.RO Authors: Jason Lee, Jiafei Duan, Haoquan Fang, Yuquan Deng, Shuo Liu, Boyang Li, Bohan Fang, Jieyu Zhang, Yi Ru Wang, Sangho Lee, Winson Han, Wilbert Pumacay, Angelica Wu, Rose Hendrix, Karen Farley, Eli VanderBilt, Ali Farhadi, Dieter Fox, Ranjay Krishna Title: MolmoAct: Action Reasoning Models that can Reason in Space Arxiv: http://arxiv.org/abs/2508.07917v1 Abstract: Reasoning is central to purposeful action, yet most robotic foundation models map perception and instructions directly to control, which limits adaptability, generalization, and semantic grounding. We introduce Act...
2025-08-13
20 min
Daily Paper Cast
ScreenCoder: Advancing Visual-to-Code Generation for Front-End Automation via Modular Multimodal Agents
đ€ Upvotes: 62 | cs.CV Authors: Yilei Jiang, Yaozhi Zheng, Yuxuan Wan, Jiaming Han, Qunzhong Wang, Michael R. Lyu, Xiangyu Yue Title: ScreenCoder: Advancing Visual-to-Code Generation for Front-End Automation via Modular Multimodal Agents Arxiv: http://arxiv.org/abs/2507.22827v1 Abstract: Automating the transformation of user interface (UI) designs into front-end code holds significant promise for accelerating software development and democratizing design workflows. While recent large language models (LLMs) have demonstrated progress in text-to-code generation, many existing approaches rely solely on natural language prompts, limiting their effectiveness in capturing spatial lay...
2025-08-01
20 min
Daily Paper Cast
X-Omni: Reinforcement Learning Makes Discrete Autoregressive Image Generative Models Great Again
đ€ Upvotes: 24 | cs.CV Authors: Zigang Geng, Yibing Wang, Yeyao Ma, Chen Li, Yongming Rao, Shuyang Gu, Zhao Zhong, Qinglin Lu, Han Hu, Xiaosong Zhang, Linus, Di Wang, Jie Jiang Title: X-Omni: Reinforcement Learning Makes Discrete Autoregressive Image Generative Models Great Again Arxiv: http://arxiv.org/abs/2507.22058v1 Abstract: Numerous efforts have been made to extend the ``next token prediction'' paradigm to visual contents, aiming to create a unified approach for both image generation and understanding. Nevertheless, attempts to generate images through autoregressive modeling with discrete tokens have bee...
2025-07-31
17 min
Daily Paper Cast
ARC-Hunyuan-Video-7B: Structured Video Comprehension of Real-World Shorts
đ€ Upvotes: 50 | cs.CV Authors: Yuying Ge, Yixiao Ge, Chen Li, Teng Wang, Junfu Pu, Yizhuo Li, Lu Qiu, Jin Ma, Lisheng Duan, Xinyu Zuo, Jinwen Luo, Weibo Gu, Zexuan Li, Xiaojing Zhang, Yangyu Tao, Han Hu, Di Wang, Ying Shan Title: ARC-Hunyuan-Video-7B: Structured Video Comprehension of Real-World Shorts Arxiv: http://arxiv.org/abs/2507.20939v1 Abstract: Real-world user-generated short videos, especially those distributed on platforms such as WeChat Channel and TikTok, dominate the mobile internet. However, current large multimodal models lack essential temporally-structured, detailed, and in-depth video com...
2025-07-30
22 min
Daily Paper Cast
A Survey of Self-Evolving Agents: On Path to Artificial Super Intelligence
đ€ Upvotes: 40 | cs.AI Authors: Huan-ang Gao, Jiayi Geng, Wenyue Hua, Mengkang Hu, Xinzhe Juan, Hongzhang Liu, Shilong Liu, Jiahao Qiu, Xuan Qi, Yiran Wu, Hongru Wang, Han Xiao, Yuhang Zhou, Shaokun Zhang, Jiayi Zhang, Jinyu Xiang, Yixiong Fang, Qiwen Zhao, Dongrui Liu, Qihan Ren, Cheng Qian, Zhenghailong Wang, Minda Hu, Huazheng Wang, Qingyun Wu, Heng Ji, Mengdi Wang Title: A Survey of Self-Evolving Agents: On Path to Artificial Super Intelligence Arxiv: http://arxiv.org/abs/2507.21046v1 Abstract: Large Language Models (LLMs) have demonstrated strong capabilities but remain fun...
2025-07-30
22 min
Daily Paper Cast
Step-Audio 2 Technical Report
đ€ Upvotes: 42 | cs.CL, cs.SD, eess.AS Authors: Boyong Wu, Chao Yan, Chen Hu, Cheng Yi, Chengli Feng, Fei Tian, Feiyu Shen, Gang Yu, Haoyang Zhang, Jingbei Li, Mingrui Chen, Peng Liu, Wang You, Xiangyu Tony Zhang, Xingyuan Li, Xuerui Yang, Yayue Deng, Yechang Huang, Yuxin Li, Yuxin Zhang, Zhao You, Brian Li, Changyi Wan, Hanpeng Hu, Jiangjie Zhen, Siyu Chen, Song Yuan, Xuelin Zhang, Yimin Jiang, Yu Zhou, Yuxiang Yang, Bingxin Li, Buyun Ma, Changhe Song, Dongqing Pang, Guoqiang Hu, Haiyang Sun, Kang An, Na Wang, Shuli Gao, Wei Ji, Wen Li, Wen Sun, Xuan Wen...
2025-07-24
22 min
Daily Paper Cast
MiroMind-M1: An Open-Source Advancement in Mathematical Reasoning via Context-Aware Multi-Stage Policy Optimization
đ€ Upvotes: 93 | cs.CL Authors: Xingxuan Li, Yao Xiao, Dianwen Ng, Hai Ye, Yue Deng, Xiang Lin, Bin Wang, Zhanfeng Mo, Chong Zhang, Yueyi Zhang, Zonglin Yang, Ruilin Li, Lei Lei, Shihao Xu, Han Zhao, Weiling Chen, Feng Ji, Lidong Bing Title: MiroMind-M1: An Open-Source Advancement in Mathematical Reasoning via Context-Aware Multi-Stage Policy Optimization Arxiv: http://arxiv.org/abs/2507.14683v1 Abstract: Large language models have recently evolved from fluent text generation to advanced reasoning across diverse domains, giving rise to reasoning language models. Among these domains, mathematical reasoning ser...
2025-07-23
23 min
Daily Paper Cast
Open Vision Reasoner: Transferring Linguistic Cognitive Behavior for Visual Reasoning
đ€ Upvotes: 47 | cs.CV, cs.CL Authors: Yana Wei, Liang Zhao, Jianjian Sun, Kangheng Lin, Jisheng Yin, Jingcheng Hu, Yinmin Zhang, En Yu, Haoran Lv, Zejia Weng, Jia Wang, Chunrui Han, Yuang Peng, Qi Han, Zheng Ge, Xiangyu Zhang, Daxin Jiang, Vishal M. Patel Title: Open Vision Reasoner: Transferring Linguistic Cognitive Behavior for Visual Reasoning Arxiv: http://arxiv.org/abs/2507.05255v1 Abstract: The remarkable reasoning capability of large language models (LLMs) stems from cognitive behaviors that emerge through reinforcement with verifiable rewards. This work investigates how to transfer thi...
2025-07-15
21 min
Daily Paper Cast
Gemini 2.5: Pushing the Frontier with Advanced Reasoning, Multimodality, Long Context, and Next Generation Agentic Capabilities
đ€ Upvotes: 24 | cs.CL, cs.AI Authors: Gheorghe Comanici, Eric Bieber, Mike Schaekermann, Ice Pasupat, Noveen Sachdeva, Inderjit Dhillon, Marcel Blistein, Ori Ram, Dan Zhang, Evan Rosen, Luke Marris, Sam Petulla, Colin Gaffney, Asaf Aharoni, Nathan Lintz, Tiago Cardal Pais, Henrik Jacobsson, Idan Szpektor, Nan-Jiang Jiang, Krishna Haridasan, Ahmed Omran, Nikunj Saunshi, Dara Bahri, Gaurav Mishra, Eric Chu, Toby Boyd, Brad Hekman, Aaron Parisi, Chaoyi Zhang, Kornraphop Kawintiranon, Tania Bedrax-Weiss, Oliver Wang, Ya Xu, Ollie Purkiss, Uri Mendlovic, IlaĂŻ Deutel, Nam Nguyen, Adam Langley, Flip Korn, Lucia Rossazza, Alexandre RamĂ©, Sagar Waghmare, Helen Miller, Vaishakh Keshava, Ying Jian...
2025-07-15
20 min
Daily Paper Cast
MiniCPM4: Ultra-Efficient LLMs on End Devices
đ€ Upvotes: 60 | cs.CL, cs.AI Authors: MiniCPM Team, Chaojun Xiao, Yuxuan Li, Xu Han, Yuzhuo Bai, Jie Cai, Haotian Chen, Wentong Chen, Xin Cong, Ganqu Cui, Ning Ding, Shengdan Fan, Yewei Fang, Zixuan Fu, Wenyu Guan, Yitong Guan, Junshao Guo, Yufeng Han, Bingxiang He, Yuxiang Huang, Cunliang Kong, Qiuzuo Li, Siyuan Li, Wenhao Li, Yanghao Li, Yishan Li, Zhen Li, Dan Liu, Biyuan Lin, Yankai Lin, Xiang Long, Quanyu Lu, Yaxi Lu, Peiyan Luo, Hongya Lyu, Litu Ou, Yinxu Pan, Zekai Qu, Qundong Shi, Zijun Song, Jiayuan Su, Zhou Su, Ao Sun, Xianghui Sun, Peijun Tang, Fan...
2025-06-11
20 min
focal podcast
Why Betting on an Idea Space Beats a Single Idea | Why 5,000 Signups Mean Nothing Without Retention | How to Know When to Pivot vs Persist | How to Survive 14 Months of Failure | Why Instincts Beat Market Research | Han Wang, Co-founder & CEO of Mintlify
Mastering the pivot playbook after 14 months of chewing on glass.Han Wang, Co-Founder and CEO of Mintlify, knows pivoting better than pretty much anyone. Mintlify pivoted 8 times during their first 14 months before finding Product-Market-Fit and eventually raising > $20M from a16z, Bain Capital Ventures, and YC.While those 14 months felt like âchewing on glassâ, Han learned a lot of valuable lessons he shared with me during our conversation, including why to bet on a space you care about, not a specific idea, if, when, and how to pivot, and why speed is your only advantage
2025-06-09
58 min
Daily Paper Cast
RoboRefer: Towards Spatial Referring with Reasoning in Vision-Language Models for Robotics
đ€ Upvotes: 32 | cs.RO, cs.AI, cs.CV Authors: Enshen Zhou, Jingkun An, Cheng Chi, Yi Han, Shanyu Rong, Chi Zhang, Pengwei Wang, Zhongyuan Wang, Tiejun Huang, Lu Sheng, Shanghang Zhang Title: RoboRefer: Towards Spatial Referring with Reasoning in Vision-Language Models for Robotics Arxiv: http://arxiv.org/abs/2506.04308v1 Abstract: Spatial referring is a fundamental capability of embodied robots to interact with the 3D physical world. However, even with the powerful pretrained vision language models (VLMs), recent approaches are still not qualified to accurately understand the complex 3D sce...
2025-06-07
23 min
Daily Paper Cast
AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time
đ€ Upvotes: 63 | cs.CL Authors: Junyu Zhang, Runpei Dong, Han Wang, Xuying Ning, Haoran Geng, Peihao Li, Xialin He, Yutong Bai, Jitendra Malik, Saurabh Gupta, Huan Zhang Title: AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time Arxiv: http://arxiv.org/abs/2505.24863v1 Abstract: This paper presents AlphaOne ($\alpha$1), a universal framework for modulating reasoning progress in large reasoning models (LRMs) at test time. $\alpha$1 first introduces $\alpha$ moment, which represents the scaled thinking phase with a universal parameter $\alpha$. Within this scaled pre-$\alpha$ moment phase, it...
2025-06-03
20 min
Daily Paper Cast
PhyX: Does Your Model Have the "Wits" for Physical Reasoning?
đ€ Upvotes: 38 | cs.AI Authors: Hui Shen, Taiqiang Wu, Qi Han, Yunta Hsieh, Jizhou Wang, Yuyue Zhang, Yuxin Cheng, Zijian Hao, Yuansheng Ni, Xin Wang, Zhongwei Wan, Kai Zhang, Wendong Xu, Jing Xiong, Ping Luo, Wenhu Chen, Chaofan Tao, Zhuoqing Mao, Ngai Wong Title: PhyX: Does Your Model Have the "Wits" for Physical Reasoning? Arxiv: http://arxiv.org/abs/2505.15929v1 Abstract: Existing benchmarks fail to capture a crucial aspect of intelligence: physical reasoning, the integrated ability to combine domain knowledge, symbolic reasoning, and understanding of real-world constraints. To add...
2025-05-27
22 min
Daily Paper Cast
Web-Shepherd: Advancing PRMs for Reinforcing Web Agents
đ€ Upvotes: 80 | cs.CL Authors: Hyungjoo Chae, Sunghwan Kim, Junhee Cho, Seungone Kim, Seungjun Moon, Gyeom Hwangbo, Dongha Lim, Minjin Kim, Yeonjun Hwang, Minju Gwak, Dongwook Choi, Minseok Kang, Gwanhoon Im, ByeongUng Cho, Hyojun Kim, Jun Hee Han, Taeyoon Kwon, Minju Kim, Beong-woo Kwak, Dongjin Kang, Jinyoung Yeo Title: Web-Shepherd: Advancing PRMs for Reinforcing Web Agents Arxiv: http://arxiv.org/abs/2505.15277v1 Abstract: Web navigation is a unique domain that can automate many repetitive real-life tasks and is challenging as it requires long-horizon sequential decision making beyond typical mul...
2025-05-23
22 min
Daily Paper Cast
Model Merging in Pre-training of Large Language Models
đ€ Upvotes: 27 | cs.CL, cs.LG Authors: Yunshui Li, Yiyuan Ma, Shen Yan, Chaoyi Zhang, Jing Liu, Jianqiao Lu, Ziwen Xu, Mengzhao Chen, Minrui Wang, Shiyi Zhan, Jin Ma, Xunhao Lai, Yao Luo, Xingyan Bin, Hongbin Ren, Mingji Han, Wenhao Hao, Bairen Yi, LingJun Liu, Bole Ma, Xiaoying Jia, Zhou Xun, Siyuan Qiao, Liang Xiang, Yonghui Wu Title: Model Merging in Pre-training of Large Language Models Arxiv: http://arxiv.org/abs/2505.12082v2 Abstract: Model merging has emerged as a promising technique for enhancing large language models, though its app...
2025-05-21
23 min
Daily Paper Cast
Visual Planning: Let's Think Only with Images
đ€ Upvotes: 33 | cs.LG, cs.AI, cs.CL, cs.CV Authors: Yi Xu, Chengzu Li, Han Zhou, Xingchen Wan, Caiqi Zhang, Anna Korhonen, Ivan VuliÄ Title: Visual Planning: Let's Think Only with Images Arxiv: http://arxiv.org/abs/2505.11409v1 Abstract: Recent advancements in Large Language Models (LLMs) and their multimodal extensions (MLLMs) have substantially enhanced machine reasoning across diverse tasks. However, these models predominantly rely on pure text as the medium for both expressing and structuring reasoning, even when visual information is present. In this work, we argue that...
2025-05-20
21 min
Daily Paper Cast
Seed1.5-VL Technical Report
đ€ Upvotes: 86 | cs.CV, cs.AI Authors: Dong Guo, Faming Wu, Feida Zhu, Fuxing Leng, Guang Shi, Haobin Chen, Haoqi Fan, Jian Wang, Jianyu Jiang, Jiawei Wang, Jingji Chen, Jingjia Huang, Kang Lei, Liping Yuan, Lishu Luo, Pengfei Liu, Qinghao Ye, Rui Qian, Shen Yan, Shixiong Zhao, Shuai Peng, Shuangye Li, Sihang Yuan, Sijin Wu, Tianheng Cheng, Weiwei Liu, Wenqian Wang, Xianhan Zeng, Xiao Liu, Xiaobo Qin, Xiaohan Ding, Xiaojun Xiao, Xiaoying Zhang, Xuanwei Zhang, Xuehan Xiong, Yanghua Peng, Yangrui Chen, Yanwei Li, Yanxu Hu, Yi Lin, Yiyuan Hu, Yiyuan Zhang, Youbin Wu, Yu Li, Yudong Liu, Yue...
2025-05-14
20 min
Daily Paper Cast
Step1X-Edit: A Practical Framework for General Image Editing
đ€ Upvotes: 55 | cs.CV Authors: Shiyu Liu, Yucheng Han, Peng Xing, Fukun Yin, Rui Wang, Wei Cheng, Jiaqi Liao, Yingming Wang, Honghao Fu, Chunrui Han, Guopeng Li, Yuang Peng, Quan Sun, Jingwei Wu, Yan Cai, Zheng Ge, Ranchen Ming, Lei Xia, Xianfang Zeng, Yibo Zhu, Binxing Jiao, Xiangyu Zhang, Gang Yu, Daxin Jiang Title: Step1X-Edit: A Practical Framework for General Image Editing Arxiv: http://arxiv.org/abs/2504.17761v1 Abstract: In recent years, image editing models have witnessed remarkable and rapid development. The recent unveiling of cutting-edge multimodal mod...
2025-04-26
20 min
Daily Paper Cast
Trillion 7B Technical Report
đ€ Upvotes: 27 | cs.CL, cs.AI, cs.LG Authors: Sungjun Han, Juyoung Suk, Suyeong An, Hyungguk Kim, Kyuseok Kim, Wonsuk Yang, Seungtaek Choi, Jamin Shin Title: Trillion 7B Technical Report Arxiv: http://arxiv.org/abs/2504.15431v1 Abstract: We introduce Trillion-7B, the most token-efficient Korean-centric multilingual LLM available. Our novel Cross-lingual Document Attention (XLDA) mechanism enables highly efficient and effective knowledge transfer from English to target languages like Korean and Japanese. Combined with optimized data mixtures, language-specific filtering, and tailored tokenizer construction, Trillion-7B achieves competitive performance while ded...
2025-04-25
25 min
Daily Paper Cast
Generate, but Verify: Reducing Hallucination in Vision-Language Models with Retrospective Resampling
đ€ Upvotes: 28 | cs.CV Authors: Tsung-Han Wu, Heekyung Lee, Jiaxin Ge, Joseph E. Gonzalez, Trevor Darrell, David M. Chan Title: Generate, but Verify: Reducing Hallucination in Vision-Language Models with Retrospective Resampling Arxiv: http://arxiv.org/abs/2504.13169v1 Abstract: Vision-Language Models (VLMs) excel at visual understanding but often suffer from visual hallucinations, where they generate descriptions of nonexistent objects, actions, or concepts, posing significant risks in safety-critical applications. Existing hallucination mitigation methods typically follow one of two paradigms: generation adjustment, which modifies decoding behavior to align text with visual inp...
2025-04-19
20 min
Daily Paper Cast
InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models
đ€ Upvotes: 172 | cs.CV Authors: Jinguo Zhu, Weiyun Wang, Zhe Chen, Zhaoyang Liu, Shenglong Ye, Lixin Gu, Yuchen Duan, Hao Tian, Weijie Su, Jie Shao, Zhangwei Gao, Erfei Cui, Yue Cao, Yangzhou Liu, Xingguang Wei, Hongjie Zhang, Haomin Wang, Weiye Xu, Hao Li, Jiahao Wang, Dengnian Chen, Songze Li, Yinan He, Tan Jiang, Jiapeng Luo, Yi Wang, Conghui He, Botian Shi, Xingcheng Zhang, Wenqi Shao, Junjun He, Yingtong Xiong, Wenwen Qu, Peng Sun, Penglong Jiao, Han Lv, Lijun Wu, Kaipeng Zhang, Huipeng Deng, Jiaye Ge, Kai Chen, Limin Wang, Min Dou, Lewei Lu, Xizhou Zhu, Tong Lu, Dah...
2025-04-16
22 min
Daily Paper Cast
Kimi-VL Technical Report
đ€ Upvotes: 71 | cs.CV Authors: Kimi Team, Angang Du, Bohong Yin, Bowei Xing, Bowen Qu, Bowen Wang, Cheng Chen, Chenlin Zhang, Chenzhuang Du, Chu Wei, Congcong Wang, Dehao Zhang, Dikang Du, Dongliang Wang, Enming Yuan, Enzhe Lu, Fang Li, Flood Sung, Guangda Wei, Guokun Lai, Han Zhu, Hao Ding, Hao Hu, Hao Yang, Hao Zhang, Haoning Wu, Haotian Yao, Haoyu Lu, Heng Wang, Hongcheng Gao, Huabin Zheng, Jiaming Li, Jianlin Su, Jianzhou Wang, Jiaqi Deng, Jiezhong Qiu, Jin Xie, Jinhong Wang, Jingyuan Liu, Junjie Yan, Kun Ouyang, Liang Chen, Lin Sui, Longhui Yu, Mengfan Dong, Mengnan Dong, Nuo...
2025-04-12
23 min
Daily Paper Cast
DeepSeek-R1 Thoughtology: Let's
about LLM Reasoning
đ€ Upvotes: 33 | cs.CL Authors: Sara Vera MarjanoviÄ, Arkil Patel, Vaibhav Adlakha, Milad Aghajohari, Parishad BehnamGhader, Mehar Bhatia, Aditi Khandelwal, Austin Kraft, Benno Krojer, Xing Han LĂč, Nicholas Meade, Dongchan Shin, Amirhossein Kazemnejad, Gaurav Kamath, Marius Mosbach, Karolina StaĆczak, Siva Reddy Title: DeepSeek-R1 Thoughtology: Let's about LLM Reasoning Arxiv: http://arxiv.org/abs/2504.07128v1 Abstract: Large Reasoning Models like DeepSeek-R1 mark a fundamental shift in how LLMs approach complex problems. Instead of directly producing an answer for a given input, DeepSeek-R1 creates detailed multi-step reasoning chains, seemingly "thinki...
2025-04-12
25 min
Daily Paper Cast
One-Minute Video Generation with Test-Time Training
đ€ Upvotes: 61 | cs.CV Authors: Karan Dalal, Daniel Koceja, Gashon Hussein, Jiarui Xu, Yue Zhao, Youjin Song, Shihao Han, Ka Chun Cheung, Jan Kautz, Carlos Guestrin, Tatsunori Hashimoto, Sanmi Koyejo, Yejin Choi, Yu Sun, Xiaolong Wang Title: One-Minute Video Generation with Test-Time Training Arxiv: http://arxiv.org/abs/2504.05298v1 Abstract: Transformers today still struggle to generate one-minute videos because self-attention layers are inefficient for long context. Alternatives such as Mamba layers struggle with complex multi-scene stories because their hidden states are less expressive. We experiment with Test-Time Training (TTT...
2025-04-09
18 min
Daily Paper Cast
Token-Efficient Long Video Understanding for Multimodal LLMs
đ€ Upvotes: 41 | cs.CV Authors: Jindong Jiang, Xiuyu Li, Zhijian Liu, Muyang Li, Guo Chen, Zhiqi Li, De-An Huang, Guilin Liu, Zhiding Yu, Kurt Keutzer, Sungjin Ahn, Jan Kautz, Hongxu Yin, Yao Lu, Song Han, Wonmin Byeon Title: Token-Efficient Long Video Understanding for Multimodal LLMs Arxiv: http://arxiv.org/abs/2503.04130v1 Abstract: Recent advances in video-based multimodal large language models (Video-LLMs) have significantly improved video understanding by processing videos as sequences of image frames. However, many existing methods treat frames independently in the vision backbone, lacking explicit temporal mod...
2025-03-08
21 min
Daily Paper Cast
DeepSolution: Boosting Complex Engineering Solution Design via Tree-based Exploration and Bi-point Thinking
đ€ Upvotes: 27 | cs.AI Authors: Zhuoqun Li, Haiyang Yu, Xuanang Chen, Hongyu Lin, Yaojie Lu, Fei Huang, Xianpei Han, Yongbin Li, Le Sun Title: DeepSolution: Boosting Complex Engineering Solution Design via Tree-based Exploration and Bi-point Thinking Arxiv: http://arxiv.org/abs/2502.20730v1 Abstract: Designing solutions for complex engineering challenges is crucial in human production activities. However, previous research in the retrieval-augmented generation (RAG) field has not sufficiently addressed tasks related to the design of complex engineering solutions. To fill this gap, we introduce a new benchmark, SolutionBench, to eva...
2025-03-04
22 min
Daily Paper Cast
Step-Video-T2V Technical Report: The Practice, Challenges, and Future of Video Foundation Model
đ€ Upvotes: 38 | cs.CV, cs.CL Authors: Guoqing Ma, Haoyang Huang, Kun Yan, Liangyu Chen, Nan Duan, Shengming Yin, Changyi Wan, Ranchen Ming, Xiaoniu Song, Xing Chen, Yu Zhou, Deshan Sun, Deyu Zhou, Jian Zhou, Kaijun Tan, Kang An, Mei Chen, Wei Ji, Qiling Wu, Wen Sun, Xin Han, Yanan Wei, Zheng Ge, Aojie Li, Bin Wang, Bizhu Huang, Bo Wang, Brian Li, Changxing Miao, Chen Xu, Chenfei Wu, Chenguang Yu, Dapeng Shi, Dingyuan Hu, Enle Liu, Gang Yu, Ge Yang, Guanzhe Huang, Gulin Yan, Haiyang Feng, Hao Nie, Haonan Jia, Hanpeng Hu, Hanqi Chen, Haolong Yan, Hen...
2025-02-18
23 min
Daily Paper Cast
ZeroBench: An Impossible Visual Benchmark for Contemporary Large Multimodal Models
đ€ Upvotes: 27 | cs.CV Authors: Jonathan Roberts, Mohammad Reza Taesiri, Ansh Sharma, Akash Gupta, Samuel Roberts, Ioana Croitoru, Simion-Vlad Bogolin, Jialu Tang, Florian Langer, Vyas Raina, Vatsal Raina, Hanyi Xiong, Vishaal Udandarao, Jingyi Lu, Shiyang Chen, Sam Purkis, Tianshuo Yan, Wenye Lin, Gyungin Shin, Qiaochu Yang, Anh Totti Nguyen, Kai Han, Samuel Albanie Title: ZeroBench: An Impossible Visual Benchmark for Contemporary Large Multimodal Models Arxiv: http://arxiv.org/abs/2502.09696v1 Abstract: Large Multimodal Models (LMMs) exhibit major shortfalls when interpreting images and, by some measures, have poorer spatial cog...
2025-02-18
22 min
Daily Paper Cast
TextAtlas5M: A Large-scale Dataset for Dense Text Image Generation
đ€ Upvotes: 35 | cs.CV Authors: Alex Jinpeng Wang, Dongxing Mao, Jiawei Zhang, Weiming Han, Zhuobai Dong, Linjie Li, Yiqi Lin, Zhengyuan Yang, Libo Qin, Fuwei Zhang, Lijuan Wang, Min Li Title: TextAtlas5M: A Large-scale Dataset for Dense Text Image Generation Arxiv: http://arxiv.org/abs/2502.07870v1 Abstract: Text-conditioned image generation has gained significant attention in recent years and are processing increasingly longer and comprehensive text prompt. In everyday life, dense and intricate text appears in contexts like advertisements, infographics, and signage, where the integration of both text and...
2025-02-14
18 min
TOPIK & Beyond
#16: Untranslatable Words and Expressions: Understanding Korean Culture Through Language
Ever come across a Korean word that just doesnât translate neatly into English? Words like ì (jeong) or ëìč (nunchi) carry meanings so deep that a simple translation doesnât do them justice. In this episode of TOPIK & Beyond, we explore some of these âuntranslatableâ words and uncover what they reveal about Korean society.Youâll learn:The cultural significance of words like ì , í, and ëìč.How understanding these expressions can give you an edge on the TOPIK exam, especially in the reading and writing sections.Practical tips for using cultural knowledge to make your TOPIK II essays more thoughtful and nuanced.
2025-02-04
30 min
Dialogues on Applied Channel Theory
Episode 57: Insights into the Early History of Acupuncture
Send us a textIn this episode, Jonathan talks to Dr. Shelley Ochs about recent texts and artifacts excavated from the Han Dynasty tomb in Lao Guan Shan (èćźć±±æ±ćą), Sichuan Province. She discusses how these findings are related to her PhD research on Bian Que, including his use of a channel based medicine. Later in the episode, Shelley also talks about a figurine with channel pathways discovered in the tombs which gives insights into the development of channels during that period of time. Excavated texts also point to the use of palpation in the discovery of the channels. Link t...
2024-12-13
34 min
1 ting ad gangen - Naturvidenskab skÄret ud i pap
Den store bioquiz
SÄ er det Tobias' tur til at quizze Villads. Om han vil det eller ej. Har Villads lyttet efter de sidste par Är vi har lavet podcast? Eller har han glemt det hele? Find ud af det i dagens afsnit, hvor I ogsÄ kan gÊtte med derhjemme! Support the showVÊrter: Villads Lundsteen Jacobsen og Tobias Wang BjergHjemmeside: https://1tingadgangen.buzzsprout.com/Facebook: https://www.facebook.com/1tingadgangen/Instagram: https://www.instagram.com/1tingadgangen/
2024-09-26
31 min
1 ting ad gangen - Naturvidenskab skÄret ud i pap
Den store fysikquiz
Denne gang tester jeg Tobias' fysik viden for at se, om han virkelig har lÊrt noget gennem de sidste 4 Är som medvÊrt pÄ 1 ting ad gangen. Kan du klare quizzen? Lyt med og gÊt sammen med TobiasSupport the showVÊrter: Villads Lundsteen Jacobsen og Tobias Wang BjergHjemmeside: https://1tingadgangen.buzzsprout.com/Facebook: https://www.facebook.com/1tingadgangen/Instagram: https://www.instagram.com/1tingadgangen/
2024-09-12
34 min
Signal and Trace
The Future of Developer Documentation - Han Wang from Mintlify
In this episode, we sit down with Han from Mintlify (mintlify.com) to explore how theyâre revolutionizing developer documentation with a focus on interactivity and accessibility. Han shares insights into the challenges of creating tools that enhance developer productivity while ensuring ease of use. We also dive into the future of documentation and how Mintlify is paving the way for more intuitive developer experiences! To learn more, take a look at the following! - https://mintlify.com - https://x.com/mintlify - https://x.com/handotdev
2024-09-06
40 min
1 ting ad gangen - Naturvidenskab skÄret ud i pap
Den FamĂžse Richard Feynman
I denne episode dykker vi ned i livet og karrieren hos en af verdens mest fascinerende og karismatiske fysikere, Richard Feynman. Kendt for sin rolle i udviklingen af atombomben under Manhattan-projektet og som central figur i undersĂžgelsen af Challenger-katastrofen, har Feynman efterladt et uudsletteligt prĂŠg pĂ„ videnskaben. Hans bedrifter og anekdoter kunne fylde flere bĂžger â og det har de ogsĂ„ gjort. Men i dag ser vi nĂŠrmere pĂ„ personen bag geniet og udforsker det liv, som gjorde Richard Feynman til den ekstraordinĂŠre mand, han var.Support the showVĂŠrter: Villads L...
2024-09-05
1h 01
Scaling DevTools
Great documentation with Han Wang from Mintlify
Han Wang is co-founder of Mintlify - modern, out the box documentation.In this episode, Han shares the story of Mintlify and how to make great docs.We even talk about the time Paul Graham told them to change their name.What we cover:- the origin story of Mintlify- what is good documentation- the process of documentation- how AI is affecting documentation- why PG told them to change their nameLinks:- Han https://han.dev/- Mintlify https...
2024-08-01
39 min
1 ting ad gangen - Naturvidenskab skÄret ud i pap
Kan dit hoved forlÊnge rÊkkevidden pÄ din bilnÞgle?
PÄ Tobias' ferie opdagede han, at hvis han holdt sin bilnÞgle til hovedet, sÄ var rÊkkevidden lÊngere. Denne myte har du mÄske hÞrt fÞr, da den har cirkuleret i samfundet i mange Är. Men er det rigtigt? Det handler afsnittet om i dag. Links til mere information:Privat persons egne mÄlinger pÄ effekten"Why does light bend when it enters glass?" youtubeSupport the showVÊrter: Villads Lundsteen Jacobsen og Tobias Wang BjergHjemmeside: https://1tingadgangen.buzzsprout.com/Facebook...
2024-07-11
46 min
1 ting ad gangen - Naturvidenskab skÄret ud i pap
Einsteins Relativitetsteori: FĂžrste skridt mod tidsrejser
Dyk ned i verdenskendte fysiker Albert Einsteins teorier om relativitet og udforsk nogle de fascinerende tankeeksperimenter han brugte til at argumentere for hans teori i 1905.Forestil dig en foton, der hopper mellem to spejle pÄ et tog i bevÊgelse, set fra to perspektiver: en person pÄ toget og en observatÞr udenfor. Hvad sker der med lysets hastighed?Mere info til NÞrden: Sartori, L. (1996). Understanding relativity: a simplified approach to Einstein's theories. Univ of California Press. Support the showVÊrter: Villads Lundsteen Jacobsen og Tobias W...
2023-11-16
35 min
1 ting ad gangen - Naturvidenskab skÄret ud i pap
Oppenheimer: Geniet Bag Atombomben
Udforsk J. Robert Oppenheimers fascinerende livsrejse fra hans tidlige interesser i mineralogi til at blive en central figur i Manhattan-projektet. LĂŠr om hans videnskabelige bedrifter, de etiske dilemmaer han stod overfor, og hans komplekse arv Mere info til nĂžrdenSurley you are Joking Mr Feynman - Ralph LeightonStoryville - The Trials Of Oppenheimer - BBC History DocumentarySupport the showSupport the showVĂŠrter: Villads Lundsteen Jacobsen og Tobias Wang BjergHjemmeside: https://1tingadgangen.buzzsprout.com/Facebook: https://www.facebook.com...
2023-06-29
53 min
1 ting ad gangen - Naturvidenskab skÄret ud i pap
Aarhus, Sydney og Tokyo. En historie om mikrodosimetri
I denne episode vil vi tale om Villads' eventyrlige rejse til Australien, og hvordan hans planer Êndrede sig, da han endte med at tage til Tokyo i stedet. Vi vil ogsÄ diskutere Villads' forskning inden for mikrodosimetri og hvordan dette kan bidrage til fremtidig udvikling inden for kraftforskning .Links til nÞrden:https://www.sciencedirect.com/topics/engineering/microdosimetryhttps://iopscience.iop.org/article/10.1088/1748-0221/17/03/P03006Support the showVÊrter: Villads Lundsteen Jacobsen og Tobias Wang BjergHjemmeside: https://1tingadgangen.buzzsprout.com/Facebook: https://www.facebook.com/1ting...
2023-03-30
57 min
Chino mandarĂn para todos/RTI/Taiwan
Como un tigre al que le han crecido un par de alas
Cuentos y proverbios chinos, con Paco NĂĄjar. En este espacio disfrutaremos juntos de bonitas historias, refranes y dichos de la tradiciĂłn china que se han transmitido por siglos e incluso milenios y forman ya parte de la lengua y la cultura china. En este programa les presento la expresiĂłn "como un tigre al que le han crecido un par de alas". En chino se escribe ćŠèæ·»çżŒÂ .
2022-12-21
00 min
The Girl Dad Show: A Professional Parenting Podcast
Ep #66 | Connie Chan Wang | Creating a Positive Headspace
Young & Connie discuss the importance of mental health and ways to instill positive coping techniques in your children at a young age. They talk about creating a supportive environment for kids while allowing for the freedom to find their own way. Connie also emphasizes perseverance and our ability to learn any skill, even if that skill doesnât come naturally to us.Please enjoy & subscribe! ABOUT OUR GUEST:Connie Chan Wang is a builder, storyteller, and connector who is passionate about building purpose-driven teams and businesses, telling human-centered stories that win hearts and min...
2022-11-20
45 min
1 ting ad gangen - Naturvidenskab skÄret ud i pap
(START HER) - Armageddon filmen eller NASA's DART mission: Hvem gjorde det bedst?
I filmen Armageddon fra 1998 forsÞger Bruce Willis at redde verden. Det gÞr han ved at detonere en atombombe i centrum af en asteroide pÄ vej mod jorden. NASA har den 26. september 2022 ogsÄ forsÞgt at Êndre retningen pÄ en asteroide der dog ikke var pÄ vej mod jorden. Det gjorde de for at forberede os pÄ en potentiel fremtidig asteroide der rent faktisk kunne udslette menneskeheden. Men hvilken teknologi er bedst? Bruce Willis med en atombombe eller NASA med deres DART-mission? Mere info til nÞrden:Armageddon, film, thouchstone pictures, 1998 Brown, Gregory, et al. "P1...
2022-10-28
44 min
Veien til toppen
Episode 10: Prosjekt Karanba â Hvordan kan vi hjelpe ungdommer i Brasil?
I denne episoden av Veien til toppen snakker vi om et viktig spennende prosjekt, prosjekt Karanba. Stikkordet for prosjektet er "livets lag". Det handler om muligheter, deltakelse og mestring gjennom idretten. I 2017 lette WANG etter et felles solidaritetsprosjekt pÄ tvers av skolene som vi kunne vÊre omforent om og som hadde noe med idrett Ä gjÞre. Det var her sÞken etter et prosjekt hvor WANG kunne vÊre en betydelig viktig aktÞr og samarbeidspartner. Her dukket prosjekt Karanba opp. Den viktigste personen i Karanba er en mann som hver dag jobber for at de som er en del...
2022-09-21
22 min
Veien til toppen
Episode 9: Alexander Bonsaksen - Livet som utenlandsproff
I denne episoden har vi en gjest som har gÄtt veien og blitt veldig god i sin idrett. Han har spilt hockey i eliteserien i Norge, spilt i store klubber i utlandet, spilt landslag og representert Norge bÄde i VM og OL. Alexander Bonsaksen var det andre kullet som startet med hockey pÄ WANG Toppidrett Oslo. Der tok han et valg om at han skulle bli god. Etter dette har han jobbet hardt, opplevd- og lÊrt mye. NÄ er han tilbake i Norge og det er spennende Ä snakke om tiden som utenlandsproff og veien dit.
2022-06-08
48 min
Veien til toppen
Episode 5: Slik kan barn lykkes med idrett
I denne episoden av veien til toppen skal vi snakke med to pappaer til to av verdens beste idrettsutÞvere i sin idrett. Han ene hopper lengre enn de fleste i hoppbakken og vant verdenscupen og tre medaljer i VM forrige sesong, vi snakker da om far til Halvor Granerud, Svein Granerud. Vi har ogsÄ med oss far til ei hÄndballjente som har utallige internasjonale medaljer og et ferskt VM-gull nÄ rett fÞr jul. Vi snakker selvsagt om Nora MÞrk og har derfor med oss far Morten MÞrk. Vi skal hÞre hvor viktig disse to pappa...
2022-01-19
56 min
Veien til toppen
Episode 3: Hvordan nÄ toppen i langrenn?
I denne episoden av Veien til toppen snakker vi med Fredrik Aukland og Joar Thele. De jobber begge i WANG, Joar som trener i Oslo og Fredrik som daglig leder for WANG Ung og Wang Toppidrett i TÞnsberg. Fredrik har en lang bakgrunn bÄde som langrennsutÞver og som trener for noen av verdens beste skilÞpere. I dag jobber han ogsÄ som ekspertkommentator i langrenn for NRK. Joar er tidligere europamester og verdensmester for junior i kajakk. Da han flyttet til Oslo som 20-Äring ble han interessert i langrenn. NÄ er han en av verdens beste l...
2021-11-17
50 min
Veien til toppen
Episode 1: OL-sĂžlvvinnerens lange vei til verdenstoppen
I fÞrste episode av Veien til toppen har vi besÞk av sleggekaster, Eivind Henriksen, som fikk sitt internasjonale gjennombrudd denne sesongen. 31-Äringen fra Oslo ble kjent over natten da han i sommer tok en sensasjonell sÞlvmedalje i OL. Under finalen i Tokyo, forbedret han sin norske rekord hele tre ganger og endte til slutt med Ä kaste utrolige 81,58 meter. I denne episoden skal du hÞre om Eivinds spennende reise til verdenstoppen. I studio sitter ogsÄ Veronica Undseth og HÄvard Johansen som begge jobber i WANG.
2021-09-22
30 min
Hearts in Taiwan
Raising the next generation
As English-speaking parents living outside our heritage country, we discuss how we go about passing our culture on to our kids. From seeking out Mandarin Immersion schools and caregivers, cultivating our kids' palates to appreciate Taiwanese food, and planning to visit Taiwan regularly for family vacations (when we can again!), we share whatâs worked and what hasnât across our kids ranging from age 4 to 12. Angela also shares the many new resources that have cropped up from entrepreneurs in the past 5 years to help parents like us.Links:NTNU Mandarin summer camp
2021-09-09
49 min
1 ting ad gangen - Naturvidenskab skÄret ud i pap
Film eller Fakta - Limitless
I denne episode dykker vi ned i filmen Limitless (2011) hvor hovedpersonen tager en pille der gÞr at han kan udnytte 100% af sin hjernekapacitet. I filmen pÄstÄr de nemlig at mennesket kun bruger 20% af hjernens kapacitet normalt. Men passer det overhovedet at vi kun bruger 20 % af vores hjernekapacitet? Og kan man tage en pille der kan at forbedre ens kognitive evner? Vi snakker om myten og hvor faktuel filmen er i dette afsnit af 1 ting ad gangen.Support the showVÊrter: Villads Lundsteen Jacobsen og Tobias Wang Bjerg...
2021-08-27
35 min
1 ting ad gangen - Naturvidenskab skÄret ud i pap
H.C Ărsted og opdagelsen af elektromagnetismen
I denne episode af '1 ting ad gangen' snakker vi om H.C Ărsted og hans opdagelse af elektromagnetismen. Vi diskutere hvad han prĂŠcist fandt ud af, hvordan man har arbejdet videre pĂ„ dette og hvordan vi bruge denne viden til at drive vores moderne samfund.Support the showVĂŠrter: Villads Lundsteen Jacobsen og Tobias Wang BjergHjemmeside: https://1tingadgangen.buzzsprout.com/Facebook: https://www.facebook.com/1tingadgangen/Instagram: https://www.instagram.com/1tingadgangen/
2021-05-27
46 min
1 ting ad gangen - Naturvidenskab skÄret ud i pap
Julemandens hemmeligheder
Denne gang snakker vi om den kÊre julemand og hvordan han kan rejse jorden rundt med gaver til alle artige bÞrn pÄ kun en nat! Vi bruger fysik for at forstÄ dette mirakel.Support the showVÊrter: Villads Lundsteen Jacobsen og Tobias Wang BjergHjemmeside: https://1tingadgangen.buzzsprout.com/Facebook: https://www.facebook.com/1tingadgangen/Instagram: https://www.instagram.com/1tingadgangen/
2020-12-18
49 min