podcast
details
.com
Print
Share
Look for any podcast host, guest or anyone
Search
Showing episodes and shows of
Xintao
Shows
Daily Paper Cast
Flow-GRPO: Training Flow Matching Models via Online RL
🤗 Upvotes: 36 | cs.CV, cs.AI Authors: Jie Liu, Gongye Liu, Jiajun Liang, Yangguang Li, Jiaheng Liu, Xintao Wang, Pengfei Wan, Di Zhang, Wanli Ouyang Title: Flow-GRPO: Training Flow Matching Models via Online RL Arxiv: http://arxiv.org/abs/2505.05470v1 Abstract: We propose Flow-GRPO, the first method integrating online reinforcement learning (RL) into flow matching models. Our approach uses two key strategies: (1) an ODE-to-SDE conversion that transforms a deterministic Ordinary Differential Equation (ODE) into an equivalent Stochastic Differential Equation (SDE) that matches the original model's marginal distribution at all...
2025-05-10
23 min
Daily Paper Cast
A Survey of Interactive Generative Video
🤗 Upvotes: 31 | cs.CV Authors: Jiwen Yu, Yiran Qin, Haoxuan Che, Quande Liu, Xintao Wang, Pengfei Wan, Di Zhang, Kun Gai, Hao Chen, Xihui Liu Title: A Survey of Interactive Generative Video Arxiv: http://arxiv.org/abs/2504.21853v1 Abstract: Interactive Generative Video (IGV) has emerged as a crucial technology in response to the growing demand for high-quality, interactive video content across various domains. In this paper, we define IGV as a technology that combines generative capabilities to produce diverse high-quality video content with interactive features that enable user eng...
2025-05-03
22 min
Daily Paper Cast
CoSER: Coordinating LLM-Based Persona Simulation of Established Roles
🤗 Upvotes: 16 | cs.CL, cs.AI Authors: Xintao Wang, Heng Wang, Yifei Zhang, Xinfeng Yuan, Rui Xu, Jen-tse Huang, Siyu Yuan, Haoran Guo, Jiangjie Chen, Wei Wang, Yanghua Xiao, Shuchang Zhou Title: CoSER: Coordinating LLM-Based Persona Simulation of Established Roles Arxiv: http://arxiv.org/abs/2502.09082v1 Abstract: Role-playing language agents (RPLAs) have emerged as promising applications of large language models (LLMs). However, simulating established characters presents a challenging task for RPLAs, due to the lack of authentic character datasets and nuanced evaluation methods using such data. In this pap...
2025-02-15
22 min
Daily Paper Cast
CineMaster: A 3D-Aware and Controllable Framework for Cinematic Text-to-Video Generation
🤗 Upvotes: 29 | cs.CV Authors: Qinghe Wang, Yawen Luo, Xiaoyu Shi, Xu Jia, Huchuan Lu, Tianfan Xue, Xintao Wang, Pengfei Wan, Di Zhang, Kun Gai Title: CineMaster: A 3D-Aware and Controllable Framework for Cinematic Text-to-Video Generation Arxiv: http://arxiv.org/abs/2502.08639v1 Abstract: In this work, we present CineMaster, a novel framework for 3D-aware and controllable text-to-video generation. Our goal is to empower users with comparable controllability as professional film directors: precise placement of objects within the scene, flexible manipulation of both objects and camera in 3D space, and...
2025-02-14
23 min
Daily Paper Cast
Improving Video Generation with Human Feedback
🤗 Upvotes: 30 | cs.CV, cs.AI, cs.GR, cs.LG Authors: Jie Liu, Gongye Liu, Jiajun Liang, Ziyang Yuan, Xiaokun Liu, Mingwu Zheng, Xiele Wu, Qiulin Wang, Wenyu Qin, Menghan Xia, Xintao Wang, Xiaohong Liu, Fei Yang, Pengfei Wan, Di Zhang, Kun Gai, Yujiu Yang, Wanli Ouyang Title: Improving Video Generation with Human Feedback Arxiv: http://arxiv.org/abs/2501.13918v1 Abstract: Video generation has achieved significant advances through rectified flow techniques, but issues like unsmooth motion and misalignment between videos and prompts persist. In this work, we develop a s...
2025-01-25
24 min
Daily Paper Cast
GameFactory: Creating New Games with Generative Interactive Videos
🤗 Upvotes: 48 | cs.CV Authors: Jiwen Yu, Yiran Qin, Xintao Wang, Pengfei Wan, Di Zhang, Xihui Liu Title: GameFactory: Creating New Games with Generative Interactive Videos Arxiv: http://arxiv.org/abs/2501.08325v1 Abstract: Generative game engines have the potential to revolutionize game development by autonomously creating new content and reducing manual workload. However, existing video-based game generation methods fail to address the critical challenge of scene generalization, limiting their applicability to existing games with fixed styles and scenes. In this paper, we present GameFactory, a framework focused on exp...
2025-01-22
22 min
Daily Paper Cast
ConceptMaster: Multi-Concept Video Customization on Diffusion Transformer Models Without Test-Time Tuning
🤗 Upvotes: 10 | cs.CV Authors: Yuzhou Huang, Ziyang Yuan, Quande Liu, Qiulin Wang, Xintao Wang, Ruimao Zhang, Pengfei Wan, Di Zhang, Kun Gai Title: ConceptMaster: Multi-Concept Video Customization on Diffusion Transformer Models Without Test-Time Tuning Arxiv: http://arxiv.org/abs/2501.04698v1 Abstract: Text-to-video generation has made remarkable advancements through diffusion models. However, Multi-Concept Video Customization (MCVC) remains a significant challenge. We identify two key challenges in this task: 1) the identity decoupling problem, where directly adopting existing customization methods inevitably mix attributes when handling multiple concepts simultaneously, and 2) the sca...
2025-01-14
23 min
Daily Paper Cast
SynCamMaster: Synchronizing Multi-Camera Video Generation from Diverse Viewpoints
🤗 Upvotes: 36 | cs.CV Authors: Jianhong Bai, Menghan Xia, Xintao Wang, Ziyang Yuan, Xiao Fu, Zuozhu Liu, Haoji Hu, Pengfei Wan, Di Zhang Title: SynCamMaster: Synchronizing Multi-Camera Video Generation from Diverse Viewpoints Arxiv: http://arxiv.org/abs/2412.07760v1 Abstract: Recent advancements in video diffusion models have shown exceptional abilities in simulating real-world dynamics and maintaining 3D consistency. This progress inspires us to investigate the potential of these models to ensure dynamic consistency across various viewpoints, a highly desirable feature for applications such as virtual filming. Unlike existing methods foc...
2024-12-13
21 min
Daily Paper Cast
StyleMaster: Stylize Your Video with Artistic Generation and Translation
🤗 Upvotes: 14 | cs.CV Authors: Zixuan Ye, Huijuan Huang, Xintao Wang, Pengfei Wan, Di Zhang, Wenhan Luo Title: StyleMaster: Stylize Your Video with Artistic Generation and Translation Arxiv: http://arxiv.org/abs/2412.07744v1 Abstract: Style control has been popular in video generation models. Existing methods often generate videos far from the given style, cause content leakage, and struggle to transfer one video to the desired style. Our first observation is that the style extraction stage matters, whereas existing methods emphasize global style but ignore local textures. In order to...
2024-12-13
23 min
Daily Paper Cast
3DTrajMaster: Mastering 3D Trajectory for Multi-Entity Motion in Video Generation
🤗 Upvotes: 17 | cs.CV Authors: Xiao Fu, Xian Liu, Xintao Wang, Sida Peng, Menghan Xia, Xiaoyu Shi, Ziyang Yuan, Pengfei Wan, Di Zhang, Dahua Lin Title: 3DTrajMaster: Mastering 3D Trajectory for Multi-Entity Motion in Video Generation Arxiv: http://arxiv.org/abs/2412.07759v1 Abstract: This paper aims to manipulate multi-entity 3D motions in video generation. Previous methods on controllable video generation primarily leverage 2D control signals to manipulate object motions and have achieved remarkable synthesis results. However, 2D control signals are inherently limited in expressing the 3D nature of obj...
2024-12-12
23 min
Papers Read on AI
InstantMesh: Efficient 3D Mesh Generation from a Single Image with Sparse-view Large Reconstruction Models
We present InstantMesh, a feed-forward framework for instant 3D mesh generation from a single image, featuring state-of-the-art generation quality and significant training scalability. By synergizing the strengths of an off-the-shelf multiview diffusion model and a sparse-view reconstruction model based on the LRM architecture, InstantMesh is able to create diverse 3D assets within 10 seconds. To enhance the training efficiency and exploit more geometric supervisions, e.g, depths and normals, we integrate a differentiable iso-surface extraction module into our framework and directly optimize on the mesh representation. Experimental results on public datasets demonstrate that InstantMesh significantly outperforms other latest image-to-3D baselines...
2024-04-18
20 min
二分电台
#26 大串台之一起聊聊新加坡
“捕蛇者说”的主播 laike9m 来到了新加坡!我和捕蛇者说的 xintao、laike9m 以及代码之外的 勾股 在现场录制了一起节目,一起畅聊了有关新加坡和加州的方方面面~ 00:12 大串台开场:捕蛇者说 laike9m 和 xintao、代码之外 勾股、二分电台 AB 01:37 laike9m 对新加坡的第一印象 & 和加州的天气对比 06:56 本期真正的主题:对新加坡最喜欢和最不喜欢的地方 07:17 勾股:第一个喜欢的点是“轻松”,少有分心的事情(更专注) 11:42 华人社会的教育“怕输”论 13:2 AB:换个视角看看不同上升渠道的新加坡社会 16:07 勾股:对新加坡教育的双面观察(既卷,也有世界前几的人均教育资源) 20:37 Laike9m: 从同事身上了解到的加州教育情况(小朋友都在学 Python) 22:08 xintao: GovTech 公司 与 政府网站的体验讨论 25:52 Cooling Singapore 项目 28:07 美国的公共服务情况 29:47 新加坡的人口规模与填岛计划 33:12 加州的堂食分享,包括“白人饭”、墨西哥菜、中餐、越南餐等等 39:12 新加坡食材吐槽,鸡肉、猪肉、急冻海鲜、批发与零售的区别 43:24 新加坡的“食阁”分类:小贩中心 Hawker Center、开放咖啡店 Coffee Shop、狭义食阁或有空调的连锁咖啡店例如 Kopitiam 和 FoodRepublic 45:52 时代的光谱:新加坡的一些福建广东食物仿佛看到了几十年前的影子(例如红龟粿) 47:16 新加坡的多民族融合政策与大家长式的管理 51:31 新加坡最大的福利:HDB,准购要求与制度。(顺便分享最贵的 HDB 之一 Pinnacle) 57:18 新加坡的让人不喜欢的地方:第一个点就是贵(通涨、车子、啤酒、旅游、追星) 01:02:58 对比其他移民国家,新加坡拿到绿卡前的几年比较难熬(各方面成本较高) 01:04:12 除了贵,对现在想来新加坡旅游和工作的朋友,我们会劝退吗? 01:06:57 新加坡绿卡申请是个黑盒(内部政策会有很多动态性和不确定性) 01:11:07 新加坡绿卡续签更加严格 01:11:57 美国和新加坡的看病区别 01:19:02 新加坡的休闲娱乐生活:美食、徒步、健身撸铁、游泳、烧烤 party、看世界各地的新电影(基本都有中文字幕)、大量演唱会脱口秀商业比赛(亚洲和欧美来的都有)、周边旅游(海岛、雨林、爬山、日韩台) 01:29:22 在新加坡办各地签证特别方便 01:32:03 人人夸的樟宜机场 01:33:52 xintao:徒步、骑行、马拉松活动的设计 01:36:07 新加坡政府的健康生活倡导:Apps、糖分盐分控制、各类球场和基础设施、户外健身操 01:44:27 新加坡的技术 Meetup 和 Conference 01:49:07 新加坡的互联网发展和近年中国大陆人才输出有关系吗? 01:52:26 总结时间:每个主播的工作生活和本次聊天的感受 参考链接: PISA 2022 results Understanding Singapore Math 新加坡副总理尚达曼打脸BBC主持人四十分钟全文 李光耀观天下 新加坡饮料健康等级系统 Measures for Nutri-Grade Beverages ,另外上海也试行饮料营养分级 运动换取代金券的 App Lumihealth Stranger Soccer National Parks Visa Technology Traineeship Program 串台链接: 代码之外 捕蛇者说 音频处理: 西市独柳工作室 相关信息: 公众号:Android高效开发、南瓜饼日常 二分电台官网 关于和版权信息 AB 的联系方式: 关于 AB
2024-04-15
1h 57
Oriente Press
Parole d'ordine: sicurezza e ricchezza condivisa
https://ogzero.org/tag/cina/Bri, Idailu, è ormai qualcosa di acquisito che procede per inerzia o comunque proprio il mondo meno interconnesso non ne prevede più tanto i presupposti; ma comunque è una proposta di Xi e quindi in qualche modo bisogna continuare a investire nella Nuova via della seta, come dimostrano gli accordi di Cosco di questi giorni per quote di controllo del porto di Amburgo. Sgomberiamo subito il campo degli argomenti che mandano in solluchero il mainstream poco avvezzo alla reale società cinese in cui troviamo invece immerso Gabriele Battaglia (@Chen_the_Tramp), che ci parla dal suo covo...
2022-10-28
45 min
捕蛇者说
Ep 35. 和 Gray 聊聊那些年遇到的神奇 Bug
如果喜欢我们的节目,欢迎通过爱发电打赏支持:https://afdian.net/@pythonhunter 最近我们和 Gray 聊了聊 Debug,会分成上下两期,这一期我们主要聊了一些遇到的神奇 bug,以及解决的思路,下一期,我们会聊 debug 的一些工具。 嘉宾 Gray 主播 Manjusaka laike9m laixintao 时间线 03:02 Gray 遇到的 HTTP 下载文件的 bug 10:59 Xintao 遇到的 HTTP 跳转 HTTPS 问题 15:37 Manjusaka 遇到的 Python Asyncio 的问题 24:08 Laike9m: 不要浪费太多时间在一个 bug 上,即时寻求帮助 25:20 Xintao 语雀编辑器的一个神奇的 bug 33:40 监控的 P99 毛刺现象 35:30 uwsgi 中 hping3 信号丢失的一个bug 42:30 推荐 git 的 bisect 工具 44:52 laike9m 遇到的 latency 问题 52:43 有关 Python 的 malloc 的一个问题 57:24 Golang 1.12 内存泄漏? 60:00 Python 中 re.complie cache 的行为问题 链接 git bisect Debug 一个在 uWSGI 下使用 subprocess 卡住的问题 | 卡瓦邦噶! 爱发电上赞助
2022-01-13
1h 04
捕蛇者说
Ep 32. 和李辉聊聊自由职业(上)
如果喜欢我们的节目,欢迎通过爱发电打赏支持:https://afdian.net/@pythonhunter 嘉宾 李辉 主播 Adam Wen 小白 laixintao laike9m 时间线 00:01:21 自由职业经历 00:04:54 找工作的失败经历 00:08:01 编程视频课程 00:10:13 看视频学编程 00:11:31 做外包的奇葩遭遇 00:17:26 小白的职业规划 00:19:45 今天星期几 & 晚上几点睡 & 闹钟哪家强? 00:25:47 毕业后先不要立刻开始工作? 00:28:10 推荐环节 00:28:32 李辉的推荐 00:30:56 xintao 没有什么要推荐 00:31:32 Adam 的推荐 00:34:03 「阿里五型人格」(阿里巴巴的小白兔、野狗、大牛、老牛、老白兔,分别指代什么??) 00:34:50 小白的推荐 00:36:35 laike9m 的推荐 00:37:21 嘉宾自带环节:你未来 3~5 年的阶段目标 00:38:01 李辉的阶段目标 00:40:15 laike9m 的阶段目标 00:42:16 小白的阶段目标 00:45:19 xintao 的阶段目标 00:46:17 知识管理工具 00:49:19 Adam 的阶段目标 00:52:27 结语 链接 00:28:32 娱乐至死 00:31:32 奈飞文化手册 00:31:58 No Rules Rules 00:34:50 Python 神经网络编程 00:36:35 Async Python is Not Faster 00:36:56 Ignore All Web Performance Benchmarks, Including This One 爱发电上赞助
2021-09-11
53 min
本账号已停更,请重新搜索并订阅
Ep 26. 和 xintao 聊聊新加坡的工作与生活
2021-03-09
1h 15
捕蛇者说
Ep 26. 和 xintao 聊聊新加坡的工作与生活
如果喜欢我们的节目,欢迎通过爱发电打赏支持:https://afdian.net/@pythonhunter 主播 Manjusaka laike9m laixintao 时间轴 00:02:00 为什么 xintao 会离开阿里? 00:22:43 办理新加坡签证 00:28:30 新加坡的生活成本和税收 00:29:57 在新加坡租房 00:43:20 新加坡的日常生活 00:58:17 应对诈骗 01:03:13 xintao 在 Shopee 的工作,Shopee 的公司文化 01:06:06 如何进入 Shopee 工作? 01:11:05 Manjusaka 的招人广告 链接 What is Site Reliability Engineering (SRE)? Google December 2020 services outage 智能运维系列(一)| AIOps 的崛起与实践 关于《Fluent Python》中文版中“期物”这个翻译的讨论 组屋 我在新加坡一个月的生活费明细 - by laixintao Join Shopee & Work with Me! - xintao 的内推链接 PyCon US 2021 爱发电上赞助
2021-03-07
1h 15