Showing episodes and shows of Mathieu Virbel

Shows

Paper Brief

EP100 - Training Chain-of-Thought via Latent-Variable Inference (2023-12-06, 03 min)
EP99 - Rank-without-GPT: Building GPT-Independent Listwise Rerankers on Open-Source Large Language Models (2023-12-06, 02 min)
EP98 - Voyager: An Open-Ended Embodied Agent with Large Language Models (2023-12-05, 03 min)
EP97 - VideoSwap: Customized Video Subject Swapping with Interactive Semantic Point Correspondence (2023-12-05, 02 min)
EP96 - DeepCache: Accelerating Diffusion Models for Free (2023-12-05, 03 min)
EP95 - Segment Any 3D Gaussians (2023-12-05, 02 min)
EP94 - Fast View Synthesis of Casual Videos (2023-12-05, 03 min)
EP93 - Nash Learning from Human Feedback (2023-12-05, 02 min)
EP92 - SANeRF-HQ: Segment Anything for NeRF in High Quality (2023-12-05, 02 min)
EP91 - Segment and Caption Anything (2023-12-05, 02 min)
EP90 - TextGenSHAP: Scalable Post-hoc Explanations in Text Generation with Long Documents (2023-12-05, 03 min)
EP89 - Object Recognition as Next Token Prediction (2023-12-05, 02 min)
EP88 - VideoRF: Rendering Dynamic Radiance Fields as 2D Feature Video Streams (2023-12-05, 03 min)
EP87 - The Unlocking Spell on Base LLMs: Rethinking Alignment via In-Context Learning (2023-12-05, 02 min)
EP86 - GPS-Gaussian: Generalizable Pixel-wise 3D Gaussian Splatting for Real-time Human Novel View Synthesis (2023-12-05, 02 min)
EP85 - Customize your NeRF: Adaptive Source Driven 3D Scene Editing via Local-Global Iterative Training (2023-12-05, 03 min)
EP84 - Generative Powers of Ten (2023-12-05, 02 min)
EP83 - Using Large Language Models to Accelerate Communication for Users with Severe Motor Impairments (2023-12-05, 03 min)
EP82 - Rejuvenating image-GPT as Strong Visual Representation Learners (2023-12-05, 03 min)
EP81 - DiffiT: Diffusion Vision Transformers for Image Generation (2023-12-05, 03 min)
EP80 - Generative Rendering: Controllable 4D-Guided Video Generation with 2D Diffusion Models (2023-12-05, 02 min)
EP79 - Style Aligned Image Generation via Shared Attention (2023-12-05, 03 min)
EP78 - Magicoder: Source Code Is All You Need (2023-12-05, 02 min)
EP77 - VMC: Video Motion Customization using Temporal Attention Adaption for Text-to-Video Diffusion Models (2023-12-05, 02 min)
EP76 - RLHF-V: Towards Trustworthy MLLMs via Behavior Alignment from Fine-grained Correctional Human Feedback (2023-12-05, 02 min)
EP75 - GIVT: Generative Infinite-Vocabulary Transformers (2023-12-05, 03 min)
EP74 - FSGS: Real-Time Few-shot View Synthesis using Gaussian Splatting (2023-12-04, 04 min)
EP73 - Mamba: Linear-Time Sequence Modeling with Selective State Spaces (2023-12-04, 03 min)
EP72 - Dolphins: Multimodal Language Model for Driving (2023-12-04, 02 min)
EP71 - StyleCrafter: Enhancing Stylized Text-to-Video Generation with Style Adapter (2023-12-04, 03 min)
EP70 - Instruction-tuning Aligns LLMs to the Human Brain (2023-12-04, 02 min)
EP69 - Merlin:Empowering Multimodal LLMs with Foresight Minds (2023-12-04, 02 min)
EP68 - X-Dreamer: Creating High-quality 3D Content by Bridging the Domain Gap Between Text-to-2D and Text-to-3D Generation (2023-12-04, 03 min)
EP67 - PyNeRF: Pyramidal Neural Radiance Fields (2023-12-04, 03 min)
EP66 - MoMask: Generative Masked Modeling of 3D Human Motions (2023-12-04, 03 min)
EP65 - HiFi Tuner: High-Fidelity Subject-Driven Fine-Tuning for Diffusion Models (2023-12-04, 03 min)
EP64 - Beyond ChatBots: ExploreLLM for Structured Thoughts and Personalized Model Responses (2023-12-04, 02 min)
EP63 - Scaffold-GS: Structured 3D Gaussians for View-Adaptive Rendering (2023-12-04, 02 min)
EP62 - VideoBooth: Diffusion-based Video Generation with Image Prompts (2023-12-04, 03 min)
EP61 - Text-Guided 3D Face Synthesis -- From Generation to Editing (2023-12-04, 02 min)
EP60 - DREAM: Diffusion Rectification and Estimation-Adaptive Models (2023-12-04, 03 min)
EP59 - SeaLLMs -- Large Language Models for Southeast Asia (2023-12-04, 02 min)
EP58 - GraphDreamer: Compositional 3D Scene Synthesis from Scene Graphs (2023-12-04, 02 min)
EP57 - Towards Accurate Differential Diagnosis with Large Language Models (2023-12-04, 02 min)
EP56 - RO-LLaMA: Generalist LLM for Radiation Oncology via Noise Augmentation and Consistency Regularization (2023-11-29, 02 min)
EP55 - Advancements in Generative AI: A Comprehensive Review of GANs, GPT, Autoencoders, Diffusion Model, and Transformers (2023-11-29, 02 min)
EP54 - MEDITRON-70B: Scaling Medical Pretraining for Large Language Models (2023-11-29, 02 min)
EP53 - Global Performance Disparities Between English-Language Accents in Automatic Speech Recognition (2023-11-23, 02 min)
EP52 - Using Human Feedback to Fine-tune Diffusion Models without Any Reward Model (2023-11-23, 02 min)
EP51 - FusionFrames: Efficient Architectural Aspects for Text-to-Video Generation Pipeline (2023-11-23, 02 min)
EP50 - LucidDreamer: Domain-free Generation of 3D Gaussian Splatting Scenes (2023-11-23, 04 min)
EP49 - ZipLoRA: Any Subject in Any Style by Effectively Merging LoRAs (2023-11-23, 03 min)
EP48 - Visual In-Context Prompting (2023-11-23, 02 min)
EP47 - Diffusion360: Seamless 360 Degree Panoramic Image Generation based on Diffusion Models (2023-11-23, 02 min)
EP46 - PG-Video-LLaVA: Pixel Grounding Large Video-Language Models (2023-11-23, 02 min)
EP45 - Diffusion Model Alignment Using Direct Preference Optimization (2023-11-23, 02 min)
EP44 - GAIA: a benchmark for General AI Assistants (2023-11-23, 02 min)
EP43 - PF-LRM: Pose-Free Large Reconstruction Model for Joint Pose and Shape Prediction (2023-11-22, 02 min)
EP42 - HierSpeech++: Bridging the Gap between Semantic and Acoustic Representation of Speech by Hierarchical Variational Inference for Zero-shot Speech Synthesis (2023-11-22, 02 min)
EP41 - PhysGaussian: Physics-Integrated 3D Gaussians for Generative Dynamics (2023-11-22, 02 min)
EP40 - SuGaR: Surface-Aligned Gaussian Splatting for Efficient 3D Mesh Reconstruction and High-Quality Mesh Rendering (2023-11-22, 02 min)
EP39 - Concept Sliders: LoRA Adaptors for Precise Control in Diffusion Models (2023-11-22, 02 min)
EP38 - MagicDance: Realistic Human Dance Video Generation with Motions & Facial Expressions Transfer (2023-11-22, 02 min)
EP37 - NeuroPrompts: An Adaptive Framework to Optimize Prompts for Text-to-Image Generation (2023-11-22, 03 min)
EP36 - GPT4Motion: Scripting Physical Motions in Text-to-Video Generation via Blender-Oriented GPT Planning (2023-11-22, 03 min)
EP35 - System 2 Attention (is something you might need too) (2023-11-21, 01 min)
EP34 - Text-to-Sticker: Style Tailoring Latent Diffusion Models for Human Expression (2023-11-21, 03 min)
EP33 - LucidDreamer: Towards High-Fidelity Text-to-3D Generation via Interval Score Matching (2023-11-21, 03 min)
EP32 - TPTU-v2: Boosting Task Planning and Tool Usage of Large Language Model-based Agents in Real-world Systems (2023-11-21, 03 min)
EP31 - Make Pixels Dance: High-Dynamic Video Generation (2023-11-21, 03 min)
EP30 - Adapters: A Unified Library for Parameter-Efficient and Modular Transfer Learning (2023-11-21, 02 min)
EP29 - Orca 2: Teaching Small Language Models How to Reason (2023-11-21, 02 min)
EP28 - M$^{2}$UGen: Multi-modal Music Understanding and Generation with the Power of Large Language Models (2023-11-21, 03 min)
EP27 - AutoStory: Generating Diverse Storytelling Images with Minimal Human Effort (2023-11-21, 02 min)
EP26 - Exponentially Faster Language Modelling (2023-11-21, 03 min)
EP25 - GPQA: A Graduate-Level Google-Proof Q&A Benchmark (2023-11-21, 02 min)
EP24 - ProAgent: From Robotic Process Automation to Agentic Process Automation (2023-11-21, 03 min)
EP23 - GPT-4V(ision) for Robotics: Multimodal Task Planning from Human Demonstration (2023-11-21, 03 min)
EP22 - MultiLoRA: Democratizing LoRA for Better Multi-Task Learning (2023-11-21, 03 min)
EP21 - ToolTalk: Evaluating Tool-Usage in a Conversational Setting (2023-11-21, 02 min)
EP20 - Memory Augmented Language Models through Mixture of Word Experts (2023-11-21, 02 min)
EP19 - VideoCon: Robust Video-Language Alignment via Contrast Captions (2023-11-20, 02 min)
EP18 - I&S-ViT: An Inclusive & Stable Method for Pushing the Limit of Post-Training ViTs Quantization (2023-11-20, 03 min)
EP17 - Rethinking Attention: Exploring Shallow Feed-Forward Neural Networks as an Alternative to Attention Layers in Transformers (2023-11-20, 03 min)
EP16 - Emu Video: Factorizing Text-to-Video Generation by Explicit Image Conditioning (2023-11-20, 03 min)
EP15 - SelfEval: Leveraging the discriminative nature of generative models for evaluation (2023-11-20, 03 min)
EP14 - Distilling and Retrieving Generalizable Knowledge for Robot Manipulation via Language Corrections (2023-11-20, 03 min)
EP13 - Camels in a Changing Climate: Enhancing LM Adaptation with Tulu 2 (2023-11-20, 02 min)
EP12 - MetaDreamer: Efficient Text-to-3D Creation With Disentangling Geometry and Texture (2023-11-20, 02 min)
EP11 - Testing Language Model Agents Safely in the Wild (2023-11-20, 02 min)
EP10 - Video-LLaVA: Learning United Visual Representation by Alignment Before Projection (2023-11-20, 02 min)
EP9 - UnifiedVisionGPT: Streamlining Vision-Oriented AI through Generalized Multimodal Framework (2023-11-20, 02 min)
EP8 - UFOGen: You Forward Once Large Scale Text-to-Image Generation via Diffusion GANs (2023-11-19, 03 min)
EP7 - Adaptive Shells for Efficient Neural Radiance Field Rendering (2023-11-19, 03 min)
EP6 - Contrastive Chain-of-Thought Prompting (2023-11-19, 02 min)
EP5 - JaxMARL: Multi-Agent RL Environments in JAX (2023-11-19, 02 min)
EP4 - Tied-Lora: Enhacing parameter efficiency of LoRA with weight tying (2023-11-19, 03 min)
EP3 - Open-Sourcing Highly Capable Foundation Models: An evaluation of risks, benefits, and alternative methods for pursuing open-source objectives (2023-11-19, 02 min)
EP2 - The Chosen One: Consistent Characters in Text-to-Image Diffusion Models (2023-11-19, 02 min)
EP1 - ML-Bench: Large Language Models Leverage Open-source Libraries for Machine Learning Tasks (2023-11-19, 02 min)