Mathieu Virbel - Podcast Details

Shows

Paper BriefEP100 - Training Chain-of-Thought via Latent-Variable Inference 2023-12-0603 min

Paper BriefEP99 - Rank-without-GPT: Building GPT-Independent Listwise Rerankers on Open-Source Large Language Models 2023-12-0602 min

Paper BriefEP98 - Voyager: An Open-Ended Embodied Agent with Large Language Models 2023-12-0503 min

Paper BriefEP97 - VideoSwap: Customized Video Subject Swapping with Interactive Semantic Point Correspondence 2023-12-0502 min

Paper BriefEP96 - DeepCache: Accelerating Diffusion Models for Free 2023-12-0503 min

Paper BriefEP95 - Segment Any 3D Gaussians 2023-12-0502 min

Paper BriefEP94 - Fast View Synthesis of Casual Videos 2023-12-0503 min

Paper BriefEP93 - Nash Learning from Human Feedback 2023-12-0502 min

Paper BriefEP92 - SANeRF-HQ: Segment Anything for NeRF in High Quality 2023-12-0502 min

Paper BriefEP91 - Segment and Caption Anything 2023-12-0502 min

Paper BriefEP90 - TextGenSHAP: Scalable Post-hoc Explanations in Text Generation with Long Documents 2023-12-0503 min

Paper BriefEP89 - Object Recognition as Next Token Prediction 2023-12-0502 min

Paper BriefEP88 - VideoRF: Rendering Dynamic Radiance Fields as 2D Feature Video Streams 2023-12-0503 min

Paper BriefEP87 - The Unlocking Spell on Base LLMs: Rethinking Alignment via In-Context Learning 2023-12-0502 min

Paper BriefEP86 - GPS-Gaussian: Generalizable Pixel-wise 3D Gaussian Splatting for Real-time Human Novel View Synthesis 2023-12-0502 min

Paper BriefEP85 - Customize your NeRF: Adaptive Source Driven 3D Scene Editing via Local-Global Iterative Training 2023-12-0503 min

Paper BriefEP84 - Generative Powers of Ten 2023-12-0502 min

Paper BriefEP83 - Using Large Language Models to Accelerate Communication for Users with Severe Motor Impairments 2023-12-0503 min

Paper BriefEP82 - Rejuvenating image-GPT as Strong Visual Representation Learners 2023-12-0503 min

Paper BriefEP81 - DiffiT: Diffusion Vision Transformers for Image Generation 2023-12-0503 min

Paper BriefEP80 - Generative Rendering: Controllable 4D-Guided Video Generation with 2D Diffusion Models 2023-12-0502 min

Paper BriefEP79 - Style Aligned Image Generation via Shared Attention 2023-12-0503 min

Paper BriefEP78 - Magicoder: Source Code Is All You Need 2023-12-0502 min

Paper BriefEP77 - VMC: Video Motion Customization using Temporal Attention Adaption for Text-to-Video Diffusion Models 2023-12-0502 min

Paper BriefEP76 - RLHF-V: Towards Trustworthy MLLMs via Behavior Alignment from Fine-grained Correctional Human Feedback 2023-12-0502 min

Paper BriefEP75 - GIVT: Generative Infinite-Vocabulary Transformers 2023-12-0503 min

Paper BriefEP74 - FSGS: Real-Time Few-shot View Synthesis using Gaussian Splatting 2023-12-0404 min

Paper BriefEP73 - Mamba: Linear-Time Sequence Modeling with Selective State Spaces 2023-12-0403 min

Paper BriefEP72 - Dolphins: Multimodal Language Model for Driving 2023-12-0402 min

Paper BriefEP71 - StyleCrafter: Enhancing Stylized Text-to-Video Generation with Style Adapter 2023-12-0403 min

Paper BriefEP70 - Instruction-tuning Aligns LLMs to the Human Brain 2023-12-0402 min

Paper BriefEP69 - Merlin:Empowering Multimodal LLMs with Foresight Minds 2023-12-0402 min

Paper BriefEP68 - X-Dreamer: Creating High-quality 3D Content by Bridging the Domain Gap Between Text-to-2D and Text-to-3D Generation 2023-12-0403 min

Paper BriefEP67 - PyNeRF: Pyramidal Neural Radiance Fields 2023-12-0403 min

Paper BriefEP66 - MoMask: Generative Masked Modeling of 3D Human Motions 2023-12-0403 min

Paper BriefEP65 - HiFi Tuner: High-Fidelity Subject-Driven Fine-Tuning for Diffusion Models 2023-12-0403 min

Paper BriefEP64 - Beyond ChatBots: ExploreLLM for Structured Thoughts and Personalized Model Responses 2023-12-0402 min

Paper BriefEP63 - Scaffold-GS: Structured 3D Gaussians for View-Adaptive Rendering 2023-12-0402 min

Paper BriefEP62 - VideoBooth: Diffusion-based Video Generation with Image Prompts 2023-12-0403 min

Paper BriefEP61 - Text-Guided 3D Face Synthesis -- From Generation to Editing 2023-12-0402 min

Paper BriefEP60 - DREAM: Diffusion Rectification and Estimation-Adaptive Models 2023-12-0403 min

Paper BriefEP59 - SeaLLMs -- Large Language Models for Southeast Asia 2023-12-0402 min

Paper BriefEP58 - GraphDreamer: Compositional 3D Scene Synthesis from Scene Graphs 2023-12-0402 min

Paper BriefEP57 - Towards Accurate Differential Diagnosis with Large Language Models 2023-12-0402 min

Paper BriefEP56 - RO-LLaMA: Generalist LLM for Radiation Oncology via Noise Augmentation and Consistency Regularization 2023-11-2902 min

Paper BriefEP55 - Advancements in Generative AI: A Comprehensive Review of GANs, GPT, Autoencoders, Diffusion Model, and Transformers 2023-11-2902 min

Paper BriefEP54 - MEDITRON-70B: Scaling Medical Pretraining for Large Language Models 2023-11-2902 min

Paper BriefEP53 - Global Performance Disparities Between English-Language Accents in Automatic Speech Recognition 2023-11-2302 min

Paper BriefEP52 - Using Human Feedback to Fine-tune Diffusion Models without Any Reward Model 2023-11-2302 min

Paper BriefEP51 - FusionFrames: Efficient Architectural Aspects for Text-to-Video Generation Pipeline 2023-11-2302 min

Paper BriefEP50 - LucidDreamer: Domain-free Generation of 3D Gaussian Splatting Scenes 2023-11-2304 min

Paper BriefEP49 - ZipLoRA: Any Subject in Any Style by Effectively Merging LoRAs 2023-11-2303 min

Paper BriefEP48 - Visual In-Context Prompting 2023-11-2302 min

Paper BriefEP47 - Diffusion360: Seamless 360 Degree Panoramic Image Generation based on Diffusion Models 2023-11-2302 min

Paper BriefEP46 - PG-Video-LLaVA: Pixel Grounding Large Video-Language Models 2023-11-2302 min

Paper BriefEP45 - Diffusion Model Alignment Using Direct Preference Optimization 2023-11-2302 min

Paper BriefEP44 - GAIA: a benchmark for General AI Assistants 2023-11-2302 min

Paper BriefEP43 - PF-LRM: Pose-Free Large Reconstruction Model for Joint Pose and Shape Prediction 2023-11-2202 min

Paper BriefEP42 - HierSpeech++: Bridging the Gap between Semantic and Acoustic Representation of Speech by Hierarchical Variational Inference for Zero-shot Speech Synthesis 2023-11-2202 min

Paper BriefEP41 - PhysGaussian: Physics-Integrated 3D Gaussians for Generative Dynamics 2023-11-2202 min

Paper BriefEP40 - SuGaR: Surface-Aligned Gaussian Splatting for Efficient 3D Mesh Reconstruction and High-Quality Mesh Rendering 2023-11-2202 min

Paper BriefEP39 - Concept Sliders: LoRA Adaptors for Precise Control in Diffusion Models 2023-11-2202 min

Paper BriefEP38 - MagicDance: Realistic Human Dance Video Generation with Motions & Facial Expressions Transfer 2023-11-2202 min

Paper BriefEP37 - NeuroPrompts: An Adaptive Framework to Optimize Prompts for Text-to-Image Generation 2023-11-2203 min

Paper BriefEP36 - GPT4Motion: Scripting Physical Motions in Text-to-Video Generation via Blender-Oriented GPT Planning 2023-11-2203 min

Paper BriefEP35 - System 2 Attention (is something you might need too)2023-11-2101 min

Paper BriefEP34 - Text-to-Sticker: Style Tailoring Latent Diffusion Models for Human Expression 2023-11-2103 min

Paper BriefEP33 - LucidDreamer: Towards High-Fidelity Text-to-3D Generation via Interval Score Matching 2023-11-2103 min

Paper BriefEP32 - TPTU-v2: Boosting Task Planning and Tool Usage of Large Language Model-based Agents in Real-world Systems 2023-11-2103 min

Paper BriefEP31 - Make Pixels Dance: High-Dynamic Video Generation 2023-11-2103 min

Paper BriefEP30 - Adapters: A Unified Library for Parameter-Efficient and Modular Transfer Learning 2023-11-2102 min

Paper BriefEP29 - Orca 2: Teaching Small Language Models How to Reason 2023-11-2102 min

Paper BriefEP28 - M$^{2}$UGen: Multi-modal Music Understanding and Generation with the Power of Large Language Models 2023-11-2103 min

Paper BriefEP27 - AutoStory: Generating Diverse Storytelling Images with Minimal Human Effort 2023-11-2102 min

Paper BriefEP26 - Exponentially Faster Language Modelling 2023-11-2103 min

Paper BriefEP25 - GPQA: A Graduate-Level Google-Proof Q&A Benchmark 2023-11-2102 min

Paper BriefEP24 - ProAgent: From Robotic Process Automation to Agentic Process Automation 2023-11-2103 min

Paper BriefEP23 - GPT-4V(ision) for Robotics: Multimodal Task Planning from Human Demonstration 2023-11-2103 min

Paper BriefEP22 - MultiLoRA: Democratizing LoRA for Better Multi-Task Learning 2023-11-2103 min

Paper BriefEP21 - ToolTalk: Evaluating Tool-Usage in a Conversational Setting 2023-11-2102 min

Paper BriefEP20 - Memory Augmented Language Models through Mixture of Word Experts 2023-11-2102 min

Paper BriefEP19 - VideoCon: Robust Video-Language Alignment via Contrast Captions 2023-11-2002 min

Paper BriefEP18 - I&S-ViT: An Inclusive & Stable Method for Pushing the Limit of Post-Training ViTs Quantization 2023-11-2003 min

Paper BriefEP17 - Rethinking Attention: Exploring Shallow Feed-Forward Neural Networks as an Alternative to Attention Layers in Transformers 2023-11-2003 min

Paper BriefEP16 - Emu Video: Factorizing Text-to-Video Generation by Explicit Image Conditioning 2023-11-2003 min

Paper BriefEP15 - SelfEval: Leveraging the discriminative nature of generative models for evaluation 2023-11-2003 min

Paper BriefEP14 - Distilling and Retrieving Generalizable Knowledge for Robot Manipulation via Language Corrections 2023-11-2003 min

Paper BriefEP13 - Camels in a Changing Climate: Enhancing LM Adaptation with Tulu 2 2023-11-2002 min

Paper BriefEP12 - MetaDreamer: Efficient Text-to-3D Creation With Disentangling Geometry and Texture 2023-11-2002 min

Paper BriefEP11 - Testing Language Model Agents Safely in the Wild 2023-11-2002 min

Paper BriefEP10 - Video-LLaVA: Learning United Visual Representation by Alignment Before Projection 2023-11-2002 min

Paper BriefEP9 - UnifiedVisionGPT: Streamlining Vision-Oriented AI through Generalized Multimodal Framework 2023-11-2002 min

Paper BriefEP8 - UFOGen: You Forward Once Large Scale Text-to-Image Generation via Diffusion GANs 2023-11-1903 min

Paper BriefEP7 - Adaptive Shells for Efficient Neural Radiance Field Rendering 2023-11-1903 min

Paper BriefEP6 - Contrastive Chain-of-Thought Prompting 2023-11-1902 min

Paper BriefEP5 - JaxMARL: Multi-Agent RL Environments in JAX 2023-11-1902 min

Paper BriefEP4 - Tied-Lora: Enhacing parameter efficiency of LoRA with weight tying 2023-11-1903 min

Paper BriefEP3 - Open-Sourcing Highly Capable Foundation Models: An evaluation of risks, benefits, and alternative methods for pursuing open-source objectives 2023-11-1902 min

Paper BriefEP2 - The Chosen One: Consistent Characters in Text-to-Image Diffusion Models 2023-11-1902 min

Paper BriefEP1 - ML-Bench: Large Language Models Leverage Open-source Libraries for Machine Learning Tasks 2023-11-1902 min