Look for any podcast host, guest or anyone
Showing episodes and shows of

Anlie Arnaudy

Shows

AI OdysseyAI OdysseyFrom Agents to Teammates: Building Cohesive AI SquadsMeet the Aime framework—ByteDance’s fresh take on multi-agent systems that lets AI teammates think on their feet instead of following brittle, pre-planned scripts. A dynamic planner keeps adjusting the big picture, an Actor Factory spins up just-right specialist agents on demand, and a shared progress board keeps everyone in sync. In tests ranging from general reasoning (GAIA) to software bug-fixing (SWE-Bench) and live web navigation (WebVoyager), Aime consistently out-performed hand-tuned rivals—showing that flexible, reactive collaboration beats static role-play every time.This episode of IA Odyssey unpacks how Yexuan Shi and colleagues replace rigid “plan-and-execute” pipelines wi...2025-07-1915 minAI OdysseyAI OdysseyWhen Machines Self-Improve: Inside the Self-Challenging AIIn this episode of IA Odyssey, we explore a bold new approach in training intelligent AI agents: letting them invent their own problems.We dive into “Self-Challenging Language Model Agents” by Yifei Zhou, Sergey Levine (UC Berkeley), Jason Weston, Xian Li, and Sainbayar Sukhbaatar (FAIR at Meta), which introduces a powerful framework called Self-Challenging Agents (SCA). Rather than relying on human-labeled tasks, this method enables AI agents to generate their own training tasks, assess their quality using executable code, and learn through reinforcement learning — all without external supervision.Using the novel Code-as-Task format, agents first act as "cha...2025-07-1613 minAI OdysseyAI OdysseyBeyond Code: Navigating the AI Software Revolution with Andrej KarpathyWe're witnessing one of the most profound shifts in the history of software—a rapid evolution from traditional coding (Software 1.0) to neural networks (Software 2.0) and now, the dawn of Software 3.0: large language models (LLMs) programmable with simple English. Inspired by insights from Andrej Karpathy, former AI Director at Tesla, we explore how this paradigm shift reshapes the very concept of programming and its profound implications for everyone engaging with technology.From the "Iron Man" analogy, where AI augments human capabilities rather than replacing them, to the fascinating vision of LLMs as new operating systems, this episode dives de...2025-07-0516 minAI OdysseyAI OdysseyUnlocking the Secrets: How Much Do Language Models Memorize?Ever wondered how much information your favorite AI language models, like GPT, actually retain from their training data? In this episode of AI Odyssey, we delve into groundbreaking research by John X. Morris, Chawin Sitawarin, Chuan Guo, Narine Kokhlikyan, G. Edward Suh, Alexander M. Rush, Kamalika Chaudhuri, and Saeed Mahloujifar. The authors introduce a new method for quantifying memorization in AI, distinguishing between unintended memorization (dataset-specific information) and generalization (knowledge of underlying data patterns). With findings revealing that models like GPT have a surprising capacity of about 3.6 bits per parameter, this study explores how memorization plateaus and eventually gives...2025-06-2918 minAI OdysseyAI OdysseySimulating UX with AI: Introducing UXAgentWhat if you could simulate a full-scale usability test—before involving a single human user? In this episode, we explore UXAgent, a groundbreaking system developed by researchers from Northeastern University, Amazon, and the University of Notre Dame. This tool leverages Large Language Models (LLMs) to create persona-driven agents that simulate real user interactions on web interfaces.UXAgent's innovative architecture mimics both fast, intuitive decisions and deeper, reflective reasoning—bringing realistic and diverse user behavior into early-stage UX testing. The system enables rapid iteration of study designs, helps identify potential flaws, and even allows interviews with simulated users....2025-06-2117 minAI OdysseyAI OdysseyAI Agents Are Old News—Meet the Rise of Agentic AIWhat if your AI didn't just follow instructions… but coordinated a whole team to solve complex problems on its own?In this episode, we dive into the fascinating shift from traditional AI Agents to a bold new paradigm: Agentic AI. Based on the eye-opening paper “AI Agents vs. Agentic AI: A Conceptual Taxonomy, Applications and Challenges”, we unpack why single-task bots like AutoGPT are already being outpaced by swarms of intelligent agents that collaborate, strategize, and adapt—almost like digital organizations.Discover how these systems are transforming research, medicine, robotics, and cybersecurity, and why Google’s new A2A pr...2025-06-1416 minAI OdysseyAI OdysseyThe Illusion of Thinking: When More Reasoning Doesn’t Mean Better ReasoningIn this episode, we explore “The Illusion of Thinking”, a thought-provoking study from Apple researchers that dives into the true capabilities—and surprising limits—of Large Reasoning Models (LRMs). Despite being designed to "think harder," these advanced AI models often fall short when problem complexity increases, failing to generalize reasoning and even reducing effort just when it’s most needed.Using controlled puzzle environments, the authors reveal a curious three-phase behavior: standard language models outperform LRMs on simple tasks, LRMs shine on moderately complex ones, but both collapse entirely under high complexity. Even with access to explicit algorithms...2025-06-0916 minAI OdysseyAI OdysseySmarter Prompts, Faster Results: The Power of Local Prompt OptimizationPrompting AI just got smarter. In this episode, we dive into Local Prompt Optimization (LPO) — a breakthrough approach that turbocharges prompt engineering by focusing edits on just the right words. Developed by Yash Jain and Vishal Chowdhary from Microsoft, LPO refines prompts with surgical precision, dramatically improving accuracy and speed across reasoning benchmarks like GSM8k, MultiArith, and BIG-bench Hard.Forget rewriting entire prompts. LPO reduces the optimization space, speeding up convergence and enhancing performance — even in complex production environments. We explore how this technique integrates seamlessly into existing prompt optimization methods like APE, APO, and PE2, and how...2025-05-3112 minAI OdysseyAI OdysseyBack to Basics: Understanding AI, From Buzzwords to RealityAI is everywhere—but what is it, really? In this episode, we cut through the noise to explore the fundamentals of artificial intelligence, from narrow AI and reactive systems to generative models, AI agents, and the emerging frontier of agentic AI. Using insights from expert sources, articles, and research papers, we break down key concepts in simple, accessible terms.You'll learn how tools like ChatGPT work under the hood, why generative AI felt like such a leap, and what it actually means for an AI to be an agent—or part of a multi-agent system. We explore the re...2025-05-2419 minAI OdysseyAI OdysseyFrom Nothing to Genius: How AI Learns Without DataWhat if an AI could become smarter without being taught anything? In this episode, we dive into Absolute Zero, a groundbreaking framework where an AI model trains itself to reason—without any curated data, labeled examples, or human guidance. Developed by researchers from Tsinghua, BIGAI, and Penn State, this radical approach replaces traditional training with a bold form of self-play, where the model invents its own tasks and learns by solving them.The result? Absolute Zero Reasoner (AZR) surpasses existing models that depend on tens of thousands of human-labeled examples, achieving state-of-the-art performance in math and code reaso...2025-05-1917 minAI OdysseyAI OdysseyUnifying the AI Agent Internet: How Protocols Can Unlock Collective IntelligenceWhat if AI agents could collaborate as seamlessly as devices do over the Internet? In this episode, we dive into "A Survey of AI Agent Protocols" by Yingxuan Yang and colleagues from Shanghai Jiao Tong University, a landmark paper that tackles the missing piece in today’s intelligent agent landscape: standardized communication protocols. As large language model (LLM) agents spread across industries—from customer service to healthcare—they still operate in silos, struggling to integrate with tools or with one another. This paper proposes a two-dimensional classification of agent protocols and explores a future where agents form coalitions, speak common l...2025-05-1123 minAI OdysseyAI OdysseyAI Meets Art: The Creative Revolution UnfoldingWhat happens when generative AI collides with human creativity? In this episode, we dive into the extraordinary transformation sweeping across visual arts, music, film, and writing—powered by tools like DALL·E, Midjourney, Suno, and ChatGPT. From text-to-image magic and AI-composed music to VFX breakthroughs and story co-writing, we explore how these innovations are democratizing access, supercharging workflows, and sparking heated debates over ethics, copyright, and what it means to be an artist. Drawing on a wide range of sources—made accessible with help from Google’s NotebookLM—we unpack how individuals and industries are adapting, and what the future of...2025-05-0413 minAI OdysseyAI OdysseyHow Real Companies Are Winning with AIIn this episode of IA Odyssey, we go beyond the AI hype and into the trenches with real-world business stories from OpenAI’s “AI in the Enterprise” guide. From Morgan Stanley's precision evals to Klarna's rapid-fire customer service, and BBVA’s bottom-up innovation strategy, we explore seven powerful lessons that show how companies are embedding AI into their workflows—not just for efficiency, but for transformation. You’ll hear how organizations are improving personalization, accelerating operations, and unlocking their teams’ potential.Whether you're curious, cautious, or already deploying AI, this deep dive offers insights you can actual...2025-04-2716 minAI OdysseyAI OdysseyHow Netflix Knows What You’ll Watch Before You DoIn this episode, we unpack how Netflix is using cutting-edge AI—similar to the tech behind ChatGPT—to power hyper-personalized recommendations. Discover how their new foundation model moves beyond traditional algorithms, blending massive data with NLP-inspired strategies like interaction tokenization and multi-token prediction. We also explore how this personalization revolution is reshaping customer expectations across industries, drawing on insights from marketing leaders like Qualtrics, Epsilon France, and Doozy Publicity. But with great AI power comes big questions: What about privacy, ethics, and the joy of unexpected discovery?Based on original sources and developed with the help of Goog...2025-04-2011 minAI OdysseyAI OdysseyThe AI That Remembers: How Memory Is Powering the Next Leap in IntelligenceWhat happens when AI stops forgetting? In this episode of IA Odyssey, we dive deep into OpenAI's rollout of memory in ChatGPT—and why it’s so much more than a feature toggle. From personalized ad agents to AI doctors learning on the job, we explore how memory transforms artificial intelligence into agentic AI: systems that adapt, personalize, and evolve. Drawing from cutting-edge research like KARMA, MeAgent Zero, and cognitive architecture frameworks, we unpack how memory lets AI learn from experience, get more accurate, and even form something close to relationships.2025-04-1220 minAI OdysseyAI OdysseyWhy AI Teams Fall Apart: Cracking the Code of Multi-Agent FailuresWhat happens when you put multiple AI agents together to solve a task? You might expect teamwork—but more often, you get chaos. In this episode of IA Odyssey, we dive into a groundbreaking study from UC Berkeley and Intesa Sanpaolo that reveals why multi-agent systems built on large language models are failing—spectacularly.The researchers examined over 150 real MAS conversations and uncovered 14 unique ways these systems break down—whether it’s agents ignoring each other, forgetting their roles, or ending tasks too early. They created MASFT, the first taxonomy to map these failures, and tested whether better pr...2025-04-0516 minAI OdysseyAI OdysseyHow DeepSeek Is Beating OpenAI at Their Own Game—On a BudgetIn this episode of IA Odyssey, we unpack how DeepSeek's open-source models are shaking up the AI world—matching GPT-level performance at a fraction of the cost. Drawing on insights from the research paper by Chengen Wang (University of Texas at Dallas) and Murat Kantarcioglu (Virginia Tech), we explore DeepSeek's secret sauce: memory-efficient Multi-Head Latent Attention, an evolved Mixture of Experts architecture, and reinforcement learning without supervised data. Oh, and did we mention they trained this monster on a $ave-the-GPU budget?From hardware-aware model design to the surprisingly powerful GRPO algorithm, this episode decodes the magic that’s mak...2025-03-2916 minAI OdysseyAI OdysseyThe Rise of AI Agents: Could They Transform the Future of Work?AI agents are revolutionizing automation—but not in the way you might think. These intelligent systems don’t just follow commands; they learn, adapt, and make decisions, reshaping industries from finance to healthcare. In this episode, we break down what makes AI agents different from traditional software, explore their growing role in our work, and dive into the game-changing potential of multi-agent systems. Are we witnessing the dawn of a new AI-powered workforce? Tune in to find out!2025-03-1809 minAI OdysseyAI OdysseyAI vs. Wall Street – The Rise of Multi-Agent TradingHow can AI revolutionize financial trading? The TradingAgents framework introduces a multi-agent system where AI-powered analysts, researchers, and traders collaborate to make more informed investment decisions. Inspired by real-world trading firms, this innovative approach leverages specialized agents—fundamental analysts, sentiment analysts, technical analysts, and traders with diverse risk profiles—to optimize trading strategies.Unlike traditional models, TradingAgents enhances explainability, risk management, and market adaptability through agentic debates and structured decision-making. Extensive backtesting reveals significant performance improvements over standard trading strategies.Discover the future of AI-driven finance and explore the full research paper here: https://arxiv.org/abs...2025-03-1510 minAI OdysseyAI OdysseyAgentic AI in Finance: Smarter Models, Safer DecisionsCan AI-powered teams replace traditional financial modeling workflows? This episode explores how agentic AI systems—where multiple specialized AI agents work together—are transforming financial services. Based on recent research, we break down how these AI "crews" tackle complex tasks like credit risk modeling, fraud detection, and regulatory compliance.We dive into the structure of these AI-driven teams, from model selection and hyperparameter tuning to risk assessment and bias detection. How do they compare to human-led processes? What challenges remain in ensuring fairness, transparency, and robustness in financial AI applications? Join us as we unpack the future of a...2025-03-0815 minAI OdysseyAI OdysseyThe Future of Prompting: Can AI Optimize Its Own Instructions?Crafting the perfect prompt for large language models (LLMs) is an art—but what if AI could master it for us? This episode explores Automatic Prompt Optimization (APO), a rapidly evolving field that seeks to automate and enhance how we interact with AI. Based on a comprehensive survey, we dive into the key APO techniques, their ability to refine prompts without direct model access, and the potential for AI to fine-tune its own instructions. Could this be the key to unlocking even more powerful AI capabilities? Join us as we break down the latest research, challenges, and the future of...2025-03-0217 minAI OdysseyAI OdysseyThe AI That Reads and Remembers - Cracking the Memory ProblemOne of AI’s biggest weaknesses? Memory. Today’s language models struggle with long documents, quickly losing track of crucial details. That’s a major limitation for businesses relying on AI for legal analysis, research synthesis, or strategic decision-making.Enter ReadAgent, a new system from Google DeepMind that expands an AI’s effective memory up to 20x. Inspired by how humans read, it builds a "gist memory"—capturing the essence of long texts while knowing when to retrieve key details. The result?🔹 AI that understands full reports, contracts, or meeting notes—without missing context.🔹 Smarter automation and ass...2025-02-2212 minAI OdysseyAI OdysseyIs Learning to Code Still Worth It? AI Can Now Reason Like a HumanIf AI can now outthink top programmers in competitive coding, what else can it master? OpenAI’s latest models don’t just generate code—they reason through complex problems, surpassing humans without handcrafted strategies. This breakthrough suggests AI could soon tackle fields beyond coding, from mathematics to scientific discovery. But if machines become expert problem-solvers, where does that leave us? Are we entering an era of AI-human collaboration, or are we gradually outsourcing intelligence itself? Let’s explore the future of AI reasoning—and what it means for humanity.Read the full paper here: https://arxiv.org/abs/2502.02025-02-1717 minAI OdysseyAI OdysseyAI is Taking Over Code Migration—Are Developers Ready?What if AI could handle the most tedious and complex code migrations—faster and more accurately than ever before? Big tech is already making it happen, using Large Language Models (LLMs) to automate software upgrades, refactor legacy code, and eliminate years of technical debt in record time. But what does this mean for developers, companies, and the future of software engineering? In this episode, we dive into groundbreaking AI-driven code migrations, uncover surprising results, and explore how these innovations could change the way we build and maintain code forever.🔗 Full research paper: https://arxiv.org/abs/2501.069722025-02-0911 minAI OdysseyAI OdysseyAI Wars: OpenAI vs. DeepSeek, US vs. ChinaThe AI arms race is heating up! OpenAI and DeepSeek are at odds over model training, NVIDIA’s stock takes a hit, and the battle for AI supremacy is reshaping global politics. In this episode, we break down OpenAI’s latest model, O3 Mini, and its surprising flaws, the ethical dilemmas surrounding AI development, and the future of jobs in a world where AI can code. Is AI a powerful ally or a looming threat? Tune in as we explore the rapid evolution of AI and what it all means for you.2025-02-0112 minAI OdysseyAI OdysseySmarter AI Starts Here: How Agentic RAG Changes EverythingThis episode dives into the cutting-edge world of Agentic Retrieval-Augmented Generation (RAG), a transformative AI paradigm that integrates autonomous agents into retrieval and generation workflows. Drawing on a comprehensive survey, we explore how Agentic RAG enhances real-time adaptability, multi-step reasoning, and contextual understanding. From applications in healthcare to personalized education and financial analytics, discover how this innovation addresses the limitations of static AI systems while paving the way for smarter, more dynamic solutions. Thanks to the authors for their pioneering insights into this groundbreaking technology. Explore the original paper here: https://arxiv.org/pdf/2501.091362025-01-2514 minAI OdysseyAI OdysseyTitans: AI Inspired by Human MemoryExplore how Titans, a revolutionary neural architecture, mimics the way humans remember and manage their memories. Developed by Google researchers, this groundbreaking framework combines short-term and long-term memory modules, drawing inspiration from how the brain processes and prioritizes information. With features like adaptive forgetting and memory persistence, Titans replicate the human ability to retain crucial details while discarding irrelevant data, making them ideal for tasks like language modeling, reasoning, and genomics. Discover how this human-inspired approach enables Titans to scale to massive context sizes while maintaining efficiency and accuracy—marking a leap forward in AI design. 📖...2025-01-1815 minAI OdysseyAI OdysseyAutomating Discovery: LLM-Powered Research LabsIn this episode, we explore "Agent Laboratory," an innovative framework leveraging large language models (LLMs) to act as research assistants. Developed by a team from AMD and Johns Hopkins University, this pipeline automates the research process—from literature review and experimentation to report writing—dramatically reducing time and costs. We'll discuss how the framework integrates human feedback, generates state-of-the-art machine learning solutions, and addresses challenges like result accuracy and evaluation biases. Tune in to learn how Agent Laboratory could reshape the future of scientific discovery by turning tedious tasks into automated workflows, allowing researchers to focus on creativity and crit...2025-01-1116 minAI OdysseyAI OdysseyCan AI Agents Survive the Real World? A Deep Dive into TheAgentCompany BenchmarkIn this episode, we explore TheAgentCompany, a comprehensive benchmark designed to evaluate large language model (LLM) agents in performing realistic professional tasks. The benchmark simulates a digital workplace, featuring tasks in software engineering, project management, HR, and finance. Remarkably, even the best AI agent autonomously completes only 24% of tasks, highlighting significant gaps in AI capabilities for workplace automation. Tune in as we discuss the implications for industries, workforce automation, and AI policy, and how benchmarks like these drive AI innovation. Content creation powered by Google's NotebookLM. Link to the full research paper : https://arxiv.org/pdf/2412.141612025-01-0511 minAI OdysseyAI OdysseyHas OpenAI Built AI That Thinks Like Humans?Could OpenAI’s o3 model be the breakthrough that changes everything? In this episode of IA Odyssey, we delve into how o3 shattered records on the ARC-AGI test—a benchmark designed to measure an AI’s ability to think and solve problems like a human. Previously considered nearly impossible for AI systems, the ARC-AGI test challenges models to adapt to entirely new tasks without prior training, mimicking human reasoning. We unpack what this means for the future of artificial intelligence: are we on the brink of human-level AI, or is there still a long road ahead? Tune in for a thril...2024-12-2211 minAI OdysseyAI OdysseyAI Everywhere: Decoding Satya Nadella's Vision for the Future - version 2Satya Nadella's keynote at Microsoft Ignite 2024 wasn't just a glimpse into the future—it was a rocket launch. In this episode, we dissect his bold predictions, including AI's warp-speed growth, the rise of multimodal interfaces, reasoning capabilities, and game-changing tool use. Nadella compares AI's transformation to pivotal moments in tech history, like the dawn of Windows and the shift to the cloud. What does that mean for you, your work, and daily life? We break it down, jargon-free. We also explore Microsoft's Copilot ecosystem, AI-powered PCs, and the exciting (and slightly mind-melting) potential of quantum computing. Nadella's fo...2024-12-1507 minAI OdysseyAI OdysseyAI Everywhere: Decoding Satya Nadella's Vision for the Future - version 1Satya Nadella's keynote at Microsoft Ignite 2024 wasn't just a glimpse into the future—it was a rocket launch. In this episode, we dissect his bold predictions, including AI's warp-speed growth, the rise of multimodal interfaces, reasoning capabilities, and game-changing tool use. Nadella compares AI's transformation to pivotal moments in tech history, like the dawn of Windows and the shift to the cloud. What does that mean for you, your work, and daily life? We break it down, jargon-free. We also explore Microsoft's Copilot ecosystem, AI-powered PCs, and the exciting (and slightly mind-melting) potential of quantum computing. Nadella's fo...2024-12-1522 minAI OdysseyAI OdysseyCan AI Take on Wall Street’s Finest?What happens when cutting-edge AI goes head-to-head with Wall Street’s top analysts? Enter FinRobot, a revolutionary AI agent designed to redefine equity research. Combining real-time data, financial modeling, and human-like judgment, FinRobot creates investment reports that rival the elite of sell-side firms. In this episode, we uncover how this open-source innovation from the AI4Finance Foundation uses multi-agent reasoning to tackle the complexities of financial markets. Could this be the start of a new era in finance, where algorithms take the lead? Link to the original paper: https://arxiv.org/abs/2411.088042024-11-3011 minAI OdysseyAI OdysseyInfinite Context: Unlocking Transformers for Boundless UnderstandingDiscover how researchers are redefining transformer models with "Infini-attention," an innovative approach that introduces compressive memory to handle infinitely long sequences without overwhelming computational resources. This episode delves into how this breakthrough enables efficient long-context modeling, solving tasks like book summarization with unprecedented input lengths and accuracy. Learn how Infini-attention bridges local and global memory while scaling transformer capabilities beyond limits, transforming the landscape of AI memory systems. Dive deeper with the original paper here:  https://arxiv.org/abs/2404.07143 Crafted using insights powered by Google's NotebookLM.2024-11-2309 minAI OdysseyAI OdysseyEvaluating AI Assistants: How Models Judge Each OtherIn this episode, we dive into the cutting-edge techniques used to evaluate large language model (LLM)-based chat assistants, as detailed in the paper “Judging LLM-as-a-Judge with MT-Bench and Chatbot Arena.” The researchers explore innovative benchmarks—MT-Bench for multi-turn dialogue analysis and Chatbot Arena for crowdsourced assessments. Learn how AI models like GPT-4 are being leveraged as impartial judges to measure chatbot performance, overcoming traditional evaluation limitations. Discover the challenges, biases, and future potential of using AI to approximate human preferences. Explore the full study at https://arxiv.org/abs/2306.05685 This summary was crafted using insight...2024-11-1713 minAI OdysseyAI OdysseySimulating Societies: AI Agents Learning to Build CivilizationsIn this episode of IA Odyssey, we explore an innovative study that pushes the boundaries of AI by simulating complex societies within the Minecraft universe. Researchers have used a new architecture, PIANO (Parallel Information Aggregation via Neural Orchestration), to allow AI agents to self-organize, develop specialized roles, and follow collective rules in large-scale social structures. These agents demonstrate autonomous decision-making, cultural exchange, and even community governance, resembling the dynamics of real human civilizations. With these advancements, the research opens new discussions on integrating AI into social environments. This episode, made possible with the support of Google NotebookLM, takes a...2024-11-1121 minAI OdysseyAI OdysseyMastering Prompt Engineering: From Basics to Advanced TechniquesJoin us as we delve into the transformative realm of prompt engineering, a crucial aspect of enhancing the potential of large language models (LLMs). This episode explores foundational concepts, such as simple question prompts, and advances to techniques like Chain-of-Thought and Tree-of-Thought prompting. We’ll also discuss the limitations of LLMs, such as their tendency to fabricate information and lack of real-time updates, while showcasing strategies to mitigate these issues. Whether you're a beginner or looking to refine your AI expertise, this episode covers how prompt design shapes the output of models like GPT-4, and the sophisticated tools and framewo...2024-11-0315 minAI OdysseyAI OdysseyWhen Machines Self-Improve: Inside the Self-Challenging AIWhat if we could make AI smarter simply by creating new data for it to learn from? In this episode, we dive into a groundbreaking study by researchers at Beihang University, exploring how synthetic data—computer-generated text and examples—could be the key to training next-gen AI language models. As the demand for these models grows, real-world data just isn’t enough. This study reveals how techniques like data synthesis and augmentation can not only improve how AI models understand language but also extend their usefulness in everyday applications.We break down the main ideas, the surprising benefi...2024-10-2524 minAI OdysseyAI OdysseyThe Future of Real-Time Conversational AIJoin us as we dive into the cutting-edge world of real-time conversational AI with Moshi—a speech-to-speech foundation model that reimagines what dialogue systems can do. Forget the clunky delays and robotic responses of old: Moshi, introduced by Alexandre Défossez from Kyutai, represents the next frontier with its seamless, overlapping interactions and emotion-aware conversation flow. Curious about how Moshi achieves near-human-like latency and full-duplex communication? Tune in to explore the innovations behind Moshi, and what it means for the future of AI assistants. Learn more in the original research paper https://arxiv.org/pdf/2410.000372024-10-1910 minAI OdysseyAI OdysseySelf-Learning AI Agents: Breaking New Ground in AutomationIn this episode, we explore Agent Hospital: A Simulacrum of Hospital with Evolvable Medical Agents, an inspiring research paper from Tsinghua University. This groundbreaking study presents a virtual hospital where AI-powered agents, acting as doctors, nurses, and patients, simulate the entire medical process. What's truly remarkable is that these intelligent agents not only manage the hospital's daily operations but also learn and improve their performance over time through continuous interaction with simulated cases. This work is a major step forward for AI, revealing unprecedented possibilities for automating complex tasks in healthcare and beyond. Source: Li, J., Wang, S...2024-10-1108 minAI OdysseyAI OdysseyAI for Everyone: How Small Language Models Are Changing the GameWelcome to AI Odyssey! In today's episode, we delve into "Small Language Models: Survey, Measurements, and Insights" by Zhenyan Lu, Xiang Li, Dongqi Cai, and their team from Beijing University of Posts and Telecommunications, Cambridge University, and more. We'll explore the rise of small language models (SLMs) and how they are reshaping AI accessibility on everyday devices. For more insights, access the full paper https://arxiv.org/abs/2409.090302024-10-0315 minAI OdysseyAI OdysseyHow "Thinking Out Loud" Makes AI SmarterIn this episode, we break down a fascinating new approach that helps AI models think more like humans. Researchers Zhiyuan Li, Hong Liu, Denny Zhou, and Tengyu Ma have discovered that by guiding AI to think step-by-step — a process they call "Chain-of-Thought" (CoT) — it can tackle much tougher tasks like solving puzzles, doing math, and making complex decisions. We’ll explain how this method works and why it could be a game-changer for AI. If you’re curious about how AI can learn to think better, this episode is for you! Original Paper:"Chain of...2024-09-2907 minAI OdysseyAI OdysseyRAG Revolution: How External Data is Supercharging AIIn the premiere episode of AI Odyssey, we tackle one of the most pressing challenges in artificial intelligence: how can we make large language models smarter and more reliable? Join us as we explore the groundbreaking paper "Retrieval Augmented Generation (RAG) and Beyond: A Comprehensive Survey on How to Make Your LLMs Use External Data More Wisely", authored by Siyun Zhao, Yuqing Yang, Zilong Wang, Zhiyuan He, Luna K. Qiu, and Lili Qiu from Microsoft Research Asia. This episode, generated with Google's NotebookLM, uncovers how integrating external data can turn powerful AI into true domain experts, minimize hallucinations, and pu...2024-09-2611 min