podcast
details
.com
Print
Share
Look for any podcast host, guest or anyone
Search
Showing episodes and shows of
Daniel Herbera And Guillaume Fournier
Shows
AI Odyssey
From Agents to Teammates: Building Cohesive AI Squads
Meet the Aime framework—ByteDance’s fresh take on multi-agent systems that lets AI teammates think on their feet instead of following brittle, pre-planned scripts. A dynamic planner keeps adjusting the big picture, an Actor Factory spins up just-right specialist agents on demand, and a shared progress board keeps everyone in sync. In tests ranging from general reasoning (GAIA) to software bug-fixing (SWE-Bench) and live web navigation (WebVoyager), Aime consistently out-performed hand-tuned rivals—showing that flexible, reactive collaboration beats static role-play every time.This episode of IA Odyssey unpacks how Yexuan Shi and colleagues replace rigid “plan-and-execute” pipelines wi...
2025-07-19
15 min
AI Odyssey
When Machines Self-Improve: Inside the Self-Challenging AI
In this episode of IA Odyssey, we explore a bold new approach in training intelligent AI agents: letting them invent their own problems.We dive into “Self-Challenging Language Model Agents” by Yifei Zhou, Sergey Levine (UC Berkeley), Jason Weston, Xian Li, and Sainbayar Sukhbaatar (FAIR at Meta), which introduces a powerful framework called Self-Challenging Agents (SCA). Rather than relying on human-labeled tasks, this method enables AI agents to generate their own training tasks, assess their quality using executable code, and learn through reinforcement learning — all without external supervision.Using the novel Code-as-Task format, agents first act as "cha...
2025-07-16
13 min
AI Odyssey
Beyond Code: Navigating the AI Software Revolution with Andrej Karpathy
We're witnessing one of the most profound shifts in the history of software—a rapid evolution from traditional coding (Software 1.0) to neural networks (Software 2.0) and now, the dawn of Software 3.0: large language models (LLMs) programmable with simple English. Inspired by insights from Andrej Karpathy, former AI Director at Tesla, we explore how this paradigm shift reshapes the very concept of programming and its profound implications for everyone engaging with technology.From the "Iron Man" analogy, where AI augments human capabilities rather than replacing them, to the fascinating vision of LLMs as new operating systems, this episode dives de...
2025-07-05
16 min
AI Odyssey
Unlocking the Secrets: How Much Do Language Models Memorize?
Ever wondered how much information your favorite AI language models, like GPT, actually retain from their training data? In this episode of AI Odyssey, we delve into groundbreaking research by John X. Morris, Chawin Sitawarin, Chuan Guo, Narine Kokhlikyan, G. Edward Suh, Alexander M. Rush, Kamalika Chaudhuri, and Saeed Mahloujifar. The authors introduce a new method for quantifying memorization in AI, distinguishing between unintended memorization (dataset-specific information) and generalization (knowledge of underlying data patterns). With findings revealing that models like GPT have a surprising capacity of about 3.6 bits per parameter, this study explores how memorization plateaus and eventually gives...
2025-06-29
18 min
AI Odyssey
Simulating UX with AI: Introducing UXAgent
What if you could simulate a full-scale usability test—before involving a single human user? In this episode, we explore UXAgent, a groundbreaking system developed by researchers from Northeastern University, Amazon, and the University of Notre Dame. This tool leverages Large Language Models (LLMs) to create persona-driven agents that simulate real user interactions on web interfaces.UXAgent's innovative architecture mimics both fast, intuitive decisions and deeper, reflective reasoning—bringing realistic and diverse user behavior into early-stage UX testing. The system enables rapid iteration of study designs, helps identify potential flaws, and even allows interviews with simulated users....
2025-06-21
17 min
AI Odyssey
AI Agents Are Old News—Meet the Rise of Agentic AI
What if your AI didn't just follow instructions… but coordinated a whole team to solve complex problems on its own?In this episode, we dive into the fascinating shift from traditional AI Agents to a bold new paradigm: Agentic AI. Based on the eye-opening paper “AI Agents vs. Agentic AI: A Conceptual Taxonomy, Applications and Challenges”, we unpack why single-task bots like AutoGPT are already being outpaced by swarms of intelligent agents that collaborate, strategize, and adapt—almost like digital organizations.Discover how these systems are transforming research, medicine, robotics, and cybersecurity, and why Google’s new A2A pr...
2025-06-14
16 min
AI Odyssey
The Illusion of Thinking: When More Reasoning Doesn’t Mean Better Reasoning
In this episode, we explore “The Illusion of Thinking”, a thought-provoking study from Apple researchers that dives into the true capabilities—and surprising limits—of Large Reasoning Models (LRMs). Despite being designed to "think harder," these advanced AI models often fall short when problem complexity increases, failing to generalize reasoning and even reducing effort just when it’s most needed.Using controlled puzzle environments, the authors reveal a curious three-phase behavior: standard language models outperform LRMs on simple tasks, LRMs shine on moderately complex ones, but both collapse entirely under high complexity. Even with access to explicit algorithms...
2025-06-09
16 min
AI Odyssey
Smarter Prompts, Faster Results: The Power of Local Prompt Optimization
Prompting AI just got smarter. In this episode, we dive into Local Prompt Optimization (LPO) — a breakthrough approach that turbocharges prompt engineering by focusing edits on just the right words. Developed by Yash Jain and Vishal Chowdhary from Microsoft, LPO refines prompts with surgical precision, dramatically improving accuracy and speed across reasoning benchmarks like GSM8k, MultiArith, and BIG-bench Hard.Forget rewriting entire prompts. LPO reduces the optimization space, speeding up convergence and enhancing performance — even in complex production environments. We explore how this technique integrates seamlessly into existing prompt optimization methods like APE, APO, and PE2, and how...
2025-05-31
12 min
AI Odyssey
Back to Basics: Understanding AI, From Buzzwords to Reality
AI is everywhere—but what is it, really? In this episode, we cut through the noise to explore the fundamentals of artificial intelligence, from narrow AI and reactive systems to generative models, AI agents, and the emerging frontier of agentic AI. Using insights from expert sources, articles, and research papers, we break down key concepts in simple, accessible terms.You'll learn how tools like ChatGPT work under the hood, why generative AI felt like such a leap, and what it actually means for an AI to be an agent—or part of a multi-agent system. We explore the re...
2025-05-24
19 min
AI Odyssey
From Nothing to Genius: How AI Learns Without Data
What if an AI could become smarter without being taught anything? In this episode, we dive into Absolute Zero, a groundbreaking framework where an AI model trains itself to reason—without any curated data, labeled examples, or human guidance. Developed by researchers from Tsinghua, BIGAI, and Penn State, this radical approach replaces traditional training with a bold form of self-play, where the model invents its own tasks and learns by solving them.The result? Absolute Zero Reasoner (AZR) surpasses existing models that depend on tens of thousands of human-labeled examples, achieving state-of-the-art performance in math and code reaso...
2025-05-19
17 min
AI Odyssey
Unifying the AI Agent Internet: How Protocols Can Unlock Collective Intelligence
What if AI agents could collaborate as seamlessly as devices do over the Internet? In this episode, we dive into "A Survey of AI Agent Protocols" by Yingxuan Yang and colleagues from Shanghai Jiao Tong University, a landmark paper that tackles the missing piece in today’s intelligent agent landscape: standardized communication protocols. As large language model (LLM) agents spread across industries—from customer service to healthcare—they still operate in silos, struggling to integrate with tools or with one another. This paper proposes a two-dimensional classification of agent protocols and explores a future where agents form coalitions, speak common l...
2025-05-11
23 min
AI Odyssey
AI Meets Art: The Creative Revolution Unfolding
What happens when generative AI collides with human creativity? In this episode, we dive into the extraordinary transformation sweeping across visual arts, music, film, and writing—powered by tools like DALL·E, Midjourney, Suno, and ChatGPT. From text-to-image magic and AI-composed music to VFX breakthroughs and story co-writing, we explore how these innovations are democratizing access, supercharging workflows, and sparking heated debates over ethics, copyright, and what it means to be an artist. Drawing on a wide range of sources—made accessible with help from Google’s NotebookLM—we unpack how individuals and industries are adapting, and what the future of...
2025-05-04
13 min
AI Odyssey
How Real Companies Are Winning with AI
In this episode of IA Odyssey, we go beyond the AI hype and into the trenches with real-world business stories from OpenAI’s “AI in the Enterprise” guide. From Morgan Stanley's precision evals to Klarna's rapid-fire customer service, and BBVA’s bottom-up innovation strategy, we explore seven powerful lessons that show how companies are embedding AI into their workflows—not just for efficiency, but for transformation. You’ll hear how organizations are improving personalization, accelerating operations, and unlocking their teams’ potential.Whether you're curious, cautious, or already deploying AI, this deep dive offers insights you can actual...
2025-04-27
16 min
AI Odyssey
How Netflix Knows What You’ll Watch Before You Do
In this episode, we unpack how Netflix is using cutting-edge AI—similar to the tech behind ChatGPT—to power hyper-personalized recommendations. Discover how their new foundation model moves beyond traditional algorithms, blending massive data with NLP-inspired strategies like interaction tokenization and multi-token prediction. We also explore how this personalization revolution is reshaping customer expectations across industries, drawing on insights from marketing leaders like Qualtrics, Epsilon France, and Doozy Publicity. But with great AI power comes big questions: What about privacy, ethics, and the joy of unexpected discovery?Based on original sources and developed with the help of Goog...
2025-04-20
11 min
AI Odyssey
The AI That Remembers: How Memory Is Powering the Next Leap in Intelligence
What happens when AI stops forgetting? In this episode of IA Odyssey, we dive deep into OpenAI's rollout of memory in ChatGPT—and why it’s so much more than a feature toggle. From personalized ad agents to AI doctors learning on the job, we explore how memory transforms artificial intelligence into agentic AI: systems that adapt, personalize, and evolve. Drawing from cutting-edge research like KARMA, MeAgent Zero, and cognitive architecture frameworks, we unpack how memory lets AI learn from experience, get more accurate, and even form something close to relationships.
2025-04-12
20 min
AI Odyssey
Why AI Teams Fall Apart: Cracking the Code of Multi-Agent Failures
What happens when you put multiple AI agents together to solve a task? You might expect teamwork—but more often, you get chaos. In this episode of IA Odyssey, we dive into a groundbreaking study from UC Berkeley and Intesa Sanpaolo that reveals why multi-agent systems built on large language models are failing—spectacularly.The researchers examined over 150 real MAS conversations and uncovered 14 unique ways these systems break down—whether it’s agents ignoring each other, forgetting their roles, or ending tasks too early. They created MASFT, the first taxonomy to map these failures, and tested whether better pr...
2025-04-05
16 min
AI Odyssey
How DeepSeek Is Beating OpenAI at Their Own Game—On a Budget
In this episode of IA Odyssey, we unpack how DeepSeek's open-source models are shaking up the AI world—matching GPT-level performance at a fraction of the cost. Drawing on insights from the research paper by Chengen Wang (University of Texas at Dallas) and Murat Kantarcioglu (Virginia Tech), we explore DeepSeek's secret sauce: memory-efficient Multi-Head Latent Attention, an evolved Mixture of Experts architecture, and reinforcement learning without supervised data. Oh, and did we mention they trained this monster on a $ave-the-GPU budget?From hardware-aware model design to the surprisingly powerful GRPO algorithm, this episode decodes the magic that’s mak...
2025-03-29
16 min
AI Odyssey
The Rise of AI Agents: Could They Transform the Future of Work?
AI agents are revolutionizing automation—but not in the way you might think. These intelligent systems don’t just follow commands; they learn, adapt, and make decisions, reshaping industries from finance to healthcare. In this episode, we break down what makes AI agents different from traditional software, explore their growing role in our work, and dive into the game-changing potential of multi-agent systems. Are we witnessing the dawn of a new AI-powered workforce? Tune in to find out!
2025-03-18
09 min
AI Odyssey
AI vs. Wall Street – The Rise of Multi-Agent Trading
How can AI revolutionize financial trading? The TradingAgents framework introduces a multi-agent system where AI-powered analysts, researchers, and traders collaborate to make more informed investment decisions. Inspired by real-world trading firms, this innovative approach leverages specialized agents—fundamental analysts, sentiment analysts, technical analysts, and traders with diverse risk profiles—to optimize trading strategies.Unlike traditional models, TradingAgents enhances explainability, risk management, and market adaptability through agentic debates and structured decision-making. Extensive backtesting reveals significant performance improvements over standard trading strategies.Discover the future of AI-driven finance and explore the full research paper here: https://arxiv.org/abs...
2025-03-15
10 min
AI Odyssey
Agentic AI in Finance: Smarter Models, Safer Decisions
Can AI-powered teams replace traditional financial modeling workflows? This episode explores how agentic AI systems—where multiple specialized AI agents work together—are transforming financial services. Based on recent research, we break down how these AI "crews" tackle complex tasks like credit risk modeling, fraud detection, and regulatory compliance.We dive into the structure of these AI-driven teams, from model selection and hyperparameter tuning to risk assessment and bias detection. How do they compare to human-led processes? What challenges remain in ensuring fairness, transparency, and robustness in financial AI applications? Join us as we unpack the future of a...
2025-03-08
15 min
AI Odyssey
The Future of Prompting: Can AI Optimize Its Own Instructions?
Crafting the perfect prompt for large language models (LLMs) is an art—but what if AI could master it for us? This episode explores Automatic Prompt Optimization (APO), a rapidly evolving field that seeks to automate and enhance how we interact with AI. Based on a comprehensive survey, we dive into the key APO techniques, their ability to refine prompts without direct model access, and the potential for AI to fine-tune its own instructions. Could this be the key to unlocking even more powerful AI capabilities? Join us as we break down the latest research, challenges, and the future of...
2025-03-02
17 min
AI Odyssey
The AI That Reads and Remembers - Cracking the Memory Problem
One of AI’s biggest weaknesses? Memory. Today’s language models struggle with long documents, quickly losing track of crucial details. That’s a major limitation for businesses relying on AI for legal analysis, research synthesis, or strategic decision-making.Enter ReadAgent, a new system from Google DeepMind that expands an AI’s effective memory up to 20x. Inspired by how humans read, it builds a "gist memory"—capturing the essence of long texts while knowing when to retrieve key details. The result?🔹 AI that understands full reports, contracts, or meeting notes—without missing context.🔹 Smarter automation and ass...
2025-02-22
12 min
AI Odyssey
Is Learning to Code Still Worth It? AI Can Now Reason Like a Human
If AI can now outthink top programmers in competitive coding, what else can it master? OpenAI’s latest models don’t just generate code—they reason through complex problems, surpassing humans without handcrafted strategies. This breakthrough suggests AI could soon tackle fields beyond coding, from mathematics to scientific discovery. But if machines become expert problem-solvers, where does that leave us? Are we entering an era of AI-human collaboration, or are we gradually outsourcing intelligence itself? Let’s explore the future of AI reasoning—and what it means for humanity.Read the full paper here: https://arxiv.org/abs/2502.0
2025-02-17
17 min
AI Odyssey
AI is Taking Over Code Migration—Are Developers Ready?
What if AI could handle the most tedious and complex code migrations—faster and more accurately than ever before? Big tech is already making it happen, using Large Language Models (LLMs) to automate software upgrades, refactor legacy code, and eliminate years of technical debt in record time. But what does this mean for developers, companies, and the future of software engineering? In this episode, we dive into groundbreaking AI-driven code migrations, uncover surprising results, and explore how these innovations could change the way we build and maintain code forever.🔗 Full research paper: https://arxiv.org/abs/2501.06972
2025-02-09
11 min
AI Odyssey
AI Wars: OpenAI vs. DeepSeek, US vs. China
The AI arms race is heating up! OpenAI and DeepSeek are at odds over model training, NVIDIA’s stock takes a hit, and the battle for AI supremacy is reshaping global politics. In this episode, we break down OpenAI’s latest model, O3 Mini, and its surprising flaws, the ethical dilemmas surrounding AI development, and the future of jobs in a world where AI can code. Is AI a powerful ally or a looming threat? Tune in as we explore the rapid evolution of AI and what it all means for you.
2025-02-01
12 min
AI Odyssey
Smarter AI Starts Here: How Agentic RAG Changes Everything
This episode dives into the cutting-edge world of Agentic Retrieval-Augmented Generation (RAG), a transformative AI paradigm that integrates autonomous agents into retrieval and generation workflows. Drawing on a comprehensive survey, we explore how Agentic RAG enhances real-time adaptability, multi-step reasoning, and contextual understanding. From applications in healthcare to personalized education and financial analytics, discover how this innovation addresses the limitations of static AI systems while paving the way for smarter, more dynamic solutions. Thanks to the authors for their pioneering insights into this groundbreaking technology. Explore the original paper here: https://arxiv.org/pdf/2501.09136
2025-01-25
14 min
AI Odyssey
Titans: AI Inspired by Human Memory
Explore how Titans, a revolutionary neural architecture, mimics the way humans remember and manage their memories. Developed by Google researchers, this groundbreaking framework combines short-term and long-term memory modules, drawing inspiration from how the brain processes and prioritizes information. With features like adaptive forgetting and memory persistence, Titans replicate the human ability to retain crucial details while discarding irrelevant data, making them ideal for tasks like language modeling, reasoning, and genomics. Discover how this human-inspired approach enables Titans to scale to massive context sizes while maintaining efficiency and accuracy—marking a leap forward in AI design. 📖...
2025-01-18
15 min
AI Odyssey
Automating Discovery: LLM-Powered Research Labs
In this episode, we explore "Agent Laboratory," an innovative framework leveraging large language models (LLMs) to act as research assistants. Developed by a team from AMD and Johns Hopkins University, this pipeline automates the research process—from literature review and experimentation to report writing—dramatically reducing time and costs. We'll discuss how the framework integrates human feedback, generates state-of-the-art machine learning solutions, and addresses challenges like result accuracy and evaluation biases. Tune in to learn how Agent Laboratory could reshape the future of scientific discovery by turning tedious tasks into automated workflows, allowing researchers to focus on creativity and crit...
2025-01-11
16 min
AI Odyssey
Can AI Agents Survive the Real World? A Deep Dive into TheAgentCompany Benchmark
In this episode, we explore TheAgentCompany, a comprehensive benchmark designed to evaluate large language model (LLM) agents in performing realistic professional tasks. The benchmark simulates a digital workplace, featuring tasks in software engineering, project management, HR, and finance. Remarkably, even the best AI agent autonomously completes only 24% of tasks, highlighting significant gaps in AI capabilities for workplace automation. Tune in as we discuss the implications for industries, workforce automation, and AI policy, and how benchmarks like these drive AI innovation. Content creation powered by Google's NotebookLM. Link to the full research paper : https://arxiv.org/pdf/2412.14161
2025-01-05
11 min
AI Odyssey
Has OpenAI Built AI That Thinks Like Humans?
Could OpenAI’s o3 model be the breakthrough that changes everything? In this episode of IA Odyssey, we delve into how o3 shattered records on the ARC-AGI test—a benchmark designed to measure an AI’s ability to think and solve problems like a human. Previously considered nearly impossible for AI systems, the ARC-AGI test challenges models to adapt to entirely new tasks without prior training, mimicking human reasoning. We unpack what this means for the future of artificial intelligence: are we on the brink of human-level AI, or is there still a long road ahead? Tune in for a thril...
2024-12-22
11 min
AI Odyssey
AI Everywhere: Decoding Satya Nadella's Vision for the Future - version 2
Satya Nadella's keynote at Microsoft Ignite 2024 wasn't just a glimpse into the future—it was a rocket launch. In this episode, we dissect his bold predictions, including AI's warp-speed growth, the rise of multimodal interfaces, reasoning capabilities, and game-changing tool use. Nadella compares AI's transformation to pivotal moments in tech history, like the dawn of Windows and the shift to the cloud. What does that mean for you, your work, and daily life? We break it down, jargon-free. We also explore Microsoft's Copilot ecosystem, AI-powered PCs, and the exciting (and slightly mind-melting) potential of quantum computing. Nadella's fo...
2024-12-15
07 min
AI Odyssey
AI Everywhere: Decoding Satya Nadella's Vision for the Future - version 1
Satya Nadella's keynote at Microsoft Ignite 2024 wasn't just a glimpse into the future—it was a rocket launch. In this episode, we dissect his bold predictions, including AI's warp-speed growth, the rise of multimodal interfaces, reasoning capabilities, and game-changing tool use. Nadella compares AI's transformation to pivotal moments in tech history, like the dawn of Windows and the shift to the cloud. What does that mean for you, your work, and daily life? We break it down, jargon-free. We also explore Microsoft's Copilot ecosystem, AI-powered PCs, and the exciting (and slightly mind-melting) potential of quantum computing. Nadella's fo...
2024-12-15
22 min
AI Odyssey
Can AI Take on Wall Street’s Finest?
What happens when cutting-edge AI goes head-to-head with Wall Street’s top analysts? Enter FinRobot, a revolutionary AI agent designed to redefine equity research. Combining real-time data, financial modeling, and human-like judgment, FinRobot creates investment reports that rival the elite of sell-side firms. In this episode, we uncover how this open-source innovation from the AI4Finance Foundation uses multi-agent reasoning to tackle the complexities of financial markets. Could this be the start of a new era in finance, where algorithms take the lead? Link to the original paper: https://arxiv.org/abs/2411.08804
2024-11-30
11 min
AI Odyssey
Infinite Context: Unlocking Transformers for Boundless Understanding
Discover how researchers are redefining transformer models with "Infini-attention," an innovative approach that introduces compressive memory to handle infinitely long sequences without overwhelming computational resources. This episode delves into how this breakthrough enables efficient long-context modeling, solving tasks like book summarization with unprecedented input lengths and accuracy. Learn how Infini-attention bridges local and global memory while scaling transformer capabilities beyond limits, transforming the landscape of AI memory systems. Dive deeper with the original paper here: https://arxiv.org/abs/2404.07143 Crafted using insights powered by Google's NotebookLM.
2024-11-23
09 min
AI Odyssey
Evaluating AI Assistants: How Models Judge Each Other
In this episode, we dive into the cutting-edge techniques used to evaluate large language model (LLM)-based chat assistants, as detailed in the paper “Judging LLM-as-a-Judge with MT-Bench and Chatbot Arena.” The researchers explore innovative benchmarks—MT-Bench for multi-turn dialogue analysis and Chatbot Arena for crowdsourced assessments. Learn how AI models like GPT-4 are being leveraged as impartial judges to measure chatbot performance, overcoming traditional evaluation limitations. Discover the challenges, biases, and future potential of using AI to approximate human preferences. Explore the full study at https://arxiv.org/abs/2306.05685 This summary was crafted using insight...
2024-11-17
13 min
AI Odyssey
Simulating Societies: AI Agents Learning to Build Civilizations
In this episode of IA Odyssey, we explore an innovative study that pushes the boundaries of AI by simulating complex societies within the Minecraft universe. Researchers have used a new architecture, PIANO (Parallel Information Aggregation via Neural Orchestration), to allow AI agents to self-organize, develop specialized roles, and follow collective rules in large-scale social structures. These agents demonstrate autonomous decision-making, cultural exchange, and even community governance, resembling the dynamics of real human civilizations. With these advancements, the research opens new discussions on integrating AI into social environments. This episode, made possible with the support of Google NotebookLM, takes a...
2024-11-11
21 min
AI Odyssey
Mastering Prompt Engineering: From Basics to Advanced Techniques
Join us as we delve into the transformative realm of prompt engineering, a crucial aspect of enhancing the potential of large language models (LLMs). This episode explores foundational concepts, such as simple question prompts, and advances to techniques like Chain-of-Thought and Tree-of-Thought prompting. We’ll also discuss the limitations of LLMs, such as their tendency to fabricate information and lack of real-time updates, while showcasing strategies to mitigate these issues. Whether you're a beginner or looking to refine your AI expertise, this episode covers how prompt design shapes the output of models like GPT-4, and the sophisticated tools and framewo...
2024-11-03
15 min
AI Odyssey
When Machines Self-Improve: Inside the Self-Challenging AI
What if we could make AI smarter simply by creating new data for it to learn from? In this episode, we dive into a groundbreaking study by researchers at Beihang University, exploring how synthetic data—computer-generated text and examples—could be the key to training next-gen AI language models. As the demand for these models grows, real-world data just isn’t enough. This study reveals how techniques like data synthesis and augmentation can not only improve how AI models understand language but also extend their usefulness in everyday applications.We break down the main ideas, the surprising benefi...
2024-10-25
24 min
AI Odyssey
The Future of Real-Time Conversational AI
Join us as we dive into the cutting-edge world of real-time conversational AI with Moshi—a speech-to-speech foundation model that reimagines what dialogue systems can do. Forget the clunky delays and robotic responses of old: Moshi, introduced by Alexandre Défossez from Kyutai, represents the next frontier with its seamless, overlapping interactions and emotion-aware conversation flow. Curious about how Moshi achieves near-human-like latency and full-duplex communication? Tune in to explore the innovations behind Moshi, and what it means for the future of AI assistants. Learn more in the original research paper https://arxiv.org/pdf/2410.00037
2024-10-19
10 min
AI Odyssey
Self-Learning AI Agents: Breaking New Ground in Automation
In this episode, we explore Agent Hospital: A Simulacrum of Hospital with Evolvable Medical Agents, an inspiring research paper from Tsinghua University. This groundbreaking study presents a virtual hospital where AI-powered agents, acting as doctors, nurses, and patients, simulate the entire medical process. What's truly remarkable is that these intelligent agents not only manage the hospital's daily operations but also learn and improve their performance over time through continuous interaction with simulated cases. This work is a major step forward for AI, revealing unprecedented possibilities for automating complex tasks in healthcare and beyond. Source: Li, J., Wang, S...
2024-10-11
08 min
AI Odyssey
AI for Everyone: How Small Language Models Are Changing the Game
Welcome to AI Odyssey! In today's episode, we delve into "Small Language Models: Survey, Measurements, and Insights" by Zhenyan Lu, Xiang Li, Dongqi Cai, and their team from Beijing University of Posts and Telecommunications, Cambridge University, and more. We'll explore the rise of small language models (SLMs) and how they are reshaping AI accessibility on everyday devices. For more insights, access the full paper https://arxiv.org/abs/2409.09030
2024-10-03
15 min
AI Odyssey
How "Thinking Out Loud" Makes AI Smarter
In this episode, we break down a fascinating new approach that helps AI models think more like humans. Researchers Zhiyuan Li, Hong Liu, Denny Zhou, and Tengyu Ma have discovered that by guiding AI to think step-by-step — a process they call "Chain-of-Thought" (CoT) — it can tackle much tougher tasks like solving puzzles, doing math, and making complex decisions. We’ll explain how this method works and why it could be a game-changer for AI. If you’re curious about how AI can learn to think better, this episode is for you! Original Paper:"Chain of...
2024-09-29
07 min
AI Odyssey
RAG Revolution: How External Data is Supercharging AI
In the premiere episode of AI Odyssey, we tackle one of the most pressing challenges in artificial intelligence: how can we make large language models smarter and more reliable? Join us as we explore the groundbreaking paper "Retrieval Augmented Generation (RAG) and Beyond: A Comprehensive Survey on How to Make Your LLMs Use External Data More Wisely", authored by Siyun Zhao, Yuqing Yang, Zilong Wang, Zhiyuan He, Luna K. Qiu, and Lili Qiu from Microsoft Research Asia. This episode, generated with Google's NotebookLM, uncovers how integrating external data can turn powerful AI into true domain experts, minimize hallucinations, and pu...
2024-09-26
11 min