Earkind - Podcast Details

Shows

GPT ReviewsChatGPT Browsing 🌐 // Meta's AI on Social Media 🤖 // VQ-VAE Made Simple 💡ChatGPT can now browse the internet to provide users with current information, but concerns about accuracy and reliability remain. Meta has introduced social profiles for its AIs, allowing users to interact with them directly on Instagram, Messenger, and WhatsApp. The paper "Finite Scalar Quantization: VQ-VAE Made Simple" proposes a simpler alternative to vector quantization in VAEs, which could be a game-changer for the field. "Show-1: Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation" proposes a hybrid model that combines pixel-based and latent-based VDMs for more efficient and accurate text-to-video generation. Contact: sergi@earkind.com Ti...2023-09-2915 min

GPT ReviewsMeta's AI Everywhere 🤖 // Factually Augmented RLHF 🤔 // Vision-language models 🌅OpenAI CEO Sam Altman caused a stir by posting a comment that AGI has been achieved internally, only to edit the post later to dispel the news. Meta is bringing its AI assistant to WhatsApp, Messenger, and Instagram, and releasing dozens of AI characters based on celebrities like MrBeast and Charli D’Amelio. "Aligning Large Multimodal Models with Factually Augmented RLHF" proposes a new approach to address the issue of misalignment between modalities in large multimodal models, achieving a remarkable 94% performance level on the LLaVA-Bench dataset. "InternLM-XComposer" is a vision-language large model that enables advanced image-text comprehension and composition, seaml...2023-09-2815 min

GPT ReviewsAI Girlfriends 🤖 // AI for Competitive Intelligence 💼 // Large Language Models & Causality 📊The rise of AI girlfriends and their impact on America's future population is a thought-provoking topic discussed in this episode. The use of AI for competitive intelligence and its advantages for companies is explored in-depth, with a focus on a new company called Prelaunch.com. The papers discussed in this episode shed light on important topics such as understanding large language models, training instabilities, and causality for machine learning. The insights provided by the experts on the show offer valuable perspectives on the implications of AI for various industries and society as a whole. Contact: sergi@earkind.co...2023-09-2714 min

GPT ReviewsChatGPT Can Now See, Hear, and Speak 🗣️ // Amazon Invests $4B in AI Chatbot Rival 🤑 // Spotify Clones Podcasters' Voices 🎙️OpenAI's ChatGPT-4 can now see, hear, and speak, making it more intuitive for daily use. Amazon is investing up to $4 billion in OpenAI rival Anthropic, which is developing a new AI chatbot called Claude 2. Spotify is partnering with OpenAI to clone podcasters' voices and translate their shows into other languages. Lastly, Text2Reward is a new framework that automates the generation of dense reward functions for reinforcement learning. Contact: sergi@earkind.com Timestamps: 00:34 Introduction 01:32 ChatGPT can now see, hear, and speak: Announcing GPT-4 multimodal 03:16 Amazon to invest up to $4bn i...2023-09-2614 min

GPT ReviewsMeta's Sassy Robot 🤖 // Google's Bard Extensions 📧 // MetaMath for Math Reasoning 🔢Meta's plan to release a "sassy robot" for younger users and concerns about potential implications of these chatbots. Google's new Bard Extensions feature and its performance in retrieving emails and drafting responses. Research on language models with Claude's long context window and the need for models that can effectively use information from the middle of long contexts. The development of MetaMath, a fine-tuned language model that specializes in mathematical reasoning, and its superior performance on mathematical reasoning benchmarks. Contact: sergi@earkind.com Timestamps: 00:34 Introduction 01:33 Meta’s AI chatbot plan includes a ‘sassy r...2023-09-2514 min

GPT ReviewsOpenAI Wins Lawsuit 🧑‍⚖️ // Human vs. Model Carbon Footprint 🌿 // Chain-of-Verification ⛓️OpenAI's privacy lawsuit has been dismissed, but the company is still facing legal controversies. YouTube is introducing new AI-powered tools for creators, including AI-generated backgrounds and personalized music recommendations. We also discuss the potential impact of open source AI on language and image models, and a new study that shows AI systems have a much lower carbon footprint than humans when it comes to tasks like writing and illustrating. Contact: sergi@earkind.com Timestamps: 00:34 Introduction 01:25 A lawsuit alleging privacy violations by OpenAI was dismissed 03:01 YouTube is going all in on A...2023-09-2214 min

GPT ReviewsDALL-E 3 meets ChatGPT 🖼️ // Google's BART Extensions for Workplace 📈 // Neuralink's Clinical Trial 🧠OpenAI's latest release of DALL-E 3 integrates with ChatGPT, making it more accessible for people who struggle with prompts. Google's BART Extensions for the Workplace uses a reinforced learning model to pull relevant information from all their Google tools and services, potentially setting it apart from its competitors. Neuralink has opened recruitment for their first-in-human clinical trial for their brain-computer interface, which aims to help those with paralysis control external devices with their thoughts. The research papers discussed in this episode explore the compression capabilities of large language models, a new text generation method called Contrastive Decoding, and Compositional Foundation...2023-09-2114 min

GPT ReviewsMicrosoft's Data Scandal 💻 // UK's AI Principles for Transparency 🇬🇧 // Efficient Language Models 🚀Microsoft's AI research team accidentally exposed 38 terabytes of private data while publishing open-source training data on GitHub, posing a significant security risk. The UK's new AI principles focus on accountability and transparency, seeking views from leading AI developers and governments to ensure the development and use of foundation models evolves in a way that promotes competition and protects consumers. Two new papers explore ways to improve the efficiency and quality of large language models, including a new inference scheme called self-speculative decoding and the ability to prune pretraining data while still retaining performance. A third paper introduces a new...2023-09-2013 min

GPT ReviewsAmazon's Kindle Limits 📖 // Interactive AI Future 🤖 // Scaling Sparsely-Connected Models 🔍Amazon is limiting new Kindle books due to the rapid evolution of generative AI, which has flooded the market with low-quality content. DeepMind's co-founder believes that interactive AI is the future, which can carry out tasks by calling on other software and people to get things done. "Compositional Foundation Models for Hierarchical Planning" proposes a solution for effective decision-making in novel environments with long-horizon goals. "Scaling Laws for Sparsely-Connected Foundation Models" explores the impact of parameter sparsity on the scaling behavior of transformers trained on massive datasets, which can lead to more efficient and scalable models in the future.2023-09-1915 min

GPT ReviewsGoogle's Gemini vs. OpenAI's GPT-4 🤖 // AI-generated medieval village 🏰 // Autonomous Language Agents 🗣️Google is preparing to release their latest AI software Gemini, which aims to compete with OpenAI's GPT-4 model, and can be used for everything from chatbots to generating original text, music lyrics, and news stories. "Generative Image Dynamics" is a paper from Google Research that focuses on creating a model for scene dynamics in images, which can be used to turn still images into seamlessly looping dynamic videos or allow users to realistically interact with objects in real pictures. "Agents: An Open-source Framework for Autonomous Language Agents" is an open-source library that makes it easier for non-specialists to build...2023-09-1814 min

GPT ReviewsAI x Amazon 🛍️ // OpenAI in Dublin 🌍 // LLMs for Compiler Optimization 💻Stable Audio, a new text-to-audio generative AI platform, uses a diffusion model trained with audio to create background music for podcasts or videos. Amazon has launched a new generative AI tool to help sellers write better product descriptions, which dramatically improves the listing creation and management experience for sellers. OpenAI is opening an office in Dublin, Ireland, to collaborate with the government and industry to advance AI development and deployment. Three papers were discussed, including the use of Large Language Models (LLMs) to optimize code, MagiCapture's personalization method for generating high-resolution portrait images, and Statistical Rejection Sampling Optimization (RSO) to...2023-09-1514 min

GPT ReviewsApple's Hidden AI 🍎 // Musk's Federal AI Dept. Proposal 🏛️ // Efficient MoE & Memory Management 🚀Apple's use of AI in its new devices, including a new chip that includes improved data crunching capabilities and a four-core "Neural Engine" that can process machine learning tasks up to twice as quickly. Elon Musk's proposal for a federal department of AI after his Capitol Hill summit, citing the potential harm of unchecked AI development. Cutting-edge research on efficient memory management, mesa-optimization algorithms, and extremely parameter-efficient MoE. The proposed solutions to challenges in serving large language models efficiently, including PagedAttention and vLLM. Contact: sergi@earkind.com Timestamps: 00:34 Introduction 01:41 AI quietly res...2023-09-1414 min

GPT ReviewsTextbooks Are All You Need 📚 // Coca-Cola AI Flavor 🥤 // Robot Parkour 🦿NVIDIA is teaming up with leaders to discuss AI standards, Coca-Cola has released a new zero sugar flavor created with AI, Microsoft Research has introduced a new 1.3 billion parameter model named phi-1.5, and Google DeepMind and Google Research have introduced MADLAD-400, a manually audited, general domain 3T token monolingual dataset based on CommonCrawl, spanning 419 languages. Tune in to hear more about these exciting developments in the world of AI. Contact: sergi@earkind.com Timestamps: 00:34 Introduction 01:57 NVIDIA Lends Support to Washington’s Efforts to Ensure AI Safety 03:30 Coca-Cola Drops a Zero S...2023-09-1314 min

GPT ReviewsResponsible AI at G20 🤝 // Meta’s Chatbot Model Goal 👥 // Explaining Grokking in Neural Networks 🧠G20's reaffirmation of responsible AI use, Meta's plans for a new chatbot model, Google DeepMind's explanation for grokking in neural networks, and a system for automatically generating high-quality audiobooks from online e-books. Contact: sergi@earkind.com Timestamps: 00:34 Introduction 01:36 G20 nations reaffirm responsible use and development of AI technology 03:13 Meta sets GPT-4 as the bar for its next AI model, says a new report 04:48 Deep Neural Nets: 33 years ago and 33 years from now 06:29 Fake sponsor 08:25 Explaining grokking through circuit efficiency 10:02 Large-Scale Automat...2023-09-1214 min

GPT ReviewsPentagon's AI Fleet vs. China 🇺🇸 // OpenAI's Writing Detectors Fail ❌ // Emergent Abilities in Large Language Models 🤖From the Pentagon's plans for a vast AI fleet to counter the China threat, to OpenAI's confirmation that AI writing detectors just don't cut it, we cover it all. We also explore the emergent abilities in large language models and their potential to revolutionize optimization in various fields, as well as ImageBind-LLM, a new method for tuning large language models with multi-modality instructions. Contact: sergi@earkind.com Timestamps: 00:34 Introduction 01:46 Pentagon Plans Vast AI Fleet to Counter China Threat 03:01 OpenAI confirms that AI writing detectors don’t work 04:46 Why is...2023-09-1113 min

GPT ReviewsTIME-100 AI List 🕰️ // Open-vocabulary Vision Models 👀 // Retrieval-Augmented Generation 📚The episode covers cutting-edge AI research on vision and language models, including a new pretraining methodology for open-vocabulary object detection and a physically grounded VLM for robotic manipulation tasks. The show also features two interesting papers on DSPy, a framework for working with language models and retrieval models, and Verba, an open-source initiative for retrieval-augmented generation applications. The crew discusses the TIME100 Most Influential People in AI, highlighting the significance of generative AI and the ethical questions surrounding its development. Contact: sergi@earkind.com Timestamps: 00:34 Introduction 01:38 How We Chose the TIME100 Mos...2023-09-0815 min

GPT ReviewsOpenAI DevConference 🌟 // Zoom's AI Assistant 💬 // RNNs Implementing Attention 🤯OpenAI announces their first developer conference, Zoom debuts an AI assistant, and we explore the discovery that certain RNNs might be implementing attention under the hood. We also discuss Sequential Dexterity, a system that chains multiple dexterous policies for achieving long-horizon task goals. Contact: sergi@earkind.com Timestamps: 00:34 Introduction 01:37 OpenAI to hold its first developer conference on November 6 in San Francisco 03:03 Zoom Debuts AI assistant 04:39 Jason Wei Tweets about alternative altmetrics 06:28 Fake sponsor 08:44 Gated recurrent neural networks discover attention 10:07 One Wide Feedfor...2023-09-0714 min

GPT ReviewsAI-Generated Sex Workers on TikTok 🫣 // Amazon One's Generative AI 🤲 // BatchPrompt 🥐Amazon One for convenient payment and verification, the concerning trend of AI-generated sex workers flooding social media platforms, OpenAI's legal battle over ChatGPT's alleged use of pirated books, and a promising solution to the challenge of catastrophic forgetting in Continual Learning with LGCL. Contact: sergi@earkind.com Timestamps: 00:34 Introduction 01:33 How generative AI helped train Amazon One to recognize your palm 02:55 Ads for AI sex workers are flooding Instagram and TikTok 04:07 OpenAI disputes authors’ claims that every ChatGPT response is a derivative work 05:39 Fake sponsor 07:33...2023-09-0613 min

GPT ReviewsTwitter Will Train on Your Data 🏋️ // Teaching with AI 🧑‍🏫 // LLMs for Speech and Language 🗣️Twitter's updated privacy policy and how they plan to use public data to train their AI models. We also dive into OpenAI's Guide to Teaching with AI and explore the potential benefits and limitations of using AI in education. Additionally, we highlight some cutting-edge research papers on large language and speech models, a unified speech tokenizer, and a multimodal wine dataset. And, for a bit of fun, we have an entertaining ad for SplashTech's SuperSoak water gun. Contact: sergi@earkind.com Timestamps: 00:34 Introduction 01:41 Twitter’s privacy policy confirms it will use publi...2023-09-0515 min

GPT ReviewsGoogle Duet AI's 🤖 // Growing Concern About AI Use 😟 // RLAIF, CityDreamer, and FACET 📈Google's Duet AI introducing new features for productivity software, growing public concern about the role of AI in daily life, and Andrej Karpathy's tweet about an optimization technique for inference-time with LLMs. Additionally, the episode features three fascinating papers: RLAIF, CityDreamer, and FACET, which cover topics such as reinforcement learning, 3D city generation, and fairness in computer vision evaluation. Contact: sergi@earkind.com Timestamps: 00:34 Introduction 01:33 Google Duet AI: New Features for Gmail, Docs and Sheets, at $30 a Month 03:20 Growing public concern about the role of artificial intelligence in daily life...2023-09-0414 min

GPT ReviewsBaidu's ERNIE vs. ChatGPT 👥 // Export Restrictions Impact AI Industry 💥 // GPT-3 Fine-tuning for SQL 🤖China's Baidu has rolled out a new AI chatbot called ERNIE, which is being touted as a rival to OpenAI's ChatGPT. The US has expanded restrictions on the export of Nvidia AI chips beyond China to some countries in the Middle East, which could have serious implications for China's AI industry. OpenAI has released GPT-3.5-Turbo for fine-tuning natural language to SQL, which allows for even more advanced NL-to-SQL models to be created. A new paper proposes LLaSM, a large multi-modal speech-language model that can follow speech-and-language instructions, which could provide a more natural way for humans to interact...2023-09-0114 min

GPT ReviewsGoogle Cloud AI Updates 🌥️ // Copyright and AI 📝 // Security Risks of Generative AI 🔒Google Cloud's latest AI updates and partnerships, to the US Copyright Office's call for opinions on AI and copyright issues, this episode is packed with insights and analysis. The show also features papers on vector search and the security risks of generative AI, highlighting the need for ethical guidelines and methods for detecting and mitigating these risks. Contact: sergi@earkind.com Timestamps: 00:34 Introduction 01:32 Google AI Updates from their Developer Keynote 02:45 US Copyright Office wants to hear what people think about AI and copyright 04:36 ChatLZMA – text generation from data co...2023-08-3014 min

GPT ReviewsOpenAI ChatGPT Enterprise 🔒 // Websites Blocking GPTBot 🚫 // Adversarial Fine-Tuning for Problematic Content Detection 🔎OpenAI has launched ChatGPT Enterprise, an AI chatbot with enterprise-grade security and privacy, unlimited higher-speed GPT-4 access, longer context windows, and advanced data analysis capabilities. Some websites, including Amazon, Quora, and NY Times, have already blocked OpenAI's GPTBot, a language model that can generate text indistinguishable from human writing. Researchers from the Australian National University have proposed a dual-stage optimization technique using adversarial fine-tuning to generate and detect problematic content in large language models. LongBench, the first bilingual, multi-task benchmark for long context understanding, provides a more rigorous evaluation of long context understanding, which is crucial for developing language...2023-08-3015 min

GPT ReviewsAlibaba's AI Models vs. ChatGPT 🤖 // Tesla's Full Self-Driving 🚗 // Closed vs Open Source 🌟Alibaba's new AI models, Tesla's "Full Self-Driving" software, and Google's new Gemini model. The papers discussed also provide valuable insights into evaluating safeguards in LLMs, improving the quality of generated code, and large vision-language models with versatile abilities. Contact: sergi@earkind.com Timestamps: 00:34 Introduction 01:38 Alibaba launches chatGPT competitor that can understand images and have more complex conversations 03:06 Elon Musk demoes Tesla's AI-only self driving 05:06 Google Gemini Eats The World – Gemini Smashes GPT-4 By 5X, The GPU-Poors 06:14 Fake sponsor 08:39 Qwen-VL: A Frontier Large Vision-Language Model wi...2023-08-2915 min

GPT ReviewsHugging Face's $235M Funding 💰 // Anti-hype LLM Reading List 📚 // Large Language Models for Autonomous Agents 🤖Hugging Face, an AI startup, has raised a massive $235 million in funding from big tech companies like Google, Amazon, and IBM, highlighting the demand for collaborative AI development. The Anti-hype LLM reading list is a valuable resource for anyone interested in understanding language models from a practical perspective, providing links to reasonable and good explanations of how things work, with no hype or vendor content. Large language models are being utilized in the development of autonomous agents, which can better understand natural language, generate human-like responses, and adapt to new situations. The Giraffe paper explores the limitations of fixed...2023-08-2815 min

GPT ReviewsCode Llama 🦙 // AI on Windows 11 💻 // Bayesian Flow Networks 🌊 Code Llama, a new state-of-the-art large language model for coding, and Microsoft's plans to add AI capabilities to apps like Paint and Photos on Windows 11. We also explore Ludwig, a low-code framework for building custom AI models, and WizardMath, a new model that improves the mathematical reasoning abilities of large language models. Contact: sergi@earkind.com Timestamps: 00:34 Introduction 01:34 Introducing Code Llama, a state-of-the-art large language model for coding 03:24 Microsoft may bring AI capabilities to apps like Paint and Photos on Windows 11 05:24 Ludwig: a low-code framework for building custo...2023-08-2514 min

GPT ReviewsSeamlessM4T Translation Model 🌎 // GPT 3.5 Turbo Finetuning 💻 // Motion-Guided Masking for Video 🎥Meta introduces SeamlessM4T, a multimodal AI model for speech and text translations that supports nearly 100 languages. OpenAI announces fine-tuning for GPT 3.5 Turbo, allowing businesses to customize the model for unique user experiences. The Backyard Emporium offers Miracle Grow, a product that makes plants grow super fast. Three AI research papers are discussed, covering adversarial robustness of multi-modal foundation models, causal inference learning, and motion-guided masking for video masked autoencoding. Contact: sergi@earkind.com Timestamps: 00:34 Introduction 01:30 Meta Introduces SeamlessM4T, a Multimodal AI Model for Speech and Text Translations 03:10 Open...2023-08-2413 min

GPT ReviewsYouTube AI x Music Labels 🎶 // IBM Study on the Future of Workers 👨‍💼 // DeepMind's Reinforced Self-Training 🧠YouTube's partnership with music labels to establish rules for AI-generated content, IBM's study on the impact of AI on the workforce, and two papers on language modeling and safety alignment. The episode also explores Reinforced Self-Training (ReST), a new reinforcement learning algorithm that improves the quality of large language models, and "Adapting Learned Sparse Retrieval for Long Documents," which proposes adaptations to improve the performance of Learned Sparse Retrieval (LSR) for long documents. Contact: sergi@earkind.com Timestamps: 00:34 Introduction 01:50 YouTube is figuring out its AI strategy by working with music labels ...2023-08-2315 min

GPT ReviewsAI Social Media App 📱 // AI Art and Copyright 🎨 // Brain-Inspired Deep Learning 🧠From the launch of BeFake, an AI-based social media app, to a ruling that AI-created art is not copyrightable, the show explores the latest developments in the field. The episode also features two research papers that propose new approaches to deep neural networks and machine unlearning, as well as a paper that introduces a general approach to personalized text generation inspired by writing education. Contact: sergi@earkind.com Timestamps: 00:34 Introduction 01:26 Ex-MZ CEO launches BeFake, an AI-based social media app 02:59 AI-Created Art Isn’t Copyrightable, Judge Says in Ruling That Could...2023-08-2214 min

GPT ReviewsMicrosoft's Databricks Plans 🤖 // Dry News August 😩 // Multi-Agent Debate for LLM 🤝Microsoft plans to sell a new version of Databricks software that helps customers make AI apps for their businesses, potentially hurting OpenAI's business. Businesses should prioritize customer experience over cost reduction when implementing AI, according to an article titled "How NOT to apply Artificial Intelligence in your business". Three AI research papers were discussed, including a multi-agent debate framework for language model evaluation, a curricular subgoal-based framework for inverse reinforcement learning, and a parameter-efficient module operation approach for deficiency unlearning in large language models. Contact: sergi@earkind.com Timestamps: 00:34 Introduction 01:32 Microsoft Pla...2023-08-2114 min

GPT ReviewsDeepMind Life Advice 🤔 // Eric Schmidt's AI Moonshot 🚀 // Role-Play Prompting for Zero-Shot Reasoning 👥Google's AI unit, DeepMind, is developing AI tools for life advice, planning, and tutoring, but AI safety experts have concerns about users taking life advice from AI tools. Former Google CEO Eric Schmidt is launching an AI-science moonshot, building a new nonprofit organization to tackle scientific challenges with the help of AI. "Transformers in Reinforcement Learning: A Survey" explores how transformers can be used in reinforcement learning to address unique challenges and improve applications. "Better Zero-Shot Reasoning with Role-Play Prompting" investigates the influence of role-playing on large language models' reasoning abilities and introduces a new methodology called "role-play prompting"...2023-08-1814 min

GPT ReviewsIBM's Analog AI Chip 🧠 // New Google Search AI Features 🕵️‍♂️ // Multimodal LLMs with LCL 🔗IBM has unveiled a new prototype of an analog AI chip that works like a human brain, promising to be more efficient and less battery-draining for computers and smartphones. Google has rolled out new search AI features, including the ability to see definitions within AI-generated responses and color-coded syntax highlighting for coding. The paper "Learning to Identify Critical States for Reinforcement Learning from Videos" explores how videos can be used to extract implicit information about rewarding action sequences in deep reinforcement learning, with potential applications in robotics. "Link-Context Learning for Multimodal LLMs" proposes a new approach called Link-Context Learning...2023-08-1714 min

GPT ReviewsChatGPT for Business 📊 // Race for Scarce Nvidia Chips 🚀 // GPT-4 Code Interpreter's Remarkable Performance 🔥Microsoft's new enterprise spin-off of ChatGPT, the race for scarce Nvidia chips, the remarkable performance of GPT-4 Code Interpreter on challenging math datasets, and a new paper from Google Research comparing the performance of prefixLM and causalLM for in-context learning. Contact: sergi@earkind.com Timestamps: 00:34 Introduction 01:54 Microsoft Azure ChatGPT allows enterprises to run ChatGPT within their network 03:32 Saudi Arabia, UAE join Elon Musk and Chinese tech titans in the race for scarce Nvidia chips 05:14 AI Town 06:19 Fake sponsor 07:59 Solving Challenging Math Word Problems Using...2023-08-1614 min

GPT ReviewsAI in Elections 🗳️ // OpenAI's Financial Trouble 💰 // Self-Alignment with Instruction Backtranslation 🤖The Federal Election Commission is considering regulating the use of AI-generated content in political ads ahead of the 2024 elections. Zoom has updated its policies to clarify that user data, such as videos, won't be used to train AI models. OpenAI is facing financial struggles and may have to file for bankruptcy by the end of 2024. Three exciting papers were discussed, including a method for building a high-quality instruction-following language model, a comparison of LLM-augmented autonomous agent architectures, and a large dataset of over 16 million multiple sequence alignments for protein structure prediction. Contact: sergi@earkind.com Ti...2023-08-1515 min

GPT ReviewsZoom Keystroke Detection 🔍 // DeepMind's AlphaStar Unplugged 🔌 //Claude Instant 1.2 🤖Anthropic has released Claude Instant 1.2, a faster and safer model that outperforms its previous version in math, coding, and safety. Media organizations are calling for regulations to protect copyright in data used to train generative AI models, as it undermines their business models and reduces media diversity. Researchers have made a breakthrough in detecting keystrokes over Zoom calls, using machine learning and microphones to interpret remote keystrokes based on sound profiles of individual keys. The papers discussed in this episode showcase advancements in reinforcement learning for complex games like StarCraft II, language models that critique and refine their own...2023-08-1414 min

GPT ReviewsGoogle's IDF 🌐 // Nvidia bet on AI 💰 // Simple Synthetic Data 🤖Google's new project IDX, Nvidia's bet on AI, and two papers on language models are discussed. The first paper from Google DeepMind explores how simple synthetic data can reduce sycophancy in large language models, while the second paper from Stanford University proposes a new algorithm called staged speculative decoding to speed up the inference of large language models in small-batch, on-device scenarios. Contact: sergi@earkind.com Timestamps: 00:34 Introduction 01:41 Google Unveils Project IDF 03:20 Nvidia CEO: "We bet the farm on AI and no one knew it" 04:53 Jason Wei Long...2023-08-1114 min

GPT ReviewsNvidia's New AI Chip 🚀 // Disney Looking into AI 🎥 // Context-Prompting for Language Models 🤖Nvidia's new AI chip, the GH200, promises to significantly reduce the cost of running large language models, making AI more accessible for smaller companies. Disney is exploring the use of AI to cut costs in movie and television production, as well as enhance customer support and create unique interactions within its theme parks. The paper "Skills-in-Context Prompting" proposes a novel prompting strategy that significantly improves the compositional generalization capabilities of large language models. The paper "SILO Language Models" addresses the legal risks associated with training language models on copyrighted or restricted data, proposing a solution that mitigates these risks whi...2023-08-1015 min

GPT ReviewsBing Chat on Mobile 📱 // Zoom's Privacy Policy Update 🔒 // AgentBench for LLMs 🚀Microsoft's AI-powered Bing Chat now available on all mobile browsers, Zoom's updated privacy policy, the introduction of AgentBench for evaluating LLMs as agents, and the Flows framework for modeling complex interactions between AI systems and humans. These developments have the potential to lead to more robust and reliable AI models that can perform well in complex, real-world scenarios. Contact: sergi@earkind.com Timestamps: 00:34 Introduction 01:29 Microsoft’s AI-powered Bing Chat is coming to mobile browsers 02:50 Zoom says its new AI tools aren’t stealing ownership of your content 04:30 Kubernete...2023-08-0913 min

GPT ReviewsMicrosoft Kills Cortana 💀 // AI-Powered Brain Implants Restore Mobility 🌟 // Graphical Language for Predictive Processing 🤔Microsoft is shutting down Cortana and shifting its focus to modern-day AI advances, like its ChatGPT-like Bing Chat and other AI-powered productivity features across Windows and its web browser Edge. Researchers have used AI-powered brain implants to restore movement and sensation for a man who was paralyzed from the chest down, offering life-changing mobility and independence to many. A paper presents a categorical formulation of Predictive Processing and Active Inference using string diagrams, providing a graphical language for understanding these cognitive frameworks with potential implications for robotics, cognitive science, and machine learning. A new approach called "self-translate" leverages the...2023-08-0814 min

GPT ReviewsApple's Generative AI 💰 // CoreWeave's $2.3B Loan 🌩️ // Evaluating Large Multimodal Models 🧐Apple's investment in generative AI and CoreWeave's $2.3 billion loan for cloud infrastructure. We also dive into two research papers, "Retroformer" and "MM-Vet," which explore optimizing language agents and evaluating large multimodal models, respectively. Contact: sergi@earkind.com Timestamps: 00:34 Introduction 01:42 Apple has been quiet about ChatGPT. Now Tim Cook says its hefty $22.6 billion research spend is down to generative AI. 02:58 CoreWeave, which provides cloud infrastructure for AI training, secures $2.3B loan 05:17 Analysis: While AI takes the spotlight, infrastructure stocks shine 06:56 Fake sponsor 08:53 Retroformer: Retrospective Large...2023-08-0715 min

GPT ReviewsGPT-5 Trademark Application 📝 // Google's "Mind-Reading" AI 🧠 // Soft MoE Outperforms Transformers 🔥OpenAI's trademark application for GPT-5 could signify a continued advancement in natural language processing and machine learning. Google's "mind-reading" AI raises ethical concerns about potential implications and future uses of the technology. Soft MoE, a new type of mixture of expert architecture proposed by Google DeepMind, outperforms standard Transformers and popular MoE variants in visual recognition. PerceptionCLIP, a method proposed by the University of Maryland and the Bosch Center for Artificial Intelligence, improves zero-shot image classification by inferring and conditioning on contextual attributes, achieving better generalization, group robustness, and interpretability. Contact: sergi@earkind.com Timestamps:...2023-08-0414 min

GPT ReviewsAI Video Summaries on YouTube 🎥 // Generative AI for Audio 🎵 // Synthetic Dataset for Point Tracking 🕹️YouTube is experimenting with AI-generated video summaries, potentially changing the way creators structure their videos. AudioCraft, a generative AI for audio, is now available to all and could revolutionize the way we create and experience audio. PointOdyssey, a synthetic dataset and data generation framework, aims to advance the state-of-the-art in long-term fine-grained tracking algorithms. A new attack method on aligned language models demonstrates the vulnerability of even aligned models to adversarial attacks, raising important questions about preventing objectionable content. Contact: sergi@earkind.com Timestamps: 00:34 Introduction 01:33 YouTube uses AI to summarize videos in...2023-08-0314 min

GPT ReviewsPalantir's AI Weapons 💣 // Meta's Chatbot Personas 🤖 // Google Assistant x BART 🗣️Billionaire CEO of military technology supplier Palantir advocates for AI weapons, sparking controversy and raising questions about the risks and benefits of such technology. Meta is developing AI-powered chatbots with different personalities, which could collect vast amounts of data on users' interests, but also raises concerns around privacy and potential manipulation. Google is planning to update Assistant with features powered by generative AI, which could allow it to answer questions based on information gleaned from across the web, but also raises potential privacy implications. Cutting-edge AI research papers have been published, including one...2023-08-0214 min

GPT ReviewsStackOverflow's OverflowAI 💡 // Vision & Language to Action 🤖 // Limits of RLHF 🚫The RT-2 model translates vision and language into action, showing improved generalization capabilities and semantic and visual understanding beyond the robotic data it was exposed to. OverflowAI is a new space for Stack Overflow's community and customers to explore the future of knowledge sharing together, featuring semantic search, enterprise knowledge ingestion, Slack integration, a Visual Studio Code extension, and AI community discussions. "Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback" surveys the fundamental limitations and open problems of RLHF, as well as techniques to improve and complement it in practice. "...2023-08-0114 min

GPT ReviewsNetflix's AI job Ad 💼 // Google's impressive Q2 earnings 📈 // Boosting LLMs for Code 🤖Netflix's controversial AI job ad, Google's impressive Q2 earnings, and advancements in autonomous agents and large language models for code. The episode also features discussions on a realistic web environment for building autonomous agents and a new framework for boosting pre-trained Code LLMs. Contact: sergi@earkind.com Timestamps: 00:34 Introduction 01:41 Netflix touts $900k AI jobs amid Hollywood strikes 02:55 Google stock jumped 10% this week, fueled by cloud, ads and hope in A.I. 04:44 Worldcoin: a solution in search of its problem 05:52 Fake sponsor 08:00 WebArena: A Realis...2023-07-3114 min

GPT ReviewsOpenAI x Google x Anthropic x Microsoft Partnership 🤝 // Stablilty's New Text-to-Image Model 🌅 // Factual Error Detection Framework 📚A new partnership promoting responsible AI, the release of a new text-to-image model with ethical safeguards, a framework for detecting factual errors in generated text, and a framework for high-quality object tracking in videos. Contact: sergi@earkind.com Timestamps: 00:34 Introduction 01:32 A new partnership to promote responsible AI 03:18 Stability AI releases its latest image-generating model, Stable Diffusion XL 1.0 05:27 No One Wants To Talk To Your Chatbot 06:28 Fake sponsor 08:45 FacTool: Factuality Detection in Generative AI -- A Tool Augmented Framework for Multi-Task and Multi-Domain Scenarios ...2023-07-2815 min

GPT ReviewsJapan's Supercomputer 🇯🇵 // GitHub x EU AI Law 🇪🇺 // ARB benchmark for LLMs 🧪Japan's Ministry of Economy, Trade, and Industry is investing heavily in AI development by building a new supercomputer to accelerate progress in AI and reduce Japan's dependence on foreign countries. GitHub and other companies are calling for more open-source support in EU AI law to promote a more open ecosystem for AI innovation. The ARB benchmark presents a more challenging test for large language models, and current models score well below 50% on more demanding tasks. "Predicting Code Coverage without Execution" proposes using Machine Learning to predict code coverage without actual execution, which could lower the cost of code coverage...2023-07-2715 min

GPT ReviewsWorldcoin from Sam Altman 🪙 // ChatGPT for Android 📱 // Retentive Network for LLMs 🧠The launch of Worldcoin at the intersection of AI, identity, and finance, to the Android release of OpenAI's ChatGPT, this episode explores the cutting-edge of AI-powered conversational tools. The paper "Evaluating the Ripple Effects of Knowledge Editing in Language Models" proposes a new evaluation benchmark that could have important implications for improving the accuracy of language models. Finally, the Retentive Network, a new architecture proposed as a successor to the Transformer architecture for large language models, achieves favorable scaling results, parallel training, low-cost deployment, and efficient inference. Contact: sergi@earkind.com Timestamps: 00:34 Introduction...2023-07-2614 min

GPT ReviewsStability's New LMs 🆕 // Anthropic on Faithfulnes in CoT 🔗 // STEVE-1 for Minecraft 🎮two new open-source Large Language Models, the departure of OpenAI's head of trust and safety, the introduction of STEVE-1, a model that can follow a wide range of instructions in Minecraft, and a paper exploring ways to protect the copyright of Neural Radiance Fields. Contact: sergi@earkind.com Timestamps: 00:34 Introduction 02:03 Stability AI Releases 2 Language Models 03:20 OpenAI’s head of trust and safety Dave Willner steps down 04:59 Twitter Thread by Anthropic on Faithfulness in Chain of thought reasoning 06:12 Fake sponsor 08:19 Invalid Logic, Equivalent Gains: The Biza...2023-07-2514 min

GPT ReviewsCustomizing ChatGPT 🤖 // AI Companies' Voluntary Safeguards 🚨 // Neural Sparse Retrieval 🔎OpenAI's ChatGPT has released a new update to give users more control over how it responds. A.I. companies have agreed to voluntary safeguards to manage the risks associated with their technology. "Secrets of RLHF in Large Language Models Part I: PPO" introduces a new approach to reinforcement learning with human feedback, which is important for large language models. Finally, "SPRINT: A Unified Toolkit for Evaluating and Demystifying Zero-shot Neural Sparse Retrieval" introduces a new paradigm in retrieval called neural sparse retrieval, and a toolkit called SPRINT to evaluate and compare different models. Contact: sergi@earkind.com...2023-07-2414 min

GPT ReviewsNYC Subyay's AI 🚈 // A $100M Supercomputer 💸 // College-Level LLM Benchmark 🏫Cerebras has sold a $100 million AI supercomputer and is planning eight more, challenging the market for AI hardware and validating the market for specialized AI hardware outside of GPUs. The NYC subway system is using AI to track fare evasion, raising concerns about privacy and surveillance. LLM Evaluation research... Contact: sergi@earkind.com Timestamps: 00:34 Introduction 01:39 Cerebras Sells $100 Million AI Supercomputer, Plans Eight More 03:14 NYC subway using AI to track fare evasion 04:50 AI Safety and the Age of Dislightenment 06:23 Fake sponsor 08:12 Toward...2023-07-2115 min

GPT ReviewsLLAMA 2 🦙 // Apple GPT 🍎 // Zero-Shot Retrieval 🎯LLAMA 2, the new open-source conversational language model from Meta, has been released, with Microsoft as the preferred partner. Apple has created its own AI-based chatbot called "Apple GPT" to compete with Google and Open AI, but had to halt the rollout due to security concerns around generative AI. "Precise Zero-Shot Dense Retrieval without Relevance Labels" proposes a new approach called Hypothetical Document Embeddings (HyDE) to address the challenge of creating fully zero-shot dense retrieval systems without any relevance labels. "To Infinity and Beyond: SHOW-1 and Showrunner Agents in Multi-Agent Simulations" explores the use of large language models and multi-agent...2023-07-2015 min

GPT ReviewsWix's AI Journey 🚀 // SEC's AI Risks Warning ⚠️ // NaViT Vision Transformer 🔭A comprehensive look at Wix's AI journey, from its current AI-powered features to its ambitious future plans. It also delves into the SEC's warning about AI's potential risks to financial stability, including its use in financial fraud and conflicts of interest. The episode also features a deep dive into three intriguing research papers, exploring the NaViT Vision Transformer, the Retentive Network as a potential successor to Transformers, and a theory on Adam Instability in large-scale machine learning. Lastly, the episode includes a segment on Random Reads, where potential inaccuracies in the "gzip beats BERT" paper are discussed. ...2023-07-1917 min

GPT ReviewsMeta's CM3leon 🎨 // ChatGPT Isn't Getting Dumber 🤔 // Linear Complexity Speech Recognition 🗣️CM3leon, a new generative model for text and images that is more efficient and state-of-the-art. OpenAI researcher Jason Wei is also featured, offering an "Ask Me Anything" document on AI research. Additionally, Sumformer, a linear-complexity alternative to self-attention for speech recognition, and DreamTeacher, a self-supervised feature representation learning framework that uses generative networks for pre-training downstream image backbones, are discussed. Contact: sergi@earkind.com Timestamps: 00:34 Introduction 02:16 Introducing CM3leon, a more efficient, state-of-the-art generative model for text and images 03:40 OpenAI product leader denies claims GPT-4 has gotten ‘lazier and du...2023-07-1813 min

GPT ReviewsAP & OpenAI Partnership 🤝 // Google's Multilingual Bard 🌍 // Meta's Commercial LLaMA 💻AP's partnership with OpenAI, Google's language model Bard, and Meta's release of a commercial version of LLaMA. Additionally, two AI research papers are discussed, one about using LLMs to help robots with complex tasks and another about a hypernetwork called HyperDreamBooth that can efficiently generate personalized weights from a single image of a person. Contact: sergi@earkind.com Timestamps: 00:34 Introduction 01:30 AP strikes news-sharing and tech deal with OpenAI 02:37 July Bard Update from Google 04:01 Meta to release open-source commercial AI model to compete with OpenAI and Google 06...2023-07-1713 min

GPT ReviewsElon Musk's New Company 🧠// Anthropic's Claude 2 🤖 // Google's NotebookLM 📝Elon Musk's new AI company, xAI, aims to understand the true nature of the universe and has a team of heavy hitters from AI powerhouses. Anthropic's new model, Claude 2, has made significant improvements in coding, math, and reasoning, and is being used by businesses for a wide variety of use cases. Google's new AI-backed tool, NotebookLM, is a note-taking tool that uses AI to help users with research and document review, and is part of Google's push to integrate AI into every aspect of our lives. The challenges of regulating advanced AI models...2023-07-1414 min

GPT ReviewsGoogle vs Misinformation 🔎 // Volkswagen's self-driving 🚗 // GLUE for Video 👀Google's efforts to combat political misinformation, Volkswagen's partnership with Mobileye to launch autonomous vehicles, and two AI research papers on video understanding and efficient text generation. The episode highlights the need for improved video-focused foundation models and the potential for more efficient inference in large language models. Contact: sergi@earkind.com Timestamps: 00:34 Introduction 01:27 Google working on tech to discern AI-made content: Company executive 03:19 Volkswagen to start testing self-driving ID Buzz vans in Austin 04:55 Yao Fu Tweets about Hallucination in Language Models 06:11 Fake sponsor 08:31 VideoG...2023-07-1014 min

GPT ReviewsAI Best-Sellers? 📚 // ChatGPT Web Browsing Disabled 🔒 // Superalignment 🤖From the concerning flood of AI-generated books on Amazon's Kindle Program to OpenAI's Superalignment team dedicated to addressing the superintelligence alignment problem. The team also discusses the temporary shutdown of the web browsing feature for ChatGPT Plus subscribers and three research papers, including LongNet, SDXL, and KnowNo. These papers cover topics such as scaling sequence length, text-to-image synthesis, and aligning the uncertainty of LLM-based planners. Contact: sergi@earkind.com Timestamps: 00:34 Introduction 01:34 Amazon has a big problem as AI-generated books flood Kindle Unlimited 02:54 OpenAI disables ChatGPT Web Browsing 04:30 Intro...2023-07-0714 min

GPT ReviewsEU-Japan AI Partnership 🤝 // UN Security Council meeting on AI threats 🌎 // Meta-learning vs Pre-training 🔍The EU and Japan's potential partnership on AI and chips to reduce reliance on China, the UN Security Council's first-ever meeting on the potential threats of AI to global peace and security, a paper challenging the belief that pre-trained models always outperform meta-learning algorithms in few-shot learning, and Microsoft's ZeRO++ introducing communication volume reduction techniques to improve the efficiency of training large language models. Contact: sergi@earkind.com Timestamps: 00:34 Introduction 01:42 EU and Japan look to partner on A.I. and chips as China ‘de-risking’ strategy continues 02:58 UN council to hold fir...2023-07-0615 min

GPT ReviewsGPT-4 to Scam Scammers ♻️ // Neural Reasoning 🧠 // State-Maintaining Language Models 🤖From the potential misuse of AI-generated imagery for disinformation campaigns to the use of AI chatbots to frustrate telemarketing scammers, the episode highlights the versatility of AI in addressing persistent issues. The development of Hint-ReLIC and Statler state-maintaining language models also show promising advancements in neural algorithmic reasoning and large language model-based robot reasoning. Contact: sergi@earkind.com Timestamps: 00:34 Introduction 01:36 Political Satirist Slammed for Creating Deepfakes of Trump, Biden Cheating on their Wives 02:51 California man's business is frustrating telemarketing scammers with chatbots 04:26 The industry behind the industry behin...2023-07-0513 min

GPT ReviewsElon Musk vs AI Paywalls 🤖 // EU AI Regulation Warning 🚨 // GPT-4's Big Secret 🤫Elon Musk blames AI companies for new paywalls on reading tweets. European VCs and tech firms warn against over-regulation of AI in EU la. GPT-4's big secret is revealed: it's a mixture of 8 smaller models using a 2-year-old technique. AI research papers discussed include Preference Ranking Optimization for Human Alignment, ChatGPT for Robotics, and Large Language Models are Effective Text Rankers with Pairwise Ranking Prompting. Contact: sergi@earkind.com Timestamps: 00:34 Introduction 01:36 Elon Musk blames data scraping by AI startups for his new paywalls on reading tweets 02:51 European VCs and...2023-07-0415 min

GPT ReviewsAI-Generated Drug Enters Trial 🧪 // Slot-TTA for Visual Detectors 🔍 // Catastrophic AI Risks ⚠️From the first fully AI-generated drug entering clinical trials to a new method for improving out-of-distribution performance of visual detectors, there's plenty to pique your interest. The OBELISC dataset is also discussed, which could have important implications for training large multimodal models. Finally, the potential catastrophic risks associated with AI are explored, with practical suggestions proposed for mitigating these dangers. Contact: sergi@earkind.com Timestamps: 00:34 Introduction 01:33 How AI Influences What You See on Facebook and Instagram 03:13 The first fully A.I.-generated drug enters clinical trials in human patients 2023-07-0315 min

GPT ReviewsCrowd Workers = ChatGPT ❓ // Amazon AI Marketplace 🛍️ // ImageNET vs. LAION 📸Amazon's AI marketplace, Stability AI's recent changes, potential biases in large language models, and the use of LLMs by crowd workers. These discussions shed light on important issues in the AI industry and highlight the need for transparency and oversight in the development and use of AI models. Contact: sergi@earkind.com Timestamps: 00:34 Introduction 01:41 Amazon reveals AI strategy 03:37 Stability AI’s head of research quits 05:21 Optimize & Deploy BERT on AWS inferentia2 06:29 Fake sponsor 08:41 Towards Measuring the Representation of Subjective Global Opinions in Language Models 2023-06-3015 min

GPT ReviewsERNIE 3.5 > GPT-4? 🤖 // Learning to Rank with Generative Retrieval 🔝 // System-Level Language Feedback 💬ERNIE 3.5 has made significant improvements in efficacy, functionality, and performance, while OpenAI's work assistant may put the company at odds with Microsoft's copilots. "Learning to Rank in Generative Retrieval" proposes a novel framework that achieves state-of-the-art performance among generative retrieval methods. Finally, "System-Level Natural Language Feedback" proposes a framework for using natural language feedback to improve machine learning models at a system-level, leading to further gains in model performance. Contact: sergi@earkind.com Timestamps: 00:34 Introduction 01:24 Introducing ERNIE 3.5: Baidu’s Knowledge-Enhanced Foundation Model Takes a Giant Leap Forward 02:56 Would an OpenAI...2023-06-2914 min

GPT ReviewsDeepMind's Chatbot is Coming 🤖 // Reka the new Gen AI Contender 🚀 // Prompt Engineering Guide 📚DeepMind is developing a chatbot called Gemini that will rival ChatGPT, while Reka has received $58 million in funding to advance AI research. We also explore the emerging discipline of prompt engineering and the use of large language models as weak learners in machine learning pipelines. Contact: sergi@earkind.com Timestamps: 00:34 Introduction 01:34 DeepMind claims its next chatbot will rival ChatGPT 02:53 Announcing our $58M funding to Build Generative Models and Advance AI Research 04:11 Prompt Engineering Guide 05:57 Fake sponsor 07:45 Language models are weak learners 09:02 Supervi...2023-06-2814 min

GPT ReviewsDatabricks $1.3B Deal With MosaicML 💰 // Self-supervised evaluation 🧐 // Scaling MLPs 📈Databricks' acquisition of MosaicML for $1.3 billion, Ramp's acquisition of Cohere.io to improve its customer service, and two research papers on self-supervised evaluation for large language models and scaling MLPs. These topics provide valuable insights into the growing demand for generative AI tools, the importance of realistic data evaluation, and the limits of MLPs' performance on vision tasks. Contact: sergi@earkind.com Timestamps: 00:34 Introduction 01:31 Databricks Strikes $1.3 Billion Deal for Generative AI Startup MosaicML 03:33 As the generative AI craze rages on, Ramp acquires customer support startup Cohere.io 05:36 Insid...2023-06-2716 min

GPT ReviewsStability's New Models 📸 // Translating 5,000-year-old Tablets 🗿 // Text-to-Thought Models 💭Stability AI's SDXL 0.9, which has potential applications in film, television, music, design, and industrial fields, to archaeologists using AI to translate 5,000-year-old cuneiform tablets, this episode is packed with fascinating developments. We also delve into MIT and Stanford's "From Word Models to World Models," which proposes a computational framework for language-informed thinking, and Google's AudioPaLM, a large language model that fuses text-based and speech-based language models into a unified multimodal architecture that can process and generate both text and speech. Contact: sergi@earkind.com Timestamps: 00:34 Introduction 01:44 Stability AI launches SDXL 0.9: A...2023-06-2614 min

GPT ReviewsChatGPT Data Breach 🚨 // OpenAI's App Store 📱 // HomeRobot 🦾From the concerning news of over 100,000 ChatGPT account credentials being compromised, to OpenAI's plans to launch a marketplace for AI models, potentially competing with existing app stores. We also explore Hyung Won Chung's unique approach to software development with GPT-4 and discuss promising research papers on improving language models with memory-augmentation and mobile manipulation. Contact: sergi@earkind.com Timestamps: 00:34 Introduction 01:53 100000+ ChatGPT Accounts Data Leaked 03:22 OpenAI plans app store for AI software, The Information reports 05:08 Hyung Won Chung on Test Driven Development with GPT-4 06:28 Fake sponsor 08:36 RepoFusion: Training Code Models to Understand Your Repository 10:26 GLIMMER: generalized late-interaction memory reranker 11:30 HomeRobot: Open-Vocabulary Mobile M...2023-06-2314 min

GPT ReviewsDeepMind's RoboCat 🙀 // Fast Diffusion Model 🎨 // Multilingual Relation Extraction 🌎DeepMind's self-improving robotic agent, RoboCat, and Microsoft's new AI model, Orca. They also discuss two research papers, "RED FM" and "SqueezeLLM," that introduce new resources for multilingual relation extraction and a post-training quantization framework, respectively. Contact: sergi@earkind.com Timestamps: 00:34 Introduction 01:37 DeepMind Announces RoboCat: A self-improving robotic agent 03:35 Meet Orca, Microsoft’s new 13 billion parameter AI model that can imitate GPT-4 05:40 Levanter — Legible, Scalable, Reproducible Foundation Models with JAX 06:57 Fake sponsor 09:15 RED$^{\rm FM}$: a Filtered and Multilingual Relation Extraction Dataset 11:06 Fast Diffus...2023-06-2215 min

GPT ReviewsGrammys' AI Rules 🎙️ // OpenLLaMA 🤖 // Recurrent Memory Decision Transformer 🔍the intersection of AI and music with the Grammys' new rules for AI use. We also dive into the OpenLLaMA project, an open source reproduction of Google's LLaMA language model. Our AI research expert, Belinda, joins us to discuss two papers: the Recurrent Memory Decision Transformer, which proposes a model for reinforcement learning tasks, and the Block-State Transformer, which combines State Space Models and Block Transformers for language modeling. Contact: sergi@earkind.com Timestamps: 00:34 Introduction 01:40 And the award goes to AI ft. humans: the Grammys outline new rules for AI use ...2023-06-2115 min

GPT ReviewsInverse Scaling 🧩 // Meta's Deepfake Concerns 🕶️ // China Welcomes AI 🇨🇳Meta's decision not to release their AI speech tool due to deepfake concerns, China's President welcoming AI tech with potential global implications, the "Waluigi Effect" article revealing the challenges of large language models, and research papers exploring fine-tuning, inverse scaling, and unifying large language models and knowledge graphs. Contact: sergi@earkind.com Timestamps: 00:34 Introduction 02:12 Meta's New AI Speech Tool Is Ready-Made for Deepfakes—So It's Not Being Released 03:52 China’s President Welcomes AI Tech 05:23 The Waluigi Effect (mega-post) 07:25 Fake sponsor 09:46 Full Parameter Fine-tuning for Large Language Models with Limited Resources 11:22 Inverse Scaling: When Bigger Isn't Better 13:05 Unifying Large Language Models and Knowledg...2023-06-2017 min

GPT ReviewsGoogle's Virtual Try-on 👔 // Captioning for Representation Learning 📸 // Efficient Atari Players 🎮OpenAI's ChatGPT takes the AI world by storm with 1 million users in five days, while Google revolutionizes online shopping with a virtual try-on feature. The essay "Imaginary Problems Are the Root of Bad Software" sheds light on the importance of clear communication in software development. Meanwhile, research papers explore TryOnDiffusion, image captioning, and the BBF agent's super-human performance in Atari games. It's a whirlwind of AI advancements and insights that will leave you craving more. Contact: sergi@earkind.com Timestamps: 00:34 Introduction 02:07 OpenAI’s Mira Murati: the woman charged with pushing generative AI in...2023-06-1919 min

GPT ReviewsJapan's Permissive AI Copyright 🇯🇵 // The Future of Open LMs 🔮 // Galactic 🪐Amazon's leaked document on ChatGPT, Japan's decision on AI training copyrights, and Israel's support for copyrighted works in machine learning. We also explore two research papers on end-to-end reinforcement learning for robotic mobile manipulation and benchmarking general conditional image similarity. Contact: sergi@earkind.com Timestamps: 00:34 Introduction 01:51 A leaked document of Amazon's ideas for using ChatGPT and AI at work lists 67 ways to take advantage of the ChatGPT boom 03:36 Japan Goes All In: Copyright Doesn’t Apply To AI Training 04:49 Israel Ministry of Justice Issues Opinion Supporting the Use of...2023-06-1615 min

GPT ReviewsMistral AI raises $113M 💸 // The Beatles' new AI album 🎶 // Progressive Learning from GPT-4 💬Mistral AI secures $113 million seed funding to compete against OpenAI with a unique approach, while Hugging Face partners with AMD to optimize transformer performance. The Beatles announce a final record using John Lennon's voice via AI assist, raising concerns about the ethics of AI in music. We also dive into research papers exploring the efficiency of language models, the use of retrieval-enhanced models, and advanced techniques for next-gen language models. Contact: sergi@earkind.com Timestamps: 00:34 Introduction 01:39 France’s Mistral AI blows in with a $113M seed round at a $260M valuation to ta...2023-06-1514 min

GPT ReviewsOpenAI API Updates 💵 // Hot Takes on LMs 🔥 // LMs + Long Term Memory 🧠New function calling capability in the Chat Completions API, and the release of I-JEPA, a new AI model based on Yann LeCun's vision for more human-like AI. They also discuss the collaboration between OpenAI, Google DeepMind, and Anthropic with the UK government, and the potential risks of misuse. Finally, the team explores two papers, "Augmenting Language Models with Long-Term Memory" and "Benchmarking Neural Network Training Algorithms," which propose a framework for language models to memorize long history and a new benchmark to reliably identify training algorithm improvements. Contact: sergi@earkind.com Timestamps: 00:34 Introduction...2023-06-1414 min

GPT ReviewsChatGPT Leaks 🤐 // Apple execs on Facebook 🍎 // LMs Inferring Causation 🤖OpenAI's ChatGPT leaked potential new features for a business variant of the chatbot, while Aya aims to accelerate multilingual AI progress through open collaboration. LEACE and Leveraging Large Language Models for Scalable Vector Graphics-Driven Image Understanding propose new methods for improving fairness, interpretability, and computer vision. Contact: sergi@earkind.com Timestamps: 00:34 Introduction 01:33 Leaked ChatGPT Docs Reveal Potential New Features 03:08 Introducing Aya: An Open Science Initiative to Accelerate Multilingual AI Progress 06:01 Apple execs on Facebook 07:01 Fake sponsor 08:40 LEACE: Perfect linear concept erasure in closed form 2023-06-1315 min

GPT ReviewsMeta's MusicGen 🎶 // Improving Reasoning with Bard 🤔 // Multi-Modal Instruction Tuning 🌟They discuss Meta's MusicGen AI model that can generate music with very little data, Google's Bard AI language model that is improving at mathematical tasks, BlenderBot 3x that is trained using organic conversation and feedback data, and MIMIC-IT, a dataset comprising 2.8 million multimodal instruction-response pairs for vision-language tasks that has been used to train the Otter model that outperformed existing models on several tasks. Contact: sergi@earkind.com Timestamps: 00:34 Introduction 01:34 Meta just released MusicGen, a simple and controllable model for music generation 03:07 Bard is getting better at logic and reas...2023-06-1213 min

GPT ReviewsGPT-4 for Govt. 🌐 // Bard Boosted by 30%🚀 // Lossless Text Compression with LLMs💾Microsoft grants government access to GPT-4, while Google Bard improves by 30% in reasoning abilities. AlphaDev discovers faster sorting algorithms, and LLMZip proposes a lossless text compression algorithm using large language models. These advancements have the potential to transform how we automate responses, program computers, and compress and store text. Contact: sergi@earkind.com Timestamps: 00:34 Introduction 02:08 Microsoft Gives The Govt. GPT-4 03:30 Google Bard Just got 30% better 05:12 AlphaDev discovers faster sorting algorithms 06:17 Fake sponsor 07:56 Benchmarking Foundation Models with Language-Model-as-an-Examiner 10:17 LLMZip: Lossless Text Compression using L...2023-06-0914 min

GPT ReviewsInstagram Chatbot 📸 // Tim Cook on ChatGPT 🍎 // LLMs + Symbolic Memory 🧠Instagram's AI chatbot, Tim Cook's take on AI and ChatGPT, and Google Cloud's new no-cost generative AI training courses. They also delve into research papers that explore improving the trustworthiness of Large Language Models, training interpretable Transformers, and augmenting LLMs with databases as their symbolic memory. Contact: sergi@earkind.com Timestamps: 00:34 Introduction 01:46 Instagram is apparently testing an AI chatbot that lets you choose from 30 personalities 03:11 Tim Cook Talks ChatGPT and AI 05:00 Seven new no-cost generative AI training courses to advance your cloud career 06:28 Fake sponsor 2023-06-0815 min

GPT ReviewsChatGPT for Pokemon 🐱 // Apple Avoids AI at WWDC 🍎 // Sparse Quantization for Weight Compression 🏋️Apple's preference for "machine learning" over "AI," McKinsey and Company's cautious use of generative AI tools, ChatGPT's reasoning capabilities in the Pokemon universe, and SpQR's near-lossless compression for large language models. Contact: sergi@earkind.com Timestamps: 00:34 Introduction 01:31 For better or worse, Apple is avoiding the AI hype train 03:08 McKinsey says ‘about half’ of its employees are using generative AI 05:26 Why AI Will Save the World 07:04 Fake sponsor 09:11 PokemonChat: Auditing ChatGPT for Pokémon Universe Knowledge 10:40 On the Tool Manipulation Capability of Open-source...2023-06-0716 min

GPT Reviews4000 Job Losses 📉 // Zoom's Meeting Summaries 💬 // Pixels to UI Actions 👾The impact of AI on job loss, the new AI-powered feature on Zoom, and two groundbreaking papers on creating digital agents for graphical user interfaces and generating high-quality long-form text. Contact: sergi@earkind.com Timestamps: 00:34 Introduction 01:26 Nearly 4,000 Jobs Were Lost to AI Last Month, Report Shows 03:09 Zoom can now give you AI summaries of the meetings you’ve missed 05:03 Sterling Crispin on Apple's Vision Pro 06:23 Fake sponsor 08:23 From Pixels to UI Actions: Learning to Follow Instructions via Graphical User Interfaces 10:03 PLANNER: Generatin...2023-06-0615 min

GPT ReviewsPaul Graham on AI Boom 💰 // Drones Gone Rogue 🚁 // Object Segmentation 🎯AI drone going rogue during a defense conference and public market investors missing out on the AI boom. Our resident AI research expert, Belinda, joins us to discuss "Segment Anything in High Quality," which proposes HQ-SAM, a modification of the Segment Anything Model that accurately segments any object while maintaining efficiency and zero-shot generalizability. We also dive into "Fine-Grained Human Feedback Gives Better Rewards for Language Model Training," which explores how to improve language models by using fine-grained human feedback as an explicit training signal. Contact: sergi@earkind.com Timestamps: 00:34 Introduction 02:00 AI...2023-06-0515 min

GPT ReviewsJapan's AI & Copyright 🇯🇵 // Image Gen with GPT-4 🎨 // Discovering Conservation Laws 🛰️Google's new AI-powered search tool and its potential impact on the online publishing industry, as well as Japan's new policy on AI and copyright laws. We also explore "Discovering New Interpretable Conservation Laws as Sparse Invariants," which proposes an algorithm that can auto-discover conservation laws from differential equations, and "Controllable Text-to-Image Generation with GPT-4," which introduces Control-GPT, greatly boosting the controllability of image generation. Contact: sergi@earkind.com Timestamps: 00:34 Introduction 01:25 Google’s New AI-Powered Search Is A Beautiful Plagiarism Machine 02:58 Japan Hops On the AI Train 04:37 How importa...2023-06-0213 min

GPT ReviewsChatGPT Plugins Prompt Injections 💉 // More AI X-Risk 💀 // Unfair Evaluation of LLMs 👎Risks associated with AI technology, including prompt injection and the potential for AI to cause extinction, language models, with a new optimizer called MeZO proposed for fine-tuning large models and a paper investigating whether language models can identify their own "hallucinations." Additionally, a bias in the evaluation paradigm of using large language models to score the quality of responses generated by other models is uncovered, and two calibration strategies are proposed to address this bias. Contact: sergi@earkind.com Timestamps: 00:34 Introduction 01:33 ChatGPT Plugins aren’t safe, Prompt Injections 02:48 Statement on AI...2023-06-0115 min

GPT ReviewsMeta's Quest 3 🕶️ // OpenAI's potential EU exit 🇪🇺 // BiomedGPT 🩺Meta's Quest 3 headset promising improved AR performance, OpenAI's potential exit from Europe over new AI regulations, the impact of Twitter's algorithm on public opinion and democratic engagement, and a new pre-trained transformer model called BiomedGPT that could improve healthcare outcomes. Contact: sergi@earkind.com Timestamps: 00:34 Introduction 01:37 Quest 3 hands-on confirms Meta’s building a ‘far thinner and lighter’ headset 03:02 OpenAI Could Quit Europe Over New AI Rules, CEO Sam Altman Warns 05:01 State of GPT 06:01 Fake sponsor 08:13 Twitter's Algorithm: Amplifying Anger, Animosity, and Affective Polarization 10:00 ...2023-05-3115 min

GPT ReviewsNvidia's AI Supercomputers 🚀 // AI Antibiotic Discovery 💊 // Demo2Code 💻Nvidia's new AI supercomputers and services, the discovery of a new antibiotic using AI, the challenges of implementing AI in specialized fields like healthcare, and a study on Large Language Models' behavior in interactive social settings. Contact: sergi@earkind.com Timestamps: 00:34 Introduction 01:37 Nvidia debuts new AI supercomputers and services after shares skyrocket 03:01 New superbug-killing antibiotic discovered using AI 04:54 Production AI systems are really hard 06:09 Fake sponsor 08:23 Playing repeated games with Large Language Models 10:09 Demo2Code: From Summarizing Demonstrations to Synthesizing Code via Ext...2023-05-3014 min

GPT ReviewsParalized man walks with AI 🚶 // Pitfalls of Imitating ChatGPT 💬 // Voyager's Proficiency in Minecraft 🎮Swiss scientists helping a paralyzed man walk again with the help of a "digital bridge," ChatGPT's life-saving capabilities, Voyager's exceptional proficiency in playing Minecraft, and SPRING's potential for language models in completing sophisticated high-level trajectories in open-world games. These advancements have the potential to revolutionize the way we treat paralysis, learn, and play games, and open the door for the development of generalist agents that can learn and solve tasks in a variety of environments without human intervention. Contact: sergi@earkind.com Timestamps: 00:34 Introduction 01:23 Swiss Scientists Rebuild Spinal Cord With AI ...2023-05-2914 min

GPT ReviewsWindows Copilot 🚀 // RL for Chemistry 🧪 // Control-A-Video 🎥Microsoft's Windows Copilot, Spotify's use of AI for podcast ads, and Azure AI Studio's capabilities for businesses. We also dive into three research papers on digital chemistry, controllable text-to-video generation, and arithmetic tasks. Contact: sergi@earkind.com Timestamps: 00:34 Introduction 02:02 Microsoft announces Windows Copilot, an AI ‘personal assistant’ for Windows 11 03:46 Spotify may use AI to make host-read podcast ads that sound like real people 05:22 Microsoft’s Azure AI Studio lets developers build their own AI ‘copilots’ 07:06 Fake sponsor 09:20 ChemGymRL: An Interactive Framework for Reinforcement Learning for Dig...2023-05-2615 min

GPT ReviewsGoogle's AI Ads 🔎 // Robots for Real-World Tasks 🤖 // Schmidhuber on AI Future 💡Google introduces new AI-powered ad features, while 1X deploys humanoid robots for security and healthcare tasks, potentially addressing labor shortages. Juergen Schmidhuber shares his optimistic perspective on the potential of AI and the fear of a dystopian future. The team also dives into AI language model advancements, including textually pretrained speech language models, efficient finetuning of quantized LLMs, and aligning large language models through synthetic feedback. Contact: sergi@earkind.com Timestamps: 00:34 Introduction 01:53 Introducing a new era of AI-powered ads with Google 03:22 OpenAI-backed robot startup beats Elon Musk’s Tesla, deploy...2023-05-2515 min

GPT ReviewsAnthropic Raises $450M 💰 // Waymo-Uber Partnership 🚗 // RNNs strike back 🔃Anthropic raises $450 million in Series C funding to scale their reliable AI products, while Waymo and Uber partner to bring Waymo's autonomous driving technology to the Uber platform. We also explore two fascinating papers: RWKV, which combines the efficiency of transformers with the inference of RNNs, and RecurrentGPT, which enables interactive generation of arbitrarily long text without forgetting. Contact: sergi@earkind.com Timestamps: 00:34 Introduction 01:53 Anthropic Raises $450 Million in Series C Funding to Scale Reliable AI Products 03:11 Waymo and Uber partner to bring Waymo’s autonomous driving technology to the Uber p...2023-05-2415 min

GPT ReviewsMeta's GDPR fine 💰 // LIMA language model 🤖 // Composable Diffusion Models 🎨Meta's €1.2 billion GDPR fine over US mass surveillance is a major blow for the company, while LIMA, a language model trained with only 1,000 prompts and responses, demonstrated strong performance. Composable Diffusion (CoDi) is a powerful generative model that can handle any combination of input and output modalities, and CRITIC is a framework that allows large language models to self-correct with external feedback. Contact: sergi@earkind.com Timestamps: 00:34 Introduction 01:44 noyb win: € 1.2 billion fine against Meta over EU-US data transfers 04:21 GPT-4 didn't really score 90th percentile on the bar exam 05:43 ...2023-05-2316 min

GPT ReviewsApple Limits ChatGPT 🍎 // Hippocratic Raises $50M 💰 // Generalist Dynamic Model Control 🤖Apple is limiting the internal use of some AI-powered tools, while Hippocratic AI has raised $50 million to develop healthcare chatbots. We also discuss a new approach to teaching AI called Guidance and a new large language model called Med-PaLM 2 that improves upon previous work in medical question answering. Contact: sergi@earkind.com Timestamps: 566:54 Introduction 1905:32 Apple reportedly limits internal use of AI-powered tools like ChatGPT and GitHub Copilot 3376:42 Hippocratic AI Raises $50 Million To Power The Healthcare Bot Workforce 5500:37 Guidance: an alternative to prompting or chaining 6601:09 Fake spons...2023-05-2215 min

GPT ReviewsChatGPT app for iOS 💬 // An RLHF alternative 🧠 // Meta's MTIA v1 accelerator 🚀The ChatGPT app for iOS, Meta's breakthrough in AI accelerator technology with MTIA v1, and an article that explores the future of language and communication. We also discuss three fascinating AI research papers, including "TinyStories," "SLiC-HF," and "Bot or Human?" which presents a new way for online service providers to protect themselves against nefarious activities. Contact: sergi@earkind.com Timestamps: 00:34 Introduction 01:48 Introducing the ChatGPT app for iOS 03:13 MTIA v1: Meta’s first-generation AI inference accelerator 05:34 Life After Language 07:13 Fake sponsor 09:34 TinyStories: How Small Can Lan...2023-05-1916 min

GPT ReviewsPaLM 2 details leaked 🤐 // Apple's Speech Cloning 🗣️ // Contextual Pre-Training 📖Apple's AI-Driven Accessibility Updates and Google's PaLM 2. We also delve into the world of efficient audio generation with SoundStorm and explore the potential of active retrieval augmented generation. Finally, we discuss the importance of in-context learning in NLP and how Pre-Training to Learn in Context can enhance language models' abilities. Contact: sergi@earkind.com Timestamps: 00:34 Introduction 01:45 Apple's AI-Driven Accessibility Updates Include Text-to-Speech That Can Mimic Your Voice 03:10 PaLM 2 Details Leaked: it uses 5x the data the original PaLM trained on 05:32 Numbers every LLM Developer should know 06...2023-05-1814 min

GPT ReviewsSam Altman on AI Elections Risk 🗳️ // Anthropic-Zoom Partnership 🤝 // Symbol Tuning ICL ❎We discussed Sam Altman's proposed solutions for regulating AI's impact on elections, as well as the exciting partnership between Anthropic and Zoom to build customer-centric AI products. We also highlighted the LLM University by Cohere, a comprehensive learning resource for anyone interested in NLP using language models. Finally, we delved into three research papers, including Optimizing Memory Mapping Using Deep Reinforcement Learning, Professional Certification Benchmark Dataset, and Symbol Tuning for ICL in Large Language Models. Contact: sergi@earkind.com Timestamps: 00:34 Introduction 01:46 Sam Altman is concerned about AI being used to compromise el...2023-05-1718 min

GPT ReviewsEU AI regulation vs. Open Source 🇪🇺 // Prompt Engineering Guide ✅ // Byte-level Transformers 🤖The EU AI Act's potential impact on American tech companies, Meta's impressive new model architecture for better ad experiences, a multi-scale decoder architecture for modeling long sequences, and the need for best practices and regulations for AGI labs. Contact: sergi@earkind.com Timestamps: 00:34 Introduction 01:36 EU AI Act To Target US Open Source Software 03:16 New AI advancements drive Meta’s ads system performance and efficiency 05:29 Brex's Prompt Engineering Guide 06:41 Why we need uncensored models and how to train them 07:45 Fake sponsor: Scrub Busters 09:51...2023-05-1615 min

GPT ReviewsMeta's AI for Advertisers 💸 // GPT-4 explains GPT-2 Neurons 🧠 // ImageBlind 👨‍🦯Meta introduces generative AI features for advertisers, while Yuval Noah Harari and Yann LeCun debate the promises and dangers of AI. The WebLLMs project allows users to run large-language models and LLM-based chatbots directly in their browsers, and the ImageBind project learns a joint embedding across six different modalities. Contact: sergi@earkind.com Timestamps: 00:34 Introduction 02:07 Meta announces generative AI features for advertisers 03:28 Debate highlights: Yuval Noah Harari (Sapiens) versus Yann Le Cun (Meta) on artificial intelligence 04:38 WebLLMs project: running Language Models in your browser 06:17 Language model...2023-05-1516 min

GPT ReviewsAnthropic's 100k context // 🤗 Transformer Agents // Federated Instruction TuningAnthropic's Claude introduces 100k tokens context windows for businesses to analyze complex documents quickly, while Huggingface releases Transformers Agents, an experimental API for natural language processing and task completion. Stability AI also releases Stable Animation SDK, a powerful text-to-animation tool for artists and developers. Additionally, researchers propose new AI models and frameworks, including Federated Instruction Tuning, Evaluating Embedding APIs for Information Retrieval, and Pretraining Without Attention. Contact: sergi@earkind.com Timestamps: 00:34 Introduction 01:26 Anthropic's Claude introduces 100k tokens context Windows 02:54 Huggingface Releases Transformers Agents 04:10 Stability AI releases Stable Anim...2023-05-1215 min

GPT ReviewsPaLM 2 // Google's IO // Snapchat Influencer AI GirlfriendGPT Reviews Artificial Intelligence on Thursday, May 11, 2023 Google's use of generative AI in core products, a controversial AI version of a Snapchat influencer, advancements in language and video understanding with Google's PaLM 2 language model, VideoChat, the potential for unfaithful explanations in large language models revealed by NYU, and more. Contact: sergi@earkind.com Timestamps: 00:34 Introduction 01:22 Google IO is held with heavy focus on AI 03:00 Google finally demos generative AI in Search, with a waitlist starting today 04:08 A 23-year-old Snapchat influencer used OpenAI’s technology to creat...2023-05-1115 min

GPT ReviewsGPT Reviews — Intro ThemeTrailer for GPT Reviews, the daily show about AI, made by AI. Learn more about Earkind at www.earkind.com 2023-05-1000 min