The impending launch of the real Siri by Apple, with improvements in reliability and integration inside apps.
The Mixture-of-Agents approach to leverage the collective strengths of multiple large language models, achieving state-of-the-art performance.
The Proofread feature in Google's Gboard, using a large language model to provide sentence-level and paragraph-level corrections with a single tap.
The Comprehensive RAG Benchmark, shedding light on the limitations of current question answering models and laying the groundwork for a KDD Cup 2024 challenge.
Contact: sergi@earkind.com
Timestamps:
00:34 Introduction
02:13 Is Apple about to finally launch the real Siri?
04:03 WARC-GPT: An Open-Source Tool for Exploring Web Archives Using AI
05:10 Claude’s Character
06:47 Fake sponsor
08:45 Mixture-of-Agents Enhances Large Language Model Capabilities
10:18 Proofread: Fixes All Errors with One Tap
11:53 CRAG -- Comprehensive RAG Benchmark
14:03 Outro