Listen

Description

The RT-2 model translates vision and language into action, showing improved generalization capabilities and semantic and visual understanding beyond the robotic data it was exposed to.

OverflowAI is a new space for Stack Overflow's community and customers to explore the future of knowledge sharing together, featuring semantic search, enterprise knowledge ingestion, Slack integration, a Visual Studio Code extension, and AI community discussions.

"Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback" surveys the fundamental limitations and open problems of RLHF, as well as techniques to improve and complement it in practice.

"Robust Distortion-free Watermarks for Language Models" proposes a way to plant watermarks in text generated by language models that are robust to perturbations without changing the distribution over text up to a certain maximum generation budget.

Contact:  sergi@earkind.com

Timestamps:

00:34 Introduction

01:43 RT-2: New model translates vision and language into action

03:17 Announcing OverflowAI, the future of community & AI

04:57 Computer Scientists Discover Limits of Stochastic Gradient Descent

06:00 Fake sponsor

07:55 Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback

09:39 Uncertainty in Natural Language Generation: From Theory to Applications

11:59 Robust Distortion-free Watermarks for Language Models

13:37 Outro