The RT-2 model translates vision and language into action, showing improved generalization capabilities and semantic and visual understanding beyond the robotic data it was exposed to.
OverflowAI is a new space for Stack Overflow's community and customers to explore the future of knowledge sharing together, featuring semantic search, enterprise knowledge ingestion, Slack integration, a Visual Studio Code extension, and AI community discussions.
"Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback" surveys the fundamental limitations and open problems of RLHF, as well as techniques to improve and complement it in practice.
"Robust Distortion-free Watermarks for Language Models" proposes a way to plant watermarks in text generated by language models that are robust to perturbations without changing the distribution over text up to a certain maximum generation budget.
Contact:Â Â sergi@earkind.com
Timestamps:
00:34 Introduction
01:43Â RT-2: New model translates vision and language into action
03:17Â Announcing OverflowAI, the future of community & AI
04:57Â Computer Scientists Discover Limits of Stochastic Gradient Descent
06:00 Fake sponsor
07:55Â Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback
09:39Â Uncertainty in Natural Language Generation: From Theory to Applications
11:59Â Robust Distortion-free Watermarks for Language Models
13:37 Outro