Computer vision and pattern recognition research from arXiv for December 1, 2023.
Today's Themes (AI Generated)
- Improving neural video synthesis techniques such as diffusion models to generate higher-quality, customized video content.
- Leveraging vision-language models for open-world tasks like open-vocabulary object pose estimation and few-shot generalizable referring image segmentation.
- Enabling efficient learning of large vision models through techniques such as sequential modeling, which avoid the need for linguistic data.
- Applying generative neural models to tasks beyond image synthesis, such as guiding street-view visualization of social processes.
- Advancing neural rendering of dynamic real-world scenes, with improved realism and completion, especially for 360° capture.