Computer vision and pattern recognition research from arXiv for November 30, 2023.
Today's Themes (AI Generated)
- Improving image and video generation with diffusion models and prompt learning
- Leveraging language models for multi-modal tasks like image captioning and text-to-image generation
- Self-supervised representation learning from images, video, and 3D data
- Addressing domain shift for semantic segmentation and other vision tasks
- Reconstructing and rendering dynamic 3D scenes from images and video