arXiv research summaries for Computer Vision and Pattern Recognition from October 28, 2023.
Research Themes (AI Summary)
- Self-supervised learning for medical imaging using foundation models like Vision Transformers and diffusion models.
- Knowledge distillation to create efficient student models, for example for object detection in aerial images.
- Multi-modal fusion for tasks like audio-visual segmentation and multimodal re-identification.
- Diffusion models for text-to-image generation, including the synthesis of customized 360-degree panoramas.
- Point cloud processing, including classification, registration, and depth estimation.
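
As a rough illustration of the knowledge distillation theme above, the sketch below computes a temperature-softened distillation loss in plain Python. It is not taken from any of the summarized papers: the function names (`softmax`, `distillation_loss`), the default temperature of 2.0, and the example logits are all assumptions chosen for illustration, and the loss shown is the commonly used KL divergence between softened teacher and student distributions.

```python
import math


def softmax(logits, temperature=1.0):
    """Convert raw logits to probabilities, softened by a temperature."""
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def distillation_loss(student_logits, teacher_logits, temperature=2.0):
    """KL(teacher || student) on temperature-softened distributions.

    The result is scaled by temperature**2, a common convention that keeps
    gradient magnitudes comparable across temperatures (assumed here, not
    drawn from the summarized papers).
    """
    p_teacher = softmax(teacher_logits, temperature)
    p_student = softmax(student_logits, temperature)
    kl = sum(
        pt * math.log(pt / ps)
        for pt, ps in zip(p_teacher, p_student)
        if pt > 0.0
    )
    return kl * temperature ** 2


# Hypothetical logits for a three-class problem.
teacher = [4.0, 1.0, 0.5]
student = [2.5, 1.5, 0.2]
loss = distillation_loss(student, teacher)
```

In practice this term is usually combined with an ordinary supervised loss on the ground-truth labels, so the student learns from both the labels and the teacher's softened predictions.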