arXiv research summaries for Computation Vision and Pattern Recognition from October 25, 2023.
Today's Themes (LLM-Generated)
- Image generation using text and diffusion models
- Improving generalizability and robustness of models through techniques like domain adaptation and test time augmentation
- Applications of vision and language models like CLIP for tasks like emotion recognition and sound symbolism
- 3D scene understanding through neural radiance fields and point clouds
- Document understanding via information extraction and visual question answering