1 code implementation • CVPR 2024 • Hyeongjun Kwon, Jinhyun Jang, Jin Kim, Kwonyoung Kim, Kwanghoon Sohn
Visual scenes are naturally organized in a hierarchy, where a coarse semantic is recursively composed of several fine details.
1 code implementation • ICCV 2023 • Jinhyun Jang, Jungin Park, Jin Kim, Hyeongjun Kwon, Kwanghoon Sohn
Recent DETR-based video grounding models learn moment queries to predict moment timestamps directly, without hand-crafted components such as pre-defined proposals or non-maximum suppression.
1 code implementation • 14 Aug 2023 • Jinhyun Jang, Taeyong Song, Kwanghoon Sohn
Aerial-to-ground image synthesis is an emerging and challenging problem that aims to synthesize a ground image from an aerial image.
no code implementations • CVPR 2023 • Hyeongjun Kwon, Taeyong Song, Somi Jeong, Jin Kim, Jinhyun Jang, Kwanghoon Sohn
Recent progress in deterministic prompt learning has made it a promising alternative for various downstream vision tasks, enabling models to learn powerful visual representations with the help of pre-trained vision-language models.