1 code implementation • ECCV 2020 • Minho Shim, Hsuan-I Ho, Jinhyung Kim, Dongyoon Wee
Person re-identification (re-ID) is the problem of visually identifying a person given a database of identities.
Image-To-Video Person Re-Identification Video-Based Person Re-Identification
1 code implementation • 1 Aug 2024 • Juseung Yun, Yi Hu, Jinhyung Kim, Jongseong Jang, Soonyoung Lee
To address this issue, we introduce EXAONEPath, a novel foundational model trained on patches that have undergone stain normalization.
1 code implementation • 13 Jun 2024 • Youngtaek Oh, Pyunghwan Ahn, Jinhyung Kim, Gwangmo Song, Soonyoung Lee, In So Kweon, Junmo Kim
Vision and language models (VLMs) such as CLIP have showcased remarkable zero-shot recognition abilities yet face challenges in visio-linguistic compositionality, particularly in linguistic comprehension and fine-grained image-text alignment.
no code implementations • 19 Dec 2023 • Bumsoo Kim, Yeonsik Jo, Jinhyung Kim, Seung Hwan Kim
Contrastive Language-Image Pretraining has emerged as a prominent approach for training vision and text encoders with uncurated image-text pairs from the web.
no code implementations • 19 Dec 2023 • Bumsoo Kim, Jinhyung Kim, Yeonsik Jo, Seung Hwan Kim
Based on the unified text embedding space, ECLIPSE compensates for the additional computational cost of the momentum image encoder by expediting the online image encoder.
no code implementations • 2 Jun 2023 • Minho Shim, Taeoh Kim, Jinhyung Kim, Dongyoon Wee
Summarizing a video requires a diverse understanding of the video, ranging from recognizing scenes to evaluating how much each frame is essential enough to be selected as a summary.
no code implementations • ICCV 2023 • Bumsoo Kim, Yeonsik Jo, Jinhyung Kim, Seunghwan Kim
Contrastive Language-Image Pretraining has emerged as a prominent approach for training vision and text encoders with uncurated image-text pairs from the web.
no code implementations • 30 Jun 2022 • Taeoh Kim, Jinhyung Kim, Minho Shim, Sangdoo Yun, Myunggu Kang, Dongyoon Wee, Sangyoun Lee
The magnitude of augmentation operations on each frame is changed by an effective mechanism, Fourier Sampling that parameterizes diverse, smooth, and realistic temporal variations.
no code implementations • 8 Apr 2022 • Jinhyung Kim, Taeoh Kim, Minho Shim, Dongyoon Han, Dongyoon Wee, Junmo Kim
FreqAug stochastically removes specific frequency components from the video so that learned representation captures essential features more from the remaining information for various downstream tasks.
3 code implementations • 7 Dec 2020 • Sangdoo Yun, Seong Joon Oh, Byeongho Heo, Dongyoon Han, Jinhyung Kim
Recent data augmentation strategies have been reported to address the overfitting problems in static image classifiers.
no code implementations • CVPR 2020 • Jinhyung Kim, Seunghwan Cha, Dongyoon Wee, Soonmin Bae, Junmo Kim
We present that selective regularization on this locally smoothed feature makes a model handle the low-frequency and high-frequency component distinctively, resulting in performance improvement.
no code implementations • 8 Jun 2017 • Jungsik Hwang, Jinhyung Kim, Ahmadreza Ahmadi, Minkyu Choi, Jun Tani
This study presents a dynamic neural network model based on the predictive coding framework for perceiving and predicting the dynamic visuo-proprioceptive patterns.
no code implementations • 9 Jul 2015 • Jungsik Hwang, Minju Jung, Naveen Madapana, Jinhyung Kim, Minkyu Choi, Jun Tani
The current study examines how adequate coordination among different cognitive processes including visual recognition, attention switching, action preparation and generation can be developed via learning of robots by introducing a novel model, the Visuo-Motor Deep Dynamic Neural Network (VMDNN).