no code implementations • ECCV 2020 • Sangryul Jeon, Dongbo Min, Seungryong Kim, Jihwan Choe, Kwanghoon Sohn
Establishing dense semantic correspondences requires dealing with large geometric variations caused by the unconstrained setting of images.
no code implementations • 8 Jan 2025 • Seungmin Baek, Soyul Lee, Hayeon Jo, Hyesong Choi, Dongbo Min
Transfer learning paradigm has driven substantial advancements in various vision tasks.
no code implementations • 26 Dec 2024 • Hyesong Choi, Daeun Kim, Sungmin Cha, Kwang Moo Yi, Dongbo Min
In this work, we dive deep into the impact of additive noise in pre-training deep networks.
no code implementations • 13 Sep 2024 • Hyewon Park, Hyejin Park, Jueun Ko, Dongbo Min
Continual Test Time Adaptation (CTTA) has emerged as a critical approach for bridging the domain gap between the controlled training environments and the real-world scenarios, enhancing model adaptability and robustness.
no code implementations • 4 Sep 2024 • SooMin Kim, Hyesong Choi, Jihye Ahn, Dongbo Min
Unlike other vision tasks where Transformer-based approaches are becoming increasingly common, stereo depth estimation is still dominated by convolution-based approaches.
no code implementations • 4 Sep 2024 • Hayeon Jo, Hyesong Choi, Minhee Cho, Dongbo Min
To secure flexible learning ability on input instances in various downstream tasks, we introduce an input-Conditioned Network (iCoN) in the dynamic adapter that enables instance-level feature transformation.
no code implementations • 4 Sep 2024 • Jihye Ahn, Hyesong Choi, SooMin Kim, Dongbo Min
In stereo matching, CNNs have traditionally served as the predominant architectures.
no code implementations • 4 Sep 2024 • Sumin Son, Hyesong Choi, Dongbo Min
Masked Image Modeling (MIM) techniques have redefined the landscape of computer vision, enabling pre-trained models to achieve exceptional performance across a broad spectrum of tasks.
no code implementations • 4 Sep 2024 • Minhee Cho, Hyesong Choi, Hayeon Jo, Dongbo Min
Unsupervised Domain Adaptation (UDA) endeavors to bridge the gap between a model trained on a labeled source domain and its deployment in an unlabeled target domain.
1 code implementation • 3 Sep 2024 • Hyejin Park, Dongbo Min
In the realm of Adversarial Distillation (AD), strategic and precise knowledge transfer from an adversarially robust teacher model to a less robust student model is paramount.
1 code implementation • 22 Jun 2024 • Xu Yin, Woobin Im, Dongbo Min, Yuchi Huo, Fei Pan, Sung-Eui Yoon
Generating reliable pseudo masks from image-level labels is challenging in the weakly supervised semantic segmentation (WSSS) task due to the lack of spatial information.
Ranked #3 on
Weakly-Supervised Semantic Segmentation
on PASCAL VOC 2012 train
(using extra training data)
no code implementations • 12 Apr 2024 • Hyesong Choi, Hunsang Lee, Seyoung Joung, Hyejin Park, Jiyeong Kim, Dongbo Min
Initially, we delve into an exploration of the inherent properties that a masked token ought to possess.
no code implementations • 12 Apr 2024 • Hyesong Choi, Hyejin Park, Kwang Moo Yi, Sungmin Cha, Dongbo Min
In this paper, we introduce Saliency-Based Adaptive Masking (SBAM), a novel and cost-effective approach that significantly enhances the pre-training performance of Masked Image Modeling (MIM) approaches by prioritizing token salience.
1 code implementation • ICCV 2023 • Hyesong Choi, Hunsang Lee, Seongwon Jeong, Dongbo Min
Generalization capability of vision-based deep reinforcement learning (RL) is indispensable to deal with dynamic environment changes that exist in visual observations.
no code implementations • CVPR 2023 • Hyesong Choi, Hunsang Lee, Wonil Song, Sangryul Jeon, Kwanghoon Sohn, Dongbo Min
Recent vision-based reinforcement learning (RL) methods have found extracting high-level features from raw pixels with self-supervised learning to be effective in learning policies.
1 code implementation • 6 Oct 2022 • Sunghwan Hong, Jisu Nam, Seokju Cho, Susung Hong, Sangryul Jeon, Dongbo Min, Seungryong Kim
Existing pipelines of semantic correspondence commonly include extracting high-level semantic features for the invariance against intra-class variations and background clutters.
1 code implementation • 1 Oct 2022 • Xu Yin, Dongbo Min, Yuchi Huo, Sung-Eui Yoon
This novel module transfers the predicted/ground-truth semantic labels to a self-defined potential domain to learn and infer decision boundaries along customized directions.
1 code implementation • 6 Sep 2022 • Sunkyung Kim, Hyesong Choi, Dongbo Min
The cross-task attention module (CTAM) is first applied to facilitate the exchange of relevant information between the multiple task features of the same scale.
1 code implementation • 27 Jul 2022 • Kwonyoung Kim, Jungin Park, Jiyoung Lee, Dongbo Min, Kwanghoon Sohn
To mitigate this issue, we propose to incorporate an auxiliary point-selective network into a meta-learning framework, called PointFix, to provide a robust initialization of stereo models for online stereo adaptation.
1 code implementation • CVPR 2022 • Jin Kim, Jiyoung Lee, Jungin Park, Dongbo Min, Kwanghoon Sohn
The rise of deep neural networks has led to several breakthroughs for semantic segmentation.
no code implementations • CVPR 2022 • Hunsang Lee, Hyesong Choi, Kwanghoon Sohn, Dongbo Min
In this way, the pair-wise operation establishes non-local connectivity while maintaining the desired properties of the local attention, i. e., inductive bias of locality and linear complexity to input resolution.
no code implementations • 22 Oct 2021 • Jaehoon Cho, Dongbo Min, Youngjung Kim, Kwanghoon Sohn
This manual is intended to provide a detailed description of the DIML/CVL RGB-D dataset.
no code implementations • 29 Sep 2021 • Hyesong Choi, Hunsang Lee, Wonil Song, Sangryul Jeon, Kwanghoon Sohn, Dongbo Min
The proposed method imposes similarity constraints on the three latent volumes; warped query representations by estimated flows, predicted target representations from the transition model, and target representations of future state.
no code implementations • 29 Sep 2021 • Wonil Song, Sangryul Jeon, Hyesong Choi, Kwanghoon Sohn, Dongbo Min
Given the latent representations as skills, a skill-based policy network is trained to generate similar trajectories to the learned decoder of the trajectory VAE.
no code implementations • 31 Aug 2021 • Jin Kim, Jiyoung Lee, Jungin Park, Dongbo Min, Kwanghoon Sohn
Domain generalization aims to learn a prediction model on multi-domain source data such that the model can generalize to a target domain with unknown statistics.
no code implementations • CVPR 2021 • Sangryul Jeon, Dongbo Min, Seungryong Kim, Kwanghoon Sohn
We present a novel framework for contrastive learning of pixel-level representation using only unlabeled video.
1 code implementation • 2 Jan 2021 • Matteo Poggi, Seungryong Kim, Fabio Tosi, Sunok Kim, Filippo Aleotti, Dongbo Min, Kwanghoon Sohn, Stefano Mattoccia
Stereo matching is one of the most popular techniques to estimate dense depth maps by finding the disparity between matching pixels on two, synchronized and rectified images.
no code implementations • 1 Jan 2021 • Sunkyung Kim, Hyesong Choi, Dongbo Min
More importantly, the pseudo depth labels serve to impose a cross-view consistency on the estimated monocular depth and segmentation maps of two views.
1 code implementation • ICCV 2021 • Hyesong Choi, Hunsang Lee, Sunkyung Kim, Sunok Kim, Seungryong Kim, Kwanghoon Sohn, Dongbo Min
To cope with the prediction error of the confidence map itself, we also leverage the threshold network that learns the threshold dynamically conditioned on the pseudo depth maps.
no code implementations • ICCV 2019 • Sangryul Jeon, Dongbo Min, Seungryong Kim, Kwanghoon Sohn
Based on the key insight that the two tasks can mutually provide supervisions to each other, our networks accomplish this through a joint loss function that alternatively imposes a consistency constraint between the two tasks, thereby boosting the performance and addressing the lack of training data in a principled manner.
no code implementations • 23 Apr 2019 • Jaehoon Cho, Dongbo Min, Youngjung Kim, Kwanghoon Sohn
In this paper, we present a simple yet effective approach for monocular depth estimation using stereo image pairs.
no code implementations • CVPR 2019 • Seungryong Kim, Dongbo Min, Somi Jeong, Sunok Kim, Sangryul Jeon, Kwanghoon Sohn
SAM-Net accomplishes this through an iterative process of establishing reliable correspondences by reducing the attribute discrepancy between the images and synthesizing attribute transferred images using the learned correspondences.
1 code implementation • NeurIPS 2018 • Seungryong Kim, Stephen Lin, Sangryul Jeon, Dongbo Min, Kwanghoon Sohn
Our networks accomplish this through an iterative process of estimating spatial transformations between the input images and using these transformations to generate aligned convolutional activations.
no code implementations • ECCV 2018 • Sangryul Jeon, Seungryong Kim, Dongbo Min, Kwanghoon Sohn
To the best of our knowledge, it is the first work that attempts to estimate dense affine transformation fields in a coarse-to-fine manner within deep networks.
no code implementations • ICCV 2017 • Seungryong Kim, Dongbo Min, Stephen Lin, Kwanghoon Sohn
In this way, our approach draws solutions from the continuous space of affine transformations in a manner that can be computed efficiently through constant-time edge-aware filtering and a proposed affine-varying CNN-based descriptor.
no code implementations • 18 Apr 2017 • Suhyuk Um, Jaeyoon Kim, Dongbo Min
To obtain the 2-D complex Gabor filter bank output, existing methods are repeatedly applied with respect to multiple orientations and frequencies.
1 code implementation • CVPR 2017 • Seungryong Kim, Dongbo Min, Bumsub Ham, Sangryul Jeon, Stephen Lin, Kwanghoon Sohn
The sampling patterns of local structure and the self-similarity measure are jointly learned within the proposed network in an end-to-end and multi-scale manner.
no code implementations • CVPR 2017 • Youngjung Kim, Hyungjoo Jung, Dongbo Min, Kwanghoon Sohn
The proposed framework enables the convolutional neural networks (CNNs) to operate as a prior or regularizer in the AM algorithm.
no code implementations • 27 Apr 2016 • Seungryong Kim, Dongbo Min, Bumsub Ham, Minh N. Do, Kwanghoon Sohn
In this paper, we propose a novel dense descriptor, called dense adaptive self-correlation (DASC), to estimate multi-modal and multi-spectral dense correspondences.
no code implementations • 26 Apr 2016 • Youngjung Kim, Dongbo Min, Bumsub Ham, Kwanghoon Sohn
In this paper, we introduce a highly efficient splitting-based method for global EPS that minimizes the objective function of ${l_2}$ data and prior terms (possibly non-smooth and non-convex) in linear time.
no code implementations • 21 Mar 2016 • Seungryong Kim, Dongbo Min, Stephen Lin, Kwanghoon Sohn
We present a novel descriptor, called deep self-convolutional activations (DeSCA), designed for establishing dense correspondences between images taken under different imaging modalities, such as different spectral ranges or lighting conditions.
no code implementations • ICCV 2015 • Yu Li, Dongbo Min, Michael S. Brown, Minh N. Do, Jiangbo Lu
However, the quality of the PMBP solution is tightly coupled with the local window size, over which the raw data cost is aggregated to mitigate ambiguity in the data constraint.
no code implementations • CVPR 2015 • Seungryong Kim, Dongbo Min, Bumsub Ham, Seungchul Ryu, Minh N. Do, Kwanghoon Sohn
To further improve the matching quality and runtime efficiency, we propose a patch-wise receptive field pooling, in which a sampling pattern is optimized with a discriminative learning.
1 code implementation • CVPR 2014 • Kang Zhang, Yuqiang Fang, Dongbo Min, Lifeng Sun, Shiqiang Yang. Shuicheng Yan, Qi Tian
We firstly reformulate cost aggregation from a unified optimization perspective and show that different cost aggregation methods essentially differ in the choices of similarity kernels.
no code implementations • CVPR 2013 • Jiangbo Lu, Hongsheng Yang, Dongbo Min, Minh N. Do
Recent studies on fast cost volume filtering based on efficient edge-aware filters have provided a fast alternative to solve discrete labeling problems, with the complexity independent of the support window size.