no code implementations • 2 Dec 2024 • Chanho Eom, Geon Lee, Kyunghwan Cho, Hyeonseok Jung, Moonsub Jin, Bumsub Ham
Our framework performs individual comparisons of local and global person representations between query and gallery images for attribute-based reID.
no code implementations • 9 Sep 2024 • Chanho Eom, Wonkyung Lee, Geon Lee, Bumsub Ham
To tackle this problem, we propose to factorize person images into identity-related and unrelated features.
no code implementations • 17 Jul 2024 • Dohyung Kim, Junghyup Lee, Jeimin Jeon, Jaehyeon Moon, Bumsub Ham
Based on this, we derive an upper bound of quantization error for the large gradients in terms of the quantization interval, and obtain an optimal condition for the interval minimizing the quantization error for large gradients.
no code implementations • 11 Jul 2024 • Byunggwan Son, Youngmin Oh, Donghyeon Baek, Bumsub Ham
Dataset distillation synthesizes a small set of images from a large-scale real dataset such that synthetic and real images share similar behavioral properties (e. g, distributions of gradients or features) during a training process.
no code implementations • 30 Apr 2024 • Junghyup Lee, Jeimin Jeon, Dohyung Kim, Bumsub Ham
It is thus difficult to control the degree of changes for quantized weights by scheduling the LR manually.
no code implementations • CVPR 2024 • Jaehyeon Moon, Dohyung Kim, Junyong Cheon, Bumsub Ham
In particular, the distribution of activations for each channel vary drastically according to input instances, making PTQ methods for CNNs inappropriate for ViTs.
no code implementations • CVPR 2024 • Junghyup Lee, Bumsub Ham
To address this issue, we propose AZ-NAS, a novel approach that leverages the ensemble of various zero-cost proxies to enhance the correlation between a predicted ranking of networks and the ground truth substantially in terms of the performance.
no code implementations • ICCV 2023 • Jongyoun Noh, Hyekang Park, Junghyup Lee, Bumsub Ham
In this paper, we present RankMixup, a novel mixup-based framework alleviating the problem of the mixture of labels for network calibration.
no code implementations • ICCV 2023 • Geon Lee, SangHoon Lee, Dohyung Kim, Younghoon Shin, Yongsang Yoon, Bumsub Ham
To address the camera bias problem, we also introduce a camera-diversity (CD) loss encouraging person images of the same pseudo label, but captured across various cameras, to involve more for discriminative feature learning, providing person representations robust to inter-camera variations.
Domain Adaptation Domain Adaptive Person Re-Identification +4
no code implementations • ICCV 2023 • Hyekang Park, Jongyoun Noh, Youngmin Oh, Donghyeon Baek, Bumsub Ham
We present in this paper an in-depth analysis of existing regularization-based methods, providing a better understanding on how they affect to network calibration.
no code implementations • 13 Oct 2022 • Youngmin Oh, Donghyeon Baek, Bumsub Ham
Based on this, we then introduce an adaptive logit regularizer (ALI) that enables our model to better learn new categories, while retaining knowledge for previous ones.
1 code implementation • 12 Oct 2022 • Donghyeon Baek, Youngmin Oh, SangHoon Lee, Junghyup Lee, Bumsub Ham
We introduce a CISS framework that alleviates the forgetting problem and facilitates learning novel classes effectively.
Class-Incremental Semantic Segmentation Knowledge Distillation
no code implementations • 22 Jul 2022 • Geon Lee, Chanho Eom, Wonkyung Lee, Hyekang Park, Bumsub Ham
We present a novel unsupervised domain adaptation method for semantic segmentation that generalizes a model trained with source images and corresponding ground-truth labels to a target domain.
1 code implementation • 21 Jul 2022 • SangHoon Lee, Youngmin Oh, Donghyeon Baek, Junghyup Lee, Bumsub Ham
To this end, we introduce a novel normalization layer, dubbed ProtoNorm, that calibrates features from pedestrian proposals, while considering a long-tail distribution of person IDs, enabling L2 normalized person representations to be discriminative.
1 code implementation • ICCV 2021 • Chanho Eom, Geon Lee, Junghyup Lee, Bumsub Ham
Spatial and temporal distractors in person videos, such as background clutter and partial occlusions over frames, respectively, make this task much more challenging than image-based person reID.
Ranked #8 on Person Re-Identification on MARS
1 code implementation • ICCV 2021 • Hyunjong Park, SangHoon Lee, Junghyup Lee, Bumsub Ham
We address the problem of visible-infrared person re-identification (VI-reID), that is, retrieving a set of person images, captured by visible or infrared cameras, in a cross-modal setting.
no code implementations • ICCV 2021 • Dohyung Kim, Junghyup Lee, Bumsub Ham
This alleviates the gradient mismatch, but causes a quantizer gap problem.
no code implementations • ICCV 2021 • Donghyeon Baek, Youngmin Oh, Bumsub Ham
To this end, we leverage visual and semantic encoders to learn a joint embedding space, where the semantic encoder transforms semantic features to semantic prototypes that act as centers for visual features of corresponding classes.
1 code implementation • CVPR 2021 • Jongyoun Noh, SangHoon Lee, Bumsub Ham
To this end, we propose a new convolutional neural network (CNN) architecture, dubbed HVPR, that integrates both features into a single 3D representation effectively and efficiently.
2 code implementations • CVPR 2021 • Youngmin Oh, Beomjun Kim, Bumsub Ham
We address the problem of weakly-supervised semantic segmentation (WSSS) using bounding box annotations.
1 code implementation • CVPR 2021 • Junghyup Lee, Dohyung Kim, Bumsub Ham
Network quantization aims at reducing bit-widths of weights and/or activations, particularly important for implementing deep neural networks with limited hardware resources.
no code implementations • ECCV 2020 • Wonkyung Lee, Junghyup Lee, Dohyung Kim, Bumsub Ham
The student and the decoder in the teacher, having the same network architecture as FSRCNN, try to reconstruct HR images.
2 code implementations • CVPR 2020 • Hyunjong Park, Jongyoun Noh, Bumsub Ham
To address this problem, we present an unsupervised learning approach to anomaly detection that considers the diversity of normal patterns explicitly, while lessening the representation capacity of CNNs.
no code implementations • 29 Nov 2019 • Junghyup Lee, Dohyung Kim, Wonkyung Lee, Jean Ponce, Bumsub Ham
We address the problem of semantic correspondence, that is, establishing a dense flow field between images depicting different instances of the same object or scene category.
1 code implementation • 21 Nov 2019 • Hyunjong Park, Bumsub Ham
To address this issue, we propose a new relation network for person reID that considers relations between individual body parts and the rest of them.
Ranked #3 on Person Re-Identification on CUHK03-C
1 code implementation • NeurIPS 2019 • Chanho Eom, Bumsub Ham
We address the problem of person re-identification (reID), that is, retrieving person images from a large dataset, given a query image of the person of interest.
Ranked #46 on Person Re-Identification on DukeMTMC-reID
2 code implementations • 17 Oct 2019 • Beomjun Kim, Jean Ponce, Bumsub Ham
Previous methods based on convolutional neural networks (CNNs) combine nonlinear activations of spatially-invariant kernels to estimate structural details and regress the filtering result.
no code implementations • 16 Sep 2019 • Chanho Eom, Hyunjong Park, Bumsub Ham
To address this problem, we propose to memorize temporal consistency in the video sequence, and leverage it for the task of depth prediction.
no code implementations • CVPR 2019 • Junghyup Lee, Dohyung Kim, Jean Ponce, Bumsub Ham
We address the problem of semantic correspondence, that is, establishing a dense flow field between images depicting different instances of the same object or scene category.
no code implementations • 27 Mar 2019 • Beomjun Kim, Jean Ponce, Bumsub Ham
We address the problem of upsampling a low-resolution (LR) depth map using a registered high-resolution (HR) color image of the same scene.
1 code implementation • ICCV 2017 • Kai Han, Rafael S. Rezende, Bumsub Ham, Kwan-Yee K. Wong, Minsu Cho, Cordelia Schmid, Jean Ponce
This paper addresses the problem of establishing semantic correspondences between images depicting different instances of the same object or scene category.
no code implementations • 21 Mar 2017 • Bumsub Ham, Minsu Cho, Cordelia Schmid, Jean Ponce
Finding image correspondences remains a challenging problem in the presence of intra-class variations and large changes in scene layout.
1 code implementation • CVPR 2017 • Seungryong Kim, Dongbo Min, Bumsub Ham, Sangryul Jeon, Stephen Lin, Kwanghoon Sohn
The sampling patterns of local structure and the self-similarity measure are jointly learned within the proposed network in an end-to-end and multi-scale manner.
no code implementations • 27 Apr 2016 • Seungryong Kim, Dongbo Min, Bumsub Ham, Minh N. Do, Kwanghoon Sohn
In this paper, we propose a novel dense descriptor, called dense adaptive self-correlation (DASC), to estimate multi-modal and multi-spectral dense correspondences.
no code implementations • 26 Apr 2016 • Youngjung Kim, Dongbo Min, Bumsub Ham, Kwanghoon Sohn
In this paper, we introduce a highly efficient splitting-based method for global EPS that minimizes the objective function of ${l_2}$ data and prior terms (possibly non-smooth and non-convex) in linear time.
no code implementations • CVPR 2016 • Bumsub Ham, Minsu Cho, Cordelia Schmid, Jean Ponce
Finding image correspondences remains a challenging problem in the presence of intra-class variations and large changes in scene layout.~Semantic flow methods are designed to handle images depicting different instances of the same object or scene category.
no code implementations • CVPR 2015 • Bumsub Ham, Minsu Cho, Jean Ponce
Regularizing images under a guidance signal has been used in various tasks in computer vision and computational photography, particularly for noise reduction and joint upsampling.
no code implementations • CVPR 2015 • Seungryong Kim, Dongbo Min, Bumsub Ham, Seungchul Ryu, Minh N. Do, Kwanghoon Sohn
To further improve the matching quality and runtime efficiency, we propose a patch-wise receptive field pooling, in which a sampling pattern is optimized with a discriminative learning.