1 code implementation • ECCV 2020 • Seonguk Seo, Joon-Young Lee, Bohyung Han
We propose a unified referring video object segmentation network (URVOS).
Ranked #7 on Referring Video Object Segmentation on MeViS
1 code implementation • ECCV 2020 • Tackgeun You, Bohyung Han
We propose a brand new benchmark for analyzing causality in traffic accident videos by decomposing an accident into a pair of events, cause and effect.
no code implementations • ECCV 2020 • Jinyoung Choi, Bohyung Han
We propose to learn a deep neural network for JPEG image compression, which predicts image-specific optimized quantization tables fully compatible with the standard JPEG encoder and decoder.
no code implementations • 12 Sep 2024 • Junsung Lee, Minsoo Kang, Bohyung Han
Our approach revises the original noise prediction network of a pretrained diffusion model by introducing a noise correction term.
no code implementations • 25 Jul 2024 • Seonguk Seo, Dongwan Kim, Bohyung Han
Based on these findings, we introduce a novel evaluation metric for machine unlearning, coined dimensional alignment, which measures the alignment between the eigenspaces of the forget and retain set samples.
1 code implementation • 19 May 2024 • JiHwan Kim, Junoh Kang, Jinyoung Choi, Bohyung Han
We propose a novel inference technique based on a pretrained diffusion model for text-conditional video generation.
1 code implementation • 15 Apr 2024 • Minji Kim, Dongyoon Han, Taekyung Kim, Bohyung Han
To be specific, we introduce Temporal Contextualization (TC), a layer-wise temporal information infusion mechanism for videos, which 1) extracts core information from each frame, 2) connects relevant information across frames for the summarization into context tokens, and 3) leverages the context tokens for feature encoding.
Ranked #1 on Zero-Shot Action Recognition on Kinetics
no code implementations • 22 Jan 2024 • Myungseo Song, Jinyoung Choi, Bohyung Han
We study the robustness of learned image compression models against adversarial attacks and present a training-free defense technique based on simple image transform functions.
1 code implementation • CVPR 2024 • Seonguk Seo, Jinkyu Kim, Geeho Kim, Bohyung Han
We propose a novel contrastive learning framework to effectively address the challenges of data heterogeneity in federated learning.
no code implementations • 9 Jan 2024 • Heewon Kim, Hyun Sung Chang, Kiho Cho, Jaeyun Lee, Bohyung Han
In this framework, we provide a proper objective function and an optimization algorithm based on two expectation-maximization (EM) cycles.
1 code implementation • CVPR 2024 • Donghun Ryou, Inju Ha, Hyewon Yoo, Dongwan Kim, Bohyung Han
AFM leverages mixup in the frequency domain to generate noisy images with distinctive and challenging noise characteristics all the while preserving the properties of authentic real-world noise.
1 code implementation • CVPR 2024 • Junoh Kang, Jinyoung Choi, Sungik Choi, Bohyung Han
We propose a novel diffusion-based image generation method called the observation-guided diffusion probabilistic model (OGDM), which effectively addresses the tradeoff between quality control and fast sampling.
no code implementations • 5 Sep 2023 • TaeHoon Kim, Pyunghwan Ahn, Sangyun Kim, Sihaeng Lee, Mark Marsden, Alessandra Sala, Seung Hwan Kim, Bohyung Han, Kyoung Mu Lee, Honglak Lee, Kyounghoon Bae, Xiangyu Wu, Yi Gao, Hailiang Zhang, Yang Yang, Weili Guo, Jianfeng Lu, Youngtaek Oh, Jae Won Cho, Dong-Jin Kim, In So Kweon, Junmo Kim, Wooyoung Kang, Won Young Jhoo, Byungseok Roh, Jonghwan Mun, Solgil Oh, Kenan Emir Ak, Gwang-Gook Lee, Yan Xu, Mingwei Shen, Kyomin Hwang, Wonsik Shin, Kamin Lee, Wonhark Park, Dongkwan Lee, Nojun Kwak, Yujin Wang, Yimu Wang, Tiancheng Gu, Xingchang Lv, Mingmao Sun
In this report, we introduce NICE (New frontiers for zero-shot Image Captioning Evaluation) project and share the results and outcomes of 2023 challenge.
no code implementations • 27 Apr 2023 • Mijeong Kim, Hyunjoon Lee, Bohyung Han
Although 3D-aware GANs based on neural radiance fields have achieved competitive performance, their applicability is still limited to objects or scenes with the ground-truths or prediction models for clearly defined canonical camera poses.
no code implementations • CVPR 2023 • Dongwan Kim, Bohyung Han
A primary goal of class-incremental learning is to strike a balance between stability and plasticity, where models should be both stable enough to retain knowledge learned from previously seen classes, and plastic enough to learn concepts from new classes.
no code implementations • 4 Apr 2023 • TaeHoon Kim, Bohyung Han
We propose a novel domain generalization technique, referred to as Randomized Adversarial Style Perturbation (RASP), which is motivated by the observation that the characteristics of each domain are captured by the feature statistics corresponding to style.
no code implementations • 4 Apr 2023 • TaeHoon Kim, Jaeyoo Park, Bohyung Han
The proposed approach has a unique perspective to utilize the previous knowledge in class incremental learning since it augments features of arbitrary target classes using examples in other classes via adversarial attacks on a previously learned classifier.
no code implementations • CVPR 2023 • Jaeyoo Park, Bohyung Han
We propose a visual-linguistic representation learning approach within a self-supervised learning framework by introducing a new operation, loss, and data augmentation strategy.
no code implementations • CVPR 2023 • Minsoo Kang, Doyup Lee, Jiseob Kim, Saehoon Kim, Bohyung Han
We propose a text-to-image generation algorithm based on deep neural networks when text captions for images are unavailable during training.
no code implementations • 28 Mar 2023 • Minsoo Kang, Hyewon Yoo, Eunhee Kang, Sehwan Ki, Hyong-Euk Lee, Bohyung Han
We propose an information-theoretic knowledge distillation approach for the compression of generative adversarial networks, which aims to maximize the mutual information between teacher and student networks via a variational optimization based on an energy-based model.
1 code implementation • NeurIPS 2023 • Zunzhi You, Daochang Liu, Bohyung Han, Chang Xu
Experimental results demonstrate that, in terms of adversarial robustness, NIM is superior to MIM thanks to its effective denoising capability.
no code implementations • 10 Jan 2023 • Seonguk Seo, Mustafa Gokhan Uzunbas, Bohyung Han, Sara Cao, Joena Zhang, Taipeng Tian, Ser-Nam Lim
Backfilling is the process of re-extracting all gallery embeddings from upgraded models in image retrieval systems.
2 code implementations • 11 Aug 2022 • Minji Kim, Seungkwan Lee, Jungseul Ok, Bohyung Han, Minsu Cho
Despite the extensive adoption of machine learning on the task of visual object tracking, recent learning-based approaches have largely overlooked the fact that visual tracking is a sequence-level task in its nature; they rely heavily on frame-level training, which inevitably induces inconsistency between training and testing in terms of both data distributions and task objectives.
Ranked #19 on Visual Object Tracking on TrackingNet
2 code implementations • ICML 2022 • Jinkyu Kim, Geeho Kim, Bohyung Han
A critical challenge of federated learning is data heterogeneity and imbalance across clients, which leads to inconsistency between local networks and unstable convergence of global models.
no code implementations • CVPR 2022 • Dong-Hwan Jang, Sanghyeok Chu, Joonhyuk Kim, Bohyung Han
The size and shape of the receptive field determine how the network aggregates local information and affect the overall performance of a model considerably.
no code implementations • 23 May 2022 • Ilchae Jung, Minji Kim, Eunhyeok Park, Bohyung Han
This paper presents a novel hybrid representation learning framework for streaming data, where an image frame in a video is modeled by an ensemble of two distinct deep neural networks; one is a low-bit quantized network and the other is a lightweight full-precision network.
1 code implementation • CVPR 2022 • Minsoo Kang, Jaeyoo Park, Bohyung Han
We present a novel class incremental learning approach based on deep neural networks, which continually learns new tasks with limited memory for storing examples in the previous tasks.
no code implementations • ICCV 2021 • Jaeyoo Park, Minsoo Kang, Bohyung Han
We tackle catastrophic forgetting problem in the context of class-incremental learning for video recognition, which has not been explored actively despite the popularity of continual learning.
1 code implementation • 25 Mar 2022 • Jaeyoo Park, Junha Kim, Bohyung Han
We present a meta-learning framework for weakly supervised anomaly detection in videos, where the detector learns to adapt to unseen types of abnormal activities effectively when only video-level annotations of binary labels are available.
no code implementations • 28 Feb 2022 • Dongwan Kim, Yi-Hsuan Tsai, Yumin Suh, Masoud Faraki, Sparsh Garg, Manmohan Chandraker, Bohyung Han
First, a gradient conflict in training due to mismatched label spaces is identified and a class-independent binary cross-entropy loss is proposed to alleviate such label conflicts.
1 code implementation • CVPR 2024 • Geeho Kim, Jinkyu Kim, Bohyung Han
To address this challenge, we propose a simple but effective federated learning framework, which improves the consistency across clients and facilitates the convergence of the server model.
no code implementations • 10 Jan 2022 • Seonguk Seo, Joon-Young Lee, Bohyung Han
We propose an information-theoretic bias measurement technique through a causal interpretation of spurious correlation, which is effective to identify the feature-level algorithmic bias by taking advantage of conditional mutual information.
no code implementations • CVPR 2022 • Minsu Ko, Eunju Cha, Sungjoo Suh, Huijin Lee, Jae-Joon Han, Jinwoo Shin, Bohyung Han
Unsupervised image-to-image translation has gained considerable attention due to the recent impressive progress based on generative adversarial networks (GANs).
2 code implementations • CVPR 2022 • Mijeong Kim, Seonguk Seo, Bohyung Han
We present an information-theoretic regularization technique for few-shot novel view synthesis based on neural implicit representation.
no code implementations • NeurIPS 2021 • Sanghyeok Chu, Dongwan Kim, Bohyung Han
By randomly eliminating certain class information in each training iteration, we effectively reduce feature dependencies among classes, and the model is able to learn more debiased and disentangled feature representations.
1 code implementation • ICCV 2021 • Myungseo Song, Jinyoung Choi, Bohyung Han
In addition, the proposed framework allows us to perform task-aware image compressions for various tasks, e. g., classification, by efficiently estimating optimized quality maps specific to target tasks for our encoding network.
1 code implementation • CVPR 2022 • Seonguk Seo, Joon-Young Lee, Bohyung Han
Dataset bias is a critical challenge in machine learning since it often leads to a negative impact on a model due to the unintended decision rules captured by spurious correlations.
no code implementations • 15 Jul 2021 • Jinyoung Choi, Bohyung Han
We propose a framework of generative adversarial networks with multiple discriminators, which collaborate to represent a real dataset more effectively.
no code implementations • CVPR 2023 • Geeho Kim, Junoh Kang, Bohyung Han
Visual recognition tasks are often limited to dealing with a small subset of classes simply because the labels for the remaining classes are unavailable.
no code implementations • CVPR 2021 • Jaehyoung Yoo, Dongwook Lee, Changyong Son, Sangil Jung, ByungIn Yoo, Changkyu Choi, Jae-Joon Han, Bohyung Han
RaScaNet reads only a few rows of pixels at a time using a convolutional neural network and then sequentially learns the representation of the whole image using a recurrent neural network.
1 code implementation • CVPR 2021 • Seungmin Lee, Dongwan Kim, Bohyung Han
We focus on designing an image-text compositor, i. e., integrating multi-modal inputs to produce a representation similar to that of the target image.
Ranked #17 on Image Retrieval on Fashion IQ
3 code implementations • CVPR 2021 • Jaedong Hwang, Seoung Wug Oh, Joon-Young Lee, Bohyung Han
We extend panoptic segmentation to the open-world and introduce an open-set panoptic segmentation (OPS) task.
no code implementations • NeurIPS 2021 • Dae Young Park, Moon-Hyun Cha, Changwook Jeong, Dae Sin Kim, Bohyung Han
In other words, at the time of optimizing a teacher model, the proposed algorithm learns the student branches jointly to obtain student-friendly representations.
no code implementations • NeurIPS 2020 • Seohyun Kim, Jaeyoo Park, Bohyung Han
We propose a local-to-global representation learning algorithm for 3D point cloud data, which is appropriate to handle various geometric transformations, especially rotation, without explicit data augmentation with respect to the transformations.
1 code implementation • ICML 2020 • Minsoo Kang, Bohyung Han
We propose a simple but effective data-driven channel pruning algorithm, which compresses deep neural networks in a differentiable way by exploiting the characteristics of operations.
1 code implementation • CVPR 2020 • Jonghwan Mun, Minsu Cho, Bohyung Han
This paper addresses the problem of text-to-video temporal grounding, which aims to identify the time interval in a video semantically relevant to a text query.
1 code implementation • AAAI Conference on Artificial Intelligence 2020 • Myungsub Choi, Heewon Kim, Bohyung Han, Ning Xu, Kyoung Mu Lee
Prevailing video frame interpolation techniques rely heavily on optical flow estimation and require additional model complexity and computational cost; it is also susceptible to error propagation in challenging scenarios with large motion and heavy occlusion.
no code implementations • 30 Jan 2020 • Jaedong Hwang, Seohyun Kim, Jeany Son, Bohyung Han
We present a weakly supervised instance segmentation algorithm based on deep community learning with multiple tasks.
Image-level Supervised Instance Segmentation object-detection +4
1 code implementation • NeurIPS 2019 • Paul Hongsuck Seo, Geeho Kim, Bohyung Han
Label noise is one of the critical sources that degrade generalization performance of deep neural networks significantly.
no code implementations • 29 Nov 2019 • Minsoo Kang, Jonghwan Mun, Bohyung Han
We present a novel framework of knowledge distillation that is capable of learning powerful and efficient student models from ensemble teacher networks.
no code implementations • 25 Nov 2019 • Ilchae Jung, Kihyun You, Hyeonwoo Noh, Minsu Cho, Bohyung Han
We propose a novel meta-learning framework for real-time object tracking with efficient model adaptation and channel pruning.
no code implementations • 21 Nov 2019 • Paul Hongsuck Seo, Piyush Sharma, Tomer Levinboim, Bohyung Han, Radu Soricut
Human ratings are currently the most accurate way to assess the quality of an image captioning model, yet most often the only used outcome of an expensive human rating evaluation is a few overall statistics over the evaluation dataset.
no code implementations • 18 Nov 2019 • Heewon Kim, Seokil Hong, Bohyung Han, Heesoo Myeong, Kyoung Mu Lee
We present an elegant framework of fine-grained neural architecture search (FGNAS), which allows to employ multiple heterogeneous operations within a single layer and can even generate compositional feature maps using several different base operations.
1 code implementation • 23 Oct 2019 • Heung-Chang Lee, Do-Guk Kim, Bohyung Han
We propose a novel neural architecture search algorithm via reinforcement learning by decoupling structure and operation search processes.
no code implementations • 3 Oct 2019 • Wonpyo Park, Paul Hongsuck Seo, Bohyung Han, Minsu Cho
We introduce a novel stochastic regularization technique for deep neural networks, which decomposes a layer into multiple branches with different parameters and merges stochastically sampled combinations of the outputs from the branches during training.
no code implementations • ICCV 2019 • Dongmin Park, Seokil Hong, Bohyung Han, Kyoung Mu Lee
Catastrophic forgetting is a critical challenge in training deep neural networks.
no code implementations • ECCV 2020 • Seonguk Seo, Yumin Suh, Dongwan Kim, Geeho Kim, Jongwoo Han, Bohyung Han
We propose a simple but effective multi-source domain generalization technique based on deep neural networks by incorporating optimized normalization layers that are specific to individual domains.
Ranked #3 on Unsupervised Domain Adaptation on PACS
1 code implementation • CVPR 2019 • Woong-Gi Chang, Tackgeun You, Seonguk Seo, Suha Kwak, Bohyung Han
In the first stage, we estimate pseudo-labels for the examples in the target domain using an external unsupervised domain adaptation algorithm---for example, MSTN or CPUA---integrating the proposed domain-specific batch normalization.
1 code implementation • 19 Apr 2019 • Ruotian Luo, Ning Zhang, Bohyung Han, Linjie Yang
We present a novel problem setting in zero-shot learning, zero-shot object recognition and detection in the context.
1 code implementation • CVPR 2019 • Jonghwan Mun, Linjie Yang, Zhou Ren, Ning Xu, Bohyung Han
Dense video captioning is an extremely challenging task since accurate and coherent description of events in a video requires holistic understanding of video contents as well as contextual reasoning of individual events.
no code implementations • NeurIPS 2018 • Jonghwan Mun, Kimin Lee, Jinwoo Shin, Bohyung Han
The proposed framework is model-agnostic and applicable to any tasks other than VQA, e. g., image classification with a large number of labels but few per-class examples, which is known to be difficult under existing MCL schemes.
1 code implementation • CVPR 2019 • Hyeonwoo Noh, Tae-hoon Kim, Jonghwan Mun, Bohyung Han
Specifically, we employ linguistic knowledge sources such as structured lexical database (e. g. WordNet) and visual descriptions for unsupervised task discovery, and transfer a learned task conditional visual classifier as an answering unit in a visual question answering model.
no code implementations • CVPR 2019 • Seonguk Seo, Paul Hongsuck Seo, Bohyung Han
The proposed loss function enables us to learn deep neural networks that predict confidence calibrated scores using a single inference.
3 code implementations • ECCV 2018 • Ilchae Jung, Jeany Son, Mooyeol Baek, Bohyung Han
We present a fast and accurate visual tracking algorithm based on the multi-domain convolutional neural network (MDNet).
no code implementations • ECCV 2018 • Paul Hongsuck Seo, Jongmin Lee, Deunsol Jung, Bohyung Han, Minsu Cho
Semantic correspondence is the problem of establishing correspondences across images depicting different instances of the same object or scene class.
no code implementations • ECCV 2018 • Paul Hongsuck Seo, Tobias Weyand, Jack Sim, Bohyung Han
Image geolocalization is the task of identifying the location depicted in a photo based only on its visual information.
Ranked #1 on Photo geolocation estimation on Im2GPS (Reference images metric)
3 code implementations • CVPR 2018 • Phuc Nguyen, Ting Liu, Gautam Prasad, Bohyung Han
We propose a weakly supervised temporal action localization algorithm on untrimmed videos using convolutional neural networks.
Ranked #13 on Weakly Supervised Action Localization on ActivityNet-1.3 (mAP@0.5 metric)
no code implementations • NeurIPS 2017 • Hyeonwoo Noh, Tackgeun You, Jonghwan Mun, Bohyung Han
Overfitting is one of the most critical challenges in deep neural networks, and there are various types of regularization methods to improve generalization performance.
no code implementations • NeurIPS 2017 • Paul Hongsuck Seo, Andreas Lehrmann, Bohyung Han, Leonid Sigal
From this memory, the model retrieves the previous attention, taking into account recency, which is most relevant for the current question, in order to resolve potentially ambiguous references.
Ranked #13 on Visual Dialog on VisDial v0.9 val (R@1 metric)
no code implementations • CVPR 2017 • Bohyung Han, Jack Sim, Hartwig Adam
We propose an extremely simple but effective regularization technique of convolutional neural networks (CNNs), referred to as BranchOut, for online ensemble tracking.
no code implementations • CVPR 2017 • Jeany Son, Mooyeol Baek, Minsu Cho, Bohyung Han
We propose Quadruplet Convolutional Neural Networks (Quad-CNN) for multi-object tracking, which learn to associate object detections across frames using quadruplet losses.
no code implementations • CVPR 2017 • Donghun Yeo, Jeany Son, Bohyung Han, Joon Hee Han
We propose a simple but effective tracking-by-segmentation algorithm using Absorbing Markov Chain (AMC) on superpixel segmentation, where target state is estimated by a combination of bottom-up and top-down approaches, and target segmentation is propagated to subsequent frames in a recursive manner.
no code implementations • CVPR 2017 • Seunghoon Hong, Donghun Yeo, Suha Kwak, Honglak Lee, Bohyung Han
Our goal is to overcome this limitation with no additional human intervention by retrieving videos relevant to target class labels from web repository, and generating segmentation labels from the retrieved videos to simulate strong supervision for semantic segmentation.
12 code implementations • ICCV 2017 • Hyeonwoo Noh, Andre Araujo, Jack Sim, Tobias Weyand, Bohyung Han
We propose an attentive local feature descriptor suitable for large-scale image retrieval, referred to as DELF (DEep Local Feature).
Ranked #2 on Image Retrieval on Oxf105k
1 code implementation • 12 Dec 2016 • Jonghwan Mun, Minsu Cho, Bohyung Han
Visual attention plays an important role to understand images and demonstrates its effectiveness in generating natural language descriptions of images.
no code implementations • ICCV 2017 • Jonghwan Mun, Paul Hongsuck Seo, Ilchae Jung, Bohyung Han
To address this objective, we automatically generate a customized synthetic VideoQA dataset using {\em Super Mario Bros.} gameplay videos so that it contains events with different levels of reasoning complexity.
no code implementations • 25 Aug 2016 • Hyeonseob Nam, Mooyeol Baek, Bohyung Han
We present an online visual tracking algorithm by managing multiple target appearance models in a tree structure.
no code implementations • 12 Jun 2016 • Hyeonwoo Noh, Bohyung Han
We propose a novel algorithm for visual question answering based on a recurrent deep neural network, where every module in the network corresponds to a complete answering unit with attention mechanism by itself.
Ranked #2 on Visual Question Answering (VQA) on VQA v1 test-std
1 code implementation • 8 Jun 2016 • Paul Hongsuck Seo, Zhe Lin, Scott Cohen, Xiaohui Shen, Bohyung Han
We propose a novel attention model that can accurately attends to target objects of various scales and shapes in images.
no code implementations • CVPR 2016 • Seunghoon Hong, Junhyuk Oh, Bohyung Han, Honglak Lee
We propose a novel weakly-supervised semantic segmentation algorithm based on Deep Convolutional Neural Network (DCNN).
no code implementations • ICCV 2015 • Jeany Son, Ilchae Jung, Kayoung Park, Bohyung Han
We evaluate the performance of our tracking algorithm based on the measures for segmentation masks, where our algorithm illustrates superior accuracy compared to the state-of-the-art segmentation-based tracking methods.
1 code implementation • CVPR 2016 • Hyeonwoo Noh, Paul Hongsuck Seo, Bohyung Han
We tackle image question answering (ImageQA) problem by learning a convolutional neural network (CNN) with a dynamic parameter layer whose weights are determined adaptively based on questions.
Image Retrieval with Multi-Modal Query Parameter Prediction +2
2 code implementations • CVPR 2016 • Hyeonseob Nam, Bohyung Han
Our algorithm pretrains a CNN using a large set of videos with tracking ground-truths to obtain a generic target representation.
3 code implementations • NeurIPS 2015 • Seunghoon Hong, Hyeonwoo Noh, Bohyung Han
We propose a novel deep neural network architecture for semi-supervised semantic segmentation using heterogeneous annotations.
no code implementations • CVPR 2016 • Yong-Deok Kim, Taewoong Jang, Bohyung Han, Seungjin Choi
We propose a Bayesian evidence framework to facilitate transfer learning from pre-trained deep convolutional neural networks (CNNs).
5 code implementations • ICCV 2015 • Hyeonwoo Noh, Seunghoon Hong, Bohyung Han
We propose a novel semantic segmentation algorithm by learning a deconvolution network.
Ranked #3 on Curved Text Detection on SCUT-CTW1500
no code implementations • 24 Feb 2015 • Seunghoon Hong, Tackgeun You, Suha Kwak, Bohyung Han
We propose an online visual tracking algorithm by learning discriminative saliency map using Convolutional Neural Network (CNN).
no code implementations • NeurIPS 2014 • Jan Feyereisl, Suha Kwak, Jeany Son, Bohyung Han
We propose a structured prediction algorithm for object localization based on Support Vector Machines (SVMs) using privileged information.
no code implementations • CVPR 2013 • Suha Kwak, Bohyung Han, Joon Hee Han
We present a joint estimation technique of event localization and role assignment when the target video event is described by a scenario.