no code implementations • 7 May 2024 • Jihyun Kim, Changjae Oh, Hoseok Do, Soohyun Kim, Kwanghoon Sohn
To do this, we combine the strengths of Generative Adversarial networks (GANs) and diffusion models (DMs) by employing the multi-modal features in the DM into the latent space of the pre-trained GANs.
no code implementations • 2 May 2024 • Yik Lung Pang, Changjae Oh, Andrea Cavallaro
In contrast, sparse multi-view methods can take advantage of the additional views to tackle occlusion, while keeping the computational cost low compared to dense multi-view methods.
no code implementations • 1 Dec 2023 • Jaime Corsetti, Davide Boscaini, Changjae Oh, Andrea Cavallaro, Fabio Poiesi
We introduce the new setting of open-vocabulary object 6D pose estimation, in which a textual prompt is used to specify the object of interest.
1 code implementation • 2 May 2022 • Hengyi Wang, Changjae Oh
We present a refinement framework to boost the performance of pre-trained semi-supervised video object segmentation (VOS) models.
no code implementations • 23 Apr 2022 • Meng Xing, Zhiyong Feng, Yong Su, Changjae Oh
For a new in-distribution (ID) dataset, existing methods require retraining to capture the dataset-specific feature representation or data distribution.
Out-of-Distribution Detection Out of Distribution (OOD) Detection
no code implementations • 2 Mar 2022 • Hengyi Wang, Chaoran Zhu, Ziyin Ma, Changjae Oh
We present methods to estimate the physical properties of household containers and their fillings manipulated by humans.
no code implementations • 17 Feb 2022 • Ziyin Ma, Changjae Oh
The multi-color space fusion network takes the decomposed structure image as input and estimates the color corrected output by employing the feature representations from diverse color spaces of the input.
no code implementations • 22 Oct 2021 • Jaehoon Cho, Jiyoung Lee, Changjae Oh, Wonil Song, Kwanghoon Sohn
Video prediction, forecasting the future frames from a sequence of input frames, is a challenging task since the view changes are influenced by various factors, such as the global context surrounding the scene and local motion dynamics.
2 code implementations • 13 Aug 2020 • Ali Shahin Shamsabadi, Changjae Oh, Andrea Cavallaro
The proposed framework combines a structure loss and a semantic adversarial loss in a multi-task objective function to train a fully convolutional neural network.
2 code implementations • 27 Oct 2019 • Ali Shahin Shamsabadi, Changjae Oh, Andrea Cavallaro
This loss function accounts for both image detail enhancement and class misleading objectives.