Search Results for author: Junho Kim

Found 39 papers, 28 papers with code

Fully Geometric Panoramic Localization

no code implementations 29 Mar 2024 Junho Kim, Jiwon Jeong, Young Min Kim

We introduce a lightweight and accurate localization method that only utilizes the geometry of 2D-3D lines.

What if...?: Counterfactual Inception to Mitigate Hallucination Effects in Large Multimodal Models

1 code implementation 20 Mar 2024 Junho Kim, Yeon Ju Kim, Yong Man Ro

This paper presents a way of enhancing the reliability of Large Multimodal Models (LMMs) in addressing hallucination effects, where models generate incorrect or unrelated responses.

counterfactual Hallucination

Visual Style Prompting with Swapping Self-Attention

1 code implementation 20 Feb 2024 Jaeseok Jeong, Junho Kim, Yunjey Choi, Gayoung Lee, Youngjung Uh

Despite their remarkable capability, existing models still face challenges in achieving controlled generation with a consistent style, requiring costly fine-tuning or often inadequately transferring the visual elements due to content leakage.

Denoising Style Transfer +1

Causal Unsupervised Semantic Segmentation

1 code implementation 11 Oct 2023 Junho Kim, Byung-Kwan Lee, Yong Man Ro

Unsupervised semantic segmentation aims to achieve high-quality semantic grouping without human-labeled annotations.

Causal Inference Segmentation +2

Sequential Data Generation with Groupwise Diffusion Process

no code implementations 2 Oct 2023 Sangyun Lee, Gayoung Lee, Hyunsu Kim, Junho Kim, Youngjung Uh

We present the Groupwise Diffusion Model (GDM), which divides data into multiple groups and diffuses one group at one time interval in the forward diffusion process.

Disentanglement

Calibrating Panoramic Depth Estimation for Practical Localization and Mapping

1 code implementation ICCV 2023 Junho Kim, Eun Sun Lee, Young Min Kim

While panoramic images can easily capture the surrounding context from commodity devices, the estimated depth shares the limitations of conventional image-based depth estimation; the performance deteriorates under large domain shifts and the absolute values are still ambiguous to infer from 2D observations.

Depth Estimation Depth Prediction +1

LDL: Line Distance Functions for Panoramic Localization

1 code implementation ICCV 2023 Junho Kim, Changwoon Choi, Hojun Jang, Young Min Kim

We introduce LDL, a fast and robust algorithm that localizes a panorama to a 3D map using line segments.

Mitigating Adversarial Vulnerability through Causal Parameter Estimation by Adversarial Double Machine Learning

1 code implementation ICCV 2023 Byung-Kwan Lee, Junho Kim, Yong Man Ro

Adversarial examples derived from deliberately crafted perturbations on visual inputs can easily harm the decision process of deep neural networks.

Adversarial Robustness

User-friendly Image Editing with Minimal Text Input: Leveraging Captioning and Injection Techniques

no code implementations 5 Jun 2023 Sunwoo Kim, Wooseok Jang, Hyunsu Kim, Junho Kim, Yunjey Choi, Seungryong Kim, Gayeong Lee

From the users' standpoint, prompt engineering is a labor-intensive process, and users prefer to provide a target word for editing instead of a full sentence.

Prompt Engineering Sentence

Context-Preserving Two-Stage Video Domain Translation for Portrait Stylization

no code implementations 30 May 2023 Doyeon Kim, Eunji Ko, Hyunsu Kim, Yunji Kim, Junho Kim, Dongchan Min, Junmo Kim, Sung Ju Hwang

Portrait stylization, which translates a real human face image into an artistically stylized image, has attracted considerable interest and many prior works have shown impressive quality in recent years.

Translation

Custom-Edit: Text-Guided Image Editing with Customized Diffusion Models

no code implementations 25 May 2023 Jooyoung Choi, Yunjey Choi, Yunji Kim, Junho Kim, Sungroh Yoon

Text-to-image diffusion models can generate diverse, high-fidelity images based on user-provided text prompts.

text-guided-image-editing

Panoramic Image-to-Image Translation

no code implementations 11 Apr 2023 Soohyun Kim, Junho Kim, Taekyung Kim, Hwan Heo, Seungryong Kim, Jiyoung Lee, Jin-Hwa Kim

This task is difficult due to the geometric distortion of panoramic images and the lack of a panoramic image dataset with diverse conditions, like weather or time.

Image-to-Image Translation Translation

Dynamic Structure Pruning for Compressing CNNs

1 code implementation 17 Mar 2023 Jun-Hyung Park, Yeachan Kim, Junho Kim, Joon-Young Choi, SangKeun Lee

In this work, we introduce a novel structure pruning method, termed dynamic structure pruning, to identify optimal pruning granularities for intra-channel pruning.

Let 2D Diffusion Model Know 3D-Consistency for Robust Text-to-3D Generation

1 code implementation 14 Mar 2023 Junyoung Seo, Wooseok Jang, Min-Seop Kwak, Hyeonsu Kim, Jaehoon Ko, Junho Kim, Jin-Hwa Kim, Jiyoung Lee, Seungryong Kim

Text-to-3D generation has shown rapid progress in recent days with the advent of score distillation, a methodology of using pretrained text-to-2D diffusion models to optimize neural radiance field (NeRF) in the zero-shot setting.

3D Generation Single-View 3D Reconstruction +1

Demystifying Causal Features on Adversarial Examples and Causal Inoculation for Robust Network by Adversarial Instrumental Variable Regression

1 code implementation CVPR 2023 Junho Kim, Byung-Kwan Lee, Yong Man Ro

The origin of adversarial examples is still inexplicable in research fields, and it arouses arguments from various viewpoints, despite comprehensive investigations.

Adversarial Robustness

Learning Input-agnostic Manipulation Directions in StyleGAN with Text Guidance

1 code implementation 26 Feb 2023 Yoonjeon Kim, Hyunsu Kim, Junho Kim, Yunjey Choi, Eunho Yang

With the advantages of fast inference and human-friendly flexible manipulation, image-agnostic style manipulation via text guidance enables new applications that were not previously available.

Disentanglement

Efficient Pre-training of Masked Language Model via Concept-based Curriculum Masking

1 code implementation 15 Dec 2022 Mingyu Lee, Jun-Hyung Park, Junho Kim, Kang-Min Kim, SangKeun Lee

Masked language modeling (MLM) has been widely used for pre-training effective bidirectional representations, but incurs substantial training costs.

Language Modelling Masked Language Modeling

Diffusion Video Autoencoders: Toward Temporally Consistent Face Video Editing via Disentangled Video Encoding

no code implementations CVPR 2023 Gyeongman Kim, Hajin Shim, Hyunsu Kim, Yunjey Choi, Junho Kim, Eunho Yang

Inspired by the impressive performance of recent face image editing methods, several studies have been naturally proposed to extend these methods to the face video editing task.

Video Editing

MoDA: Map style transfer for self-supervised Domain Adaptation of embodied agents

no code implementations 29 Nov 2022 Eun Sun Lee, Junho Kim, SangWon Park, Young Min Kim

We propose a domain adaptation method, MoDA, which adapts a pretrained embodied agent to a new, noisy environment without ground-truth supervision.

Domain Adaptation Style Transfer +1

Generator Knows What Discriminator Should Learn in Unconditional GANs

1 code implementation 27 Jul 2022 Gayoung Lee, Hyunsu Kim, Junho Kim, Seonghyeon Kim, Jung-Woo Ha, Yunjey Choi

Here we explore the efficacy of dense supervision in unconditional generation and find that generator feature maps can be an alternative to costly semantic label maps.

Conditional Image Generation Unconditional Image Generation

CPO: Change Robust Panorama to Point Cloud Localization

1 code implementation 12 Jul 2022 Junho Kim, Hojun Jang, Changwoon Choi, Young Min Kim

By utilizing the unique equivariance of spherical projections, we propose very fast color histogram generation for a large number of camera poses without explicitly rendering images for all candidate poses.

Visual Localization

Rarity Score : A New Metric to Evaluate the Uncommonness of Synthesized Images

1 code implementation 17 Jun 2022 Jiyeon Han, Hwanil Choi, Yunjey Choi, Junho Kim, Jung-Woo Ha, Jaesik Choi

In this work, we propose a new evaluation metric, called 'rarity score', to measure the individual rarity of each image synthesized by generative models.

Image Generation

Distilling Robust and Non-Robust Features in Adversarial Examples by Information Bottleneck

1 code implementation NeurIPS 2021 Junho Kim, Byung-Kwan Lee, Yong Man Ro

Adversarial examples, generated by carefully crafted perturbation, have attracted considerable attention in research fields.

Adversarial Robustness

Generating Videos with Dynamics-aware Implicit Generative Adversarial Networks

1 code implementation ICLR 2022 Sihyun Yu, Jihoon Tack, Sangwoo Mo, Hyunsu Kim, Junho Kim, Jung-Woo Ha, Jinwoo Shin

In this paper, we found that the recent emerging paradigm of implicit neural representations (INRs) that encodes a continuous signal into a parameterized neural network effectively mitigates the issue.

Generative Adversarial Network Video Generation

Feature Statistics Mixing Regularization for Generative Adversarial Networks

1 code implementation CVPR 2022 Junho Kim, Yunjey Choi, Youngjung Uh

In generative adversarial networks, improving discriminators is one of the key components for generation performance.

Self-Supervised Domain Adaptation for Visual Navigation with Global Map Consistency

no code implementations 14 Oct 2021 Eun Sun Lee, Junho Kim, Young Min Kim

We propose a light-weight, self-supervised adaptation for a visual navigation agent to generalize to unseen environments.

Test-time Adaptation Visual Navigation

SGoLAM: Simultaneous Goal Localization and Mapping for Multi-Object Goal Navigation

1 code implementation 14 Oct 2021 Junho Kim, Eun Sun Lee, MinGi Lee, Donsu Zhang, Young Min Kim

We present SGoLAM, short for simultaneous goal localization and mapping, which is a simple and efficient algorithm for Multi-Object Goal navigation.

Navigate Visual Navigation

CTRL-C: Camera calibration TRansformer with Line-Classification

1 code implementation ICCV 2021 Jinwoo Lee, Hyunsung Go, Hyunjoon Lee, Sunghyun Cho, Minhyuk Sung, Junho Kim

In this work, we propose Camera calibration TRansformer with Line-Classification (CTRL-C), an end-to-end neural network-based approach to single image camera calibration, which directly estimates the camera parameters from an image and a set of line segments.

Camera Calibration Classification

PICCOLO: Point Cloud-Centric Omnidirectional Localization

2 code implementations ICCV 2021 Junho Kim, Changwoon Choi, Hojun Jang, Young Min Kim

Our loss function, called sampling loss, is point cloud-centric, evaluated at the projected location of every point in the point cloud.

Visual Localization

Exploiting Spatial Dimensions of Latent in GAN for Real-time Image Editing

1 code implementation CVPR 2021 Hyunsu Kim, Yunjey Choi, Junho Kim, Sungjoo Yoo, Youngjung Uh

Although manipulating the latent vectors controls the synthesized outputs, editing real images with GANs suffers from either i) time-consuming optimization for projecting real images to the latent vectors or ii) inaccurate embedding through an encoder.

Image Manipulation valid

A StyleMap-Based Generator for Real-Time Image Projection and Local Editing

no code implementations 1 Jan 2021 Hyunsu Kim, Yunjey Choi, Junho Kim, Sungjoo Yoo, Youngjung Uh

State-of-the-art GAN-based methods for editing real images suffer from time-consuming operations in projecting real images to latent vectors.

Image Manipulation

Neural Geometric Parser for Single Image Camera Calibration

1 code implementation ECCV 2020 Jinwoo Lee, Minhyuk Sung, Hyunjoon Lee, Junho Kim

With the supervision of datasets consisting of the horizontal line and focal length of the images, our networks can be trained to estimate the same camera parameters.

Camera Calibration
