Search Results for author: Junho Kim

Found 51 papers, 34 papers with code

DECOR:Decomposition and Projection of Text Embeddings for Text-to-Image Customization

no code implementations12 Dec 2024 Geonhui Jang, Jin-Hwa Kim, Yong-Hyun Park, Junho Kim, Gayoung Lee, Yonghyun Jeong

However, fine-tuning with a limited number of reference images often leads to overfitting, resulting in issues such as prompt misalignment or content leakage.

Look Every Frame All at Once: Video-Ma$^2$mba for Efficient Long-form Video Understanding with Multi-Axis Gradient Checkpointing

no code implementations29 Nov 2024 Hosu Lee, Junho Kim, Hyunjun Kim, Yong Man Ro

With the growing scale and complexity of video data, efficiently processing long video sequences poses significant challenges due to the quadratic increase in memory and computational demands associated with existing transformer-based Large Multi-modal Models (LMMs).

Mamba State Space Models +1

SALOVA: Segment-Augmented Long Video Assistant for Targeted Retrieval and Routing in Long-Form Video Analysis

no code implementations25 Nov 2024 Junho Kim, Hyunjun Kim, Hosu Lee, Yong Man Ro

Despite advances in Large Multi-modal Models, applying them to long and untrimmed video content remains challenging due to limitations in context length and substantial memory overhead.

Descriptive Retrieval

D-Cube: Exploiting Hyper-Features of Diffusion Model for Robust Medical Classification

1 code implementation17 Nov 2024 Minhee Jang, Juheon Son, Thanaporn Viriyasaranon, Junho Kim, Jang-Hwan Choi

The integration of deep learning technologies in medical imaging aims to enhance the efficiency and accuracy of cancer diagnosis, particularly for pancreatic and breast cancers, which present significant diagnostic challenges due to their high mortality rates and complex imaging characteristics.

Contrastive Learning feature selection

CapeLLM: Support-Free Category-Agnostic Pose Estimation with Multimodal Large Language Models

no code implementations11 Nov 2024 Junho Kim, Hyungjin Chung, Byung-Hoon Kim

Our approach sets a new state-of-the-art on the MP-100 benchmark in the challenging 1-shot setting, marking a significant advancement in the field of category-agnostic pose estimation.

Category-Agnostic Pose Estimation Language Modeling +5

C2A: Client-Customized Adaptation for Parameter-Efficient Federated Learning

1 code implementation1 Nov 2024 Yeachan Kim, Junho Kim, Wing-Lam Mok, Jun-Hyung Park, SangKeun Lee

Despite the versatility of pre-trained language models (PLMs) across domains, their large memory footprints pose significant challenges in federated learning (FL), where the training model has to be distributed between a server and clients.

Federated Learning parameter-efficient fine-tuning

CleaR: Towards Robust and Generalized Parameter-Efficient Fine-Tuning for Noisy Label Learning

1 code implementation31 Oct 2024 Yeachan Kim, Junho Kim, SangKeun Lee

However, as datasets in such environments often contain noisy labels that adversely affect performance, PEFT methods are inevitably exposed to noisy labels.

parameter-efficient fine-tuning

HiPPO-KAN: Efficient KAN Model for Time Series Analysis

no code implementations19 Oct 2024 SangJong Lee, Jin-Kwang Kim, Junho Kim, Taehan Kim, James Lee

In this study, we introduces a parameter-efficient model that outperforms traditional models in time series forecasting, by integrating High-order Polynomial Projection (HiPPO) theory into the Kolmogorov-Arnold network (KAN) framework.

Time Series Time Series Forecasting

MELT: Materials-aware Continued Pre-training for Language Model Adaptation to Materials Science

1 code implementation19 Oct 2024 Junho Kim, Yeachan Kim, Jun-Hyung Park, Yerim Oh, Suho Kim, SangKeun Lee

We introduce a novel continued pre-training method, MELT (MatEriaLs-aware continued pre-Training), specifically designed to efficiently adapt the pre-trained language models (PLMs) for materials science.

Language Modeling Language Modelling

Mentor-KD: Making Small Language Models Better Multi-step Reasoners

1 code implementation11 Oct 2024 Hojae Lee, Junho Kim, SangKeun Lee

Large Language Models (LLMs) have displayed remarkable performances across various complex tasks by leveraging Chain-of-Thought (CoT) prompting.

Knowledge Distillation

Direct Unlearning Optimization for Robust and Safe Text-to-Image Models

no code implementations17 Jul 2024 Yong-Hyun Park, Sangdoo Yun, Jin-Hwa Kim, Junho Kim, Geonhui Jang, Yonghyun Jeong, Junghyo Jo, Gayoung Lee

In this paper, we propose Direct Unlearning Optimization (DUO), a novel framework for removing Not Safe For Work (NSFW) content from T2I models while preserving their performance on unrelated topics.

Red Teaming

CODE: Contrasting Self-generated Description to Combat Hallucination in Large Multi-modal Models

no code implementations4 Jun 2024 Junho Kim, Hyunjun Kim, Yeonju Kim, Yong Man Ro

Large Multi-modal Models (LMMs) have recently demonstrated remarkable abilities in visual context understanding and coherent response generation.

Hallucination Informativeness +1

Fully Geometric Panoramic Localization

1 code implementation CVPR 2024 Junho Kim, Jiwon Jeong, Young Min Kim

We introduce a lightweight and accurate localization method that only utilizes the geometry of 2D-3D lines.

Indoor Localization Visual Localization

What if...?: Thinking Counterfactual Keywords Helps to Mitigate Hallucination in Large Multi-modal Models

1 code implementation20 Mar 2024 Junho Kim, Yeon Ju Kim, Yong Man Ro

This paper presents a way of enhancing the reliability of Large Multi-modal Models (LMMs) in addressing hallucination, where the models generate cross-modal inconsistent responses.

counterfactual Hallucination +1

Visual Style Prompting with Swapping Self-Attention

1 code implementation20 Feb 2024 Jaeseok Jeong, Junho Kim, Yunjey Choi, Gayoung Lee, Youngjung Uh

Despite their remarkable capability, existing models still face challenges in achieving controlled generation with a consistent style, requiring costly fine-tuning or often inadequately transferring the visual elements due to content leakage.

Denoising Style Transfer +1

Causal Unsupervised Semantic Segmentation

1 code implementation11 Oct 2023 Junho Kim, Byung-Kwan Lee, Yong Man Ro

Unsupervised semantic segmentation aims to achieve high-quality semantic grouping without human-labeled annotations.

Causal Inference Segmentation +2

Sequential Data Generation with Groupwise Diffusion Process

no code implementations2 Oct 2023 Sangyun Lee, Gayoung Lee, Hyunsu Kim, Junho Kim, Youngjung Uh

We present the Groupwise Diffusion Model (GDM), which divides data into multiple groups and diffuses one group at one time interval in the forward diffusion process.

Disentanglement

LDL: Line Distance Functions for Panoramic Localization

1 code implementation ICCV 2023 Junho Kim, Changwoon Choi, Hojun Jang, Young Min Kim

We introduce LDL, a fast and robust algorithm that localizes a panorama to a 3D map using line segments.

Calibrating Panoramic Depth Estimation for Practical Localization and Mapping

1 code implementation ICCV 2023 Junho Kim, Eun Sun Lee, Young Min Kim

While panoramic images can easily capture the surrounding context from commodity devices, the estimated depth shares the limitations of conventional image-based depth estimation; the performance deteriorates under large domain shifts and the absolute values are still ambiguous to infer from 2D observations.

Depth Estimation Depth Prediction +1

Mitigating Adversarial Vulnerability through Causal Parameter Estimation by Adversarial Double Machine Learning

1 code implementation ICCV 2023 Byung-Kwan Lee, Junho Kim, Yong Man Ro

Adversarial examples derived from deliberately crafted perturbations on visual inputs can easily harm decision process of deep neural networks.

Adversarial Robustness

User-friendly Image Editing with Minimal Text Input: Leveraging Captioning and Injection Techniques

no code implementations5 Jun 2023 Sunwoo Kim, Wooseok Jang, Hyunsu Kim, Junho Kim, Yunjey Choi, Seungryong Kim, Gayeong Lee

From the users' standpoint, prompt engineering is a labor-intensive process, and users prefer to provide a target word for editing instead of a full sentence.

Prompt Engineering Sentence

Context-Preserving Two-Stage Video Domain Translation for Portrait Stylization

no code implementations30 May 2023 Doyeon Kim, Eunji Ko, Hyunsu Kim, Yunji Kim, Junho Kim, Dongchan Min, Junmo Kim, Sung Ju Hwang

Portrait stylization, which translates a real human face image into an artistically stylized image, has attracted considerable interest and many prior works have shown impressive quality in recent years.

Translation

Custom-Edit: Text-Guided Image Editing with Customized Diffusion Models

no code implementations25 May 2023 Jooyoung Choi, Yunjey Choi, Yunji Kim, Junho Kim, Sungroh Yoon

Text-to-image diffusion models can generate diverse, high-fidelity images based on user-provided text prompts.

text-guided-image-editing

Panoramic Image-to-Image Translation

no code implementations11 Apr 2023 Soohyun Kim, Junho Kim, Taekyung Kim, Hwan Heo, Seungryong Kim, Jiyoung Lee, Jin-Hwa Kim

This task is difficult due to the geometric distortion of panoramic images and the lack of a panoramic image dataset with diverse conditions, like weather or time.

Image-to-Image Translation Translation

Dynamic Structure Pruning for Compressing CNNs

1 code implementation17 Mar 2023 Jun-Hyung Park, Yeachan Kim, Junho Kim, Joon-Young Choi, SangKeun Lee

In this work, we introduce a novel structure pruning method, termed as dynamic structure pruning, to identify optimal pruning granularities for intra-channel pruning.

Let 2D Diffusion Model Know 3D-Consistency for Robust Text-to-3D Generation

1 code implementation14 Mar 2023 Junyoung Seo, Wooseok Jang, Min-Seop Kwak, Hyeonsu Kim, Jaehoon Ko, Junho Kim, Jin-Hwa Kim, Jiyoung Lee, Seungryong Kim

Text-to-3D generation has shown rapid progress in recent days with the advent of score distillation, a methodology of using pretrained text-to-2D diffusion models to optimize neural radiance field (NeRF) in the zero-shot setting.

3D Generation Single-View 3D Reconstruction +1

Demystifying Causal Features on Adversarial Examples and Causal Inoculation for Robust Network by Adversarial Instrumental Variable Regression

1 code implementation CVPR 2023 Junho Kim, Byung-Kwan Lee, Yong Man Ro

The origin of adversarial examples is still inexplicable in research fields, and it arouses arguments from various viewpoints, albeit comprehensive investigations.

Adversarial Robustness

Learning Input-agnostic Manipulation Directions in StyleGAN with Text Guidance

1 code implementation26 Feb 2023 Yoonjeon Kim, Hyunsu Kim, Junho Kim, Yunjey Choi, Eunho Yang

With the advantages of fast inference and human-friendly flexible manipulation, image-agnostic style manipulation via text guidance enables new applications that were not previously available.

Disentanglement

Efficient Pre-training of Masked Language Model via Concept-based Curriculum Masking

1 code implementation15 Dec 2022 Mingyu Lee, Jun-Hyung Park, Junho Kim, Kang-Min Kim, SangKeun Lee

Masked language modeling (MLM) has been widely used for pre-training effective bidirectional representations, but incurs substantial training costs.

Language Modeling Language Modelling +1

Diffusion Video Autoencoders: Toward Temporally Consistent Face Video Editing via Disentangled Video Encoding

no code implementations CVPR 2023 Gyeongman Kim, Hajin Shim, Hyunsu Kim, Yunjey Choi, Junho Kim, Eunho Yang

Inspired by the impressive performance of recent face image editing methods, several studies have been naturally proposed to extend these methods to the face video editing task.

Video Editing

MoDA: Map style transfer for self-supervised Domain Adaptation of embodied agents

no code implementations29 Nov 2022 Eun Sun Lee, Junho Kim, SangWon Park, Young Min Kim

We propose a domain adaptation method, MoDA, which adapts a pretrained embodied agent to a new, noisy environment without ground-truth supervision.

Domain Adaptation Style Transfer +1

Generator Knows What Discriminator Should Learn in Unconditional GANs

1 code implementation27 Jul 2022 Gayoung Lee, Hyunsu Kim, Junho Kim, Seonghyeon Kim, Jung-Woo Ha, Yunjey Choi

Here we explore the efficacy of dense supervision in unconditional generation and find generator feature maps can be an alternative of cost-expensive semantic label maps.

Conditional Image Generation Unconditional Image Generation

CPO: Change Robust Panorama to Point Cloud Localization

1 code implementation12 Jul 2022 Junho Kim, Hojun Jang, Changwoon Choi, Young Min Kim

By utilizing the unique equivariance of spherical projections, we propose very fast color histogram generation for a large number of camera poses without explicitly rendering images for all candidate poses.

Visual Localization

Rarity Score : A New Metric to Evaluate the Uncommonness of Synthesized Images

1 code implementation17 Jun 2022 Jiyeon Han, Hwanil Choi, Yunjey Choi, Junho Kim, Jung-Woo Ha, Jaesik Choi

In this work, we propose a new evaluation metric, called `rarity score', to measure the individual rarity of each image synthesized by generative models.

Diversity Image Generation

Distilling Robust and Non-Robust Features in Adversarial Examples by Information Bottleneck

1 code implementation NeurIPS 2021 Junho Kim, Byung-Kwan Lee, Yong Man Ro

Adversarial examples, generated by carefully crafted perturbation, have attracted considerable attention in research fields.

Adversarial Robustness

Generating Videos with Dynamics-aware Implicit Generative Adversarial Networks

1 code implementation ICLR 2022 Sihyun Yu, Jihoon Tack, Sangwoo Mo, Hyunsu Kim, Junho Kim, Jung-Woo Ha, Jinwoo Shin

In this paper, we found that the recent emerging paradigm of implicit neural representations (INRs) that encodes a continuous signal into a parameterized neural network effectively mitigates the issue.

Generative Adversarial Network Video Generation

Feature Statistics Mixing Regularization for Generative Adversarial Networks

1 code implementation CVPR 2022 Junho Kim, Yunjey Choi, Youngjung Uh

In generative adversarial networks, improving discriminators is one of the key components for generation performance.

SGoLAM: Simultaneous Goal Localization and Mapping for Multi-Object Goal Navigation

1 code implementation14 Oct 2021 Junho Kim, Eun Sun Lee, MinGi Lee, Donsu Zhang, Young Min Kim

We present SGoLAM, short for simultaneous goal localization and mapping, which is a simple and efficient algorithm for Multi-Object Goal navigation.

Navigate Visual Navigation

Self-Supervised Domain Adaptation for Visual Navigation with Global Map Consistency

no code implementations14 Oct 2021 Eun Sun Lee, Junho Kim, Young Min Kim

We propose a light-weight, self-supervised adaptation for a visual navigation agent to generalize to unseen environment.

Test-time Adaptation Visual Navigation

CTRL-C: Camera calibration TRansformer with Line-Classification

1 code implementation ICCV 2021 Jinwoo Lee, Hyunsung Go, Hyunjoon Lee, Sunghyun Cho, Minhyuk Sung, Junho Kim

In this work, we propose Camera calibration TRansformer with Line-Classification (CTRL-C), an end-to-end neural network-based approach to single image camera calibration, which directly estimates the camera parameters from an image and a set of line segments.

Camera Calibration Classification

PICCOLO: Point Cloud-Centric Omnidirectional Localization

2 code implementations ICCV 2021 Junho Kim, Changwoon Choi, Hojun Jang, Young Min Kim

Our loss function, called sampling loss, is point cloud-centric, evaluated at the projected location of every point in the point cloud.

Visual Localization

Exploiting Spatial Dimensions of Latent in GAN for Real-time Image Editing

1 code implementation CVPR 2021 Hyunsu Kim, Yunjey Choi, Junho Kim, Sungjoo Yoo, Youngjung Uh

Although manipulating the latent vectors controls the synthesized outputs, editing real images with GANs suffers from i) time-consuming optimization for projecting real images to the latent vectors, ii) or inaccurate embedding through an encoder.

Image Manipulation valid

A StyleMap-Based Generator for Real-Time Image Projection and Local Editing

no code implementations1 Jan 2021 Hyunsu Kim, Yunjey Choi, Junho Kim, Sungjoo Yoo, Youngjung Uh

State-of-the-art GAN-based methods for editing real images suffer from time-consuming operations in projecting real images to latent vectors.

Image Manipulation

Neural Geometric Parser for Single Image Camera Calibration

1 code implementation ECCV 2020 Jinwoo Lee, Minhyuk Sung, Hyunjoon Lee, Junho Kim

With the supervision of datasets consisting of the horizontal line and focal length of the images, our networks can be trained to estimate the same camera parameters.

Camera Calibration

Cannot find the paper you are looking for? You can Submit a new open access paper.