1 code implementation • Findings (ACL) 2022 • Yong-Ho Jung, Jun-Hyung Park, Joon-Young Choi, Mingyu Lee, Junho Kim, Kang-Min Kim, SangKeun Lee
Commonsense inference poses a unique challenge to reason and generate the physical, social, and causal conditions of a given event.
no code implementations • 12 Dec 2024 • Geonhui Jang, Jin-Hwa Kim, Yong-Hyun Park, Junho Kim, Gayoung Lee, Yonghyun Jeong
However, fine-tuning with a limited number of reference images often leads to overfitting, resulting in issues such as prompt misalignment or content leakage.
no code implementations • 29 Nov 2024 • Hosu Lee, Junho Kim, Hyunjun Kim, Yong Man Ro
With the growing scale and complexity of video data, efficiently processing long video sequences poses significant challenges due to the quadratic increase in memory and computational demands associated with existing transformer-based Large Multi-modal Models (LMMs).
no code implementations • 25 Nov 2024 • Junho Kim, Hyunjun Kim, Hosu Lee, Yong Man Ro
Despite advances in Large Multi-modal Models, applying them to long and untrimmed video content remains challenging due to limitations in context length and substantial memory overhead.
1 code implementation • 17 Nov 2024 • Minhee Jang, Juheon Son, Thanaporn Viriyasaranon, Junho Kim, Jang-Hwan Choi
The integration of deep learning technologies in medical imaging aims to enhance the efficiency and accuracy of cancer diagnosis, particularly for pancreatic and breast cancers, which present significant diagnostic challenges due to their high mortality rates and complex imaging characteristics.
no code implementations • 11 Nov 2024 • Junho Kim, Hyungjin Chung, Byung-Hoon Kim
Our approach sets a new state-of-the-art on the MP-100 benchmark in the challenging 1-shot setting, marking a significant advancement in the field of category-agnostic pose estimation.
Ranked #1 on 2D Pose Estimation on MP-100
1 code implementation • 1 Nov 2024 • Yeachan Kim, Junho Kim, Wing-Lam Mok, Jun-Hyung Park, SangKeun Lee
Despite the versatility of pre-trained language models (PLMs) across domains, their large memory footprints pose significant challenges in federated learning (FL), where the training model has to be distributed between a server and clients.
1 code implementation • 31 Oct 2024 • Yeachan Kim, Junho Kim, SangKeun Lee
However, as datasets in such environments often contain noisy labels that adversely affect performance, PEFT methods are inevitably exposed to noisy labels.
no code implementations • 19 Oct 2024 • SangJong Lee, Jin-Kwang Kim, Junho Kim, Taehan Kim, James Lee
In this study, we introduces a parameter-efficient model that outperforms traditional models in time series forecasting, by integrating High-order Polynomial Projection (HiPPO) theory into the Kolmogorov-Arnold network (KAN) framework.
1 code implementation • 19 Oct 2024 • Junho Kim, Yeachan Kim, Jun-Hyung Park, Yerim Oh, Suho Kim, SangKeun Lee
We introduce a novel continued pre-training method, MELT (MatEriaLs-aware continued pre-Training), specifically designed to efficiently adapt the pre-trained language models (PLMs) for materials science.
1 code implementation • 11 Oct 2024 • Hojae Lee, Junho Kim, SangKeun Lee
Large Language Models (LLMs) have displayed remarkable performances across various complex tasks by leveraging Chain-of-Thought (CoT) prompting.
no code implementations • 17 Jul 2024 • Yong-Hyun Park, Sangdoo Yun, Jin-Hwa Kim, Junho Kim, Geonhui Jang, Yonghyun Jeong, Junghyo Jo, Gayoung Lee
In this paper, we propose Direct Unlearning Optimization (DUO), a novel framework for removing Not Safe For Work (NSFW) content from T2I models while preserving their performance on unrelated topics.
no code implementations • 4 Jun 2024 • Junho Kim, Hyunjun Kim, Yeonju Kim, Yong Man Ro
Large Multi-modal Models (LMMs) have recently demonstrated remarkable abilities in visual context understanding and coherent response generation.
1 code implementation • CVPR 2024 • Junho Kim, Jiwon Jeong, Young Min Kim
We introduce a lightweight and accurate localization method that only utilizes the geometry of 2D-3D lines.
1 code implementation • 20 Mar 2024 • Junho Kim, Yeon Ju Kim, Yong Man Ro
This paper presents a way of enhancing the reliability of Large Multi-modal Models (LMMs) in addressing hallucination, where the models generate cross-modal inconsistent responses.
1 code implementation • 20 Feb 2024 • Jaeseok Jeong, Junho Kim, Yunjey Choi, Gayoung Lee, Youngjung Uh
Despite their remarkable capability, existing models still face challenges in achieving controlled generation with a consistent style, requiring costly fine-tuning or often inadequately transferring the visual elements due to content leakage.
1 code implementation • 11 Oct 2023 • Junho Kim, Byung-Kwan Lee, Yong Man Ro
Unsupervised semantic segmentation aims to achieve high-quality semantic grouping without human-labeled annotations.
Ranked #1 on Unsupervised Semantic Segmentation on COCO-Stuff-81
no code implementations • 2 Oct 2023 • Sangyun Lee, Gayoung Lee, Hyunsu Kim, Junho Kim, Youngjung Uh
We present the Groupwise Diffusion Model (GDM), which divides data into multiple groups and diffuses one group at one time interval in the forward diffusion process.
1 code implementation • ICCV 2023 • Junho Kim, Changwoon Choi, Hojun Jang, Young Min Kim
We introduce LDL, a fast and robust algorithm that localizes a panorama to a 3D map using line segments.
1 code implementation • ICCV 2023 • Junho Kim, Eun Sun Lee, Young Min Kim
While panoramic images can easily capture the surrounding context from commodity devices, the estimated depth shares the limitations of conventional image-based depth estimation; the performance deteriorates under large domain shifts and the absolute values are still ambiguous to infer from 2D observations.
1 code implementation • ICCV 2023 • Byung-Kwan Lee, Junho Kim, Yong Man Ro
Adversarial examples derived from deliberately crafted perturbations on visual inputs can easily harm decision process of deep neural networks.
no code implementations • 5 Jun 2023 • Sunwoo Kim, Wooseok Jang, Hyunsu Kim, Junho Kim, Yunjey Choi, Seungryong Kim, Gayeong Lee
From the users' standpoint, prompt engineering is a labor-intensive process, and users prefer to provide a target word for editing instead of a full sentence.
no code implementations • 30 May 2023 • Doyeon Kim, Eunji Ko, Hyunsu Kim, Yunji Kim, Junho Kim, Dongchan Min, Junmo Kim, Sung Ju Hwang
Portrait stylization, which translates a real human face image into an artistically stylized image, has attracted considerable interest and many prior works have shown impressive quality in recent years.
no code implementations • 25 May 2023 • Jooyoung Choi, Yunjey Choi, Yunji Kim, Junho Kim, Sungroh Yoon
Text-to-image diffusion models can generate diverse, high-fidelity images based on user-provided text prompts.
no code implementations • 11 Apr 2023 • Soohyun Kim, Junho Kim, Taekyung Kim, Hwan Heo, Seungryong Kim, Jiyoung Lee, Jin-Hwa Kim
This task is difficult due to the geometric distortion of panoramic images and the lack of a panoramic image dataset with diverse conditions, like weather or time.
1 code implementation • 17 Mar 2023 • Jun-Hyung Park, Yeachan Kim, Junho Kim, Joon-Young Choi, SangKeun Lee
In this work, we introduce a novel structure pruning method, termed as dynamic structure pruning, to identify optimal pruning granularities for intra-channel pruning.
1 code implementation • 14 Mar 2023 • Junyoung Seo, Wooseok Jang, Min-Seop Kwak, Hyeonsu Kim, Jaehoon Ko, Junho Kim, Jin-Hwa Kim, Jiyoung Lee, Seungryong Kim
Text-to-3D generation has shown rapid progress in recent days with the advent of score distillation, a methodology of using pretrained text-to-2D diffusion models to optimize neural radiance field (NeRF) in the zero-shot setting.
1 code implementation • CVPR 2023 • Junho Kim, Byung-Kwan Lee, Yong Man Ro
The origin of adversarial examples is still inexplicable in research fields, and it arouses arguments from various viewpoints, albeit comprehensive investigations.
1 code implementation • 26 Feb 2023 • Yoonjeon Kim, Hyunsu Kim, Junho Kim, Yunjey Choi, Eunho Yang
With the advantages of fast inference and human-friendly flexible manipulation, image-agnostic style manipulation via text guidance enables new applications that were not previously available.
1 code implementation • 15 Dec 2022 • Mingyu Lee, Jun-Hyung Park, Junho Kim, Kang-Min Kim, SangKeun Lee
Masked language modeling (MLM) has been widely used for pre-training effective bidirectional representations, but incurs substantial training costs.
no code implementations • CVPR 2023 • Gyeongman Kim, Hajin Shim, Hyunsu Kim, Yunjey Choi, Junho Kim, Eunho Yang
Inspired by the impressive performance of recent face image editing methods, several studies have been naturally proposed to extend these methods to the face video editing task.
1 code implementation • 4 Dec 2022 • Junho Kim, Young Min Kim, Yicheng Wu, Ramzi Zahreddine, Weston A. Welge, Gurunandan Krishnan, Sizhuo Ma, Jian Wang
We present a robust, privacy-preserving visual localization algorithm using event cameras.
no code implementations • 29 Nov 2022 • Eun Sun Lee, Junho Kim, SangWon Park, Young Min Kim
We propose a domain adaptation method, MoDA, which adapts a pretrained embodied agent to a new, noisy environment without ground-truth supervision.
1 code implementation • 27 Jul 2022 • Gayoung Lee, Hyunsu Kim, Junho Kim, Seonghyeon Kim, Jung-Woo Ha, Yunjey Choi
Here we explore the efficacy of dense supervision in unconditional generation and find generator feature maps can be an alternative of cost-expensive semantic label maps.
1 code implementation • 12 Jul 2022 • Junho Kim, Hojun Jang, Changwoon Choi, Young Min Kim
By utilizing the unique equivariance of spherical projections, we propose very fast color histogram generation for a large number of camera poses without explicitly rendering images for all candidate poses.
no code implementations • 24 Jun 2022 • Inwoo Hwang, Junho Kim, Young Min Kim
We present Ev-NeRF, a Neural Radiance Field derived from event data.
1 code implementation • 17 Jun 2022 • Jiyeon Han, Hwanil Choi, Yunjey Choi, Junho Kim, Jung-Woo Ha, Jaesik Choi
In this work, we propose a new evaluation metric, called `rarity score', to measure the individual rarity of each image synthesized by generative models.
1 code implementation • NeurIPS 2021 • Junho Kim, Byung-Kwan Lee, Yong Man Ro
Adversarial examples, generated by carefully crafted perturbation, have attracted considerable attention in research fields.
1 code implementation • CVPR 2022 • Byung-Kwan Lee, Junho Kim, Yong Man Ro
Adversarial examples provoke weak reliability and potential security issues in deep neural networks.
1 code implementation • CVPR 2022 • Junho Kim, Inwoo Hwang, Young Min Kim
We introduce Ev-TTA, a simple, effective test-time adaptation algorithm for event-based object recognition.
Ranked #1 on Robust classification on N-ImageNet
1 code implementation • ICLR 2022 • Sihyun Yu, Jihoon Tack, Sangwoo Mo, Hyunsu Kim, Junho Kim, Jung-Woo Ha, Jinwoo Shin
In this paper, we found that the recent emerging paradigm of implicit neural representations (INRs) that encodes a continuous signal into a parameterized neural network effectively mitigates the issue.
Ranked #33 on Video Generation on UCF-101
1 code implementation • CVPR 2022 • Junho Kim, Yunjey Choi, Youngjung Uh
In generative adversarial networks, improving discriminators is one of the key components for generation performance.
1 code implementation • ICCV 2021 • Junho Kim, Jaehyeok Bae, Gangin Park, Dongsu Zhang, Young Min Kim
We introduce N-ImageNet, a large-scale dataset targeted for robust, fine-grained object recognition with event cameras.
Ranked #1 on Classification on N-ImageNet (mini)
1 code implementation • 14 Oct 2021 • Junho Kim, Eun Sun Lee, MinGi Lee, Donsu Zhang, Young Min Kim
We present SGoLAM, short for simultaneous goal localization and mapping, which is a simple and efficient algorithm for Multi-Object Goal navigation.
no code implementations • 14 Oct 2021 • Eun Sun Lee, Junho Kim, Young Min Kim
We propose a light-weight, self-supervised adaptation for a visual navigation agent to generalize to unseen environment.
1 code implementation • ICCV 2021 • Jinwoo Lee, Hyunsung Go, Hyunjoon Lee, Sunghyun Cho, Minhyuk Sung, Junho Kim
In this work, we propose Camera calibration TRansformer with Line-Classification (CTRL-C), an end-to-end neural network-based approach to single image camera calibration, which directly estimates the camera parameters from an image and a set of line segments.
2 code implementations • ICCV 2021 • Junho Kim, Changwoon Choi, Hojun Jang, Young Min Kim
Our loss function, called sampling loss, is point cloud-centric, evaluated at the projected location of every point in the point cloud.
1 code implementation • CVPR 2021 • Hyunsu Kim, Yunjey Choi, Junho Kim, Sungjoo Yoo, Youngjung Uh
Although manipulating the latent vectors controls the synthesized outputs, editing real images with GANs suffers from i) time-consuming optimization for projecting real images to the latent vectors, ii) or inaccurate embedding through an encoder.
no code implementations • 1 Jan 2021 • Hyunsu Kim, Yunjey Choi, Junho Kim, Sungjoo Yoo, Youngjung Uh
State-of-the-art GAN-based methods for editing real images suffer from time-consuming operations in projecting real images to latent vectors.
1 code implementation • ECCV 2020 • Jinwoo Lee, Minhyuk Sung, Hyunjoon Lee, Junho Kim
With the supervision of datasets consisting of the horizontal line and focal length of the images, our networks can be trained to estimate the same camera parameters.
28 code implementations • ICLR 2020 • Junho Kim, Minjae Kim, Hyeonwoo Kang, Kwanghee Lee
We propose a novel method for unsupervised image-to-image translation, which incorporates a new attention module and a new learnable normalization function in an end-to-end manner.
Ranked #1 on Image-to-Image Translation on photo2vangogh