Search Results for author: Junsik Kim

Found 31 papers, 9 papers with code

Joint-Task Regularization for Partially Labeled Multi-Task Learning

1 code implementation2 Apr 2024 Kento Nishi, Junsik Kim, Wanhua Li, Hanspeter Pfister

Multi-task learning has become increasingly popular in the machine learning field, but its practicality is hindered by the need for large, labeled datasets.

Multi-Task Learning

R^2-Tuning: Efficient Image-to-Video Transfer Learning for Video Temporal Grounding

1 code implementation2 Apr 2024 Ye Liu, Jixuan He, Wanhua Li, Junsik Kim, Donglai Wei, Hanspeter Pfister, Chang Wen Chen

Video temporal grounding (VTG) is a fine-grained video understanding problem that aims to ground relevant clips in untrimmed videos given natural language queries.

Highlight Detection Moment Retrieval +4

$R^2$-Tuning: Efficient Image-to-Video Transfer Learning for Video Temporal Grounding

1 code implementation31 Mar 2024 Ye Liu, Jixuan He, Wanhua Li, Junsik Kim, Donglai Wei, Hanspeter Pfister, Chang Wen Chen

Video temporal grounding (VTG) is a fine-grained video understanding problem that aims to ground relevant clips in untrimmed videos given natural language queries.

Highlight Detection Moment Retrieval +4

Blurry Video Compression: A Trade-off between Visual Enhancement and Data Compression

no code implementations8 Nov 2023 Dawit Mureja Argaw, Junsik Kim, In So Kweon

Existing video compression (VC) methods primarily aim to reduce the spatial and temporal redundancies between consecutive frames in a video while preserving its quality.

Data Compression Video Compression

How Does Artificial Intelligence Improve Human Decision-Making? Evidence from the AI-Powered Go Program

no code implementations12 Oct 2023 Sukwoong Choi, Hyo Kang, Namil Kim, Junsik Kim

We study how humans learn from AI, exploiting an introduction of an AI-powered Go program (APG) that unexpectedly outperformed the best professional player.

Decision Making

Sound Source Localization is All about Cross-Modal Alignment

no code implementations ICCV 2023 Arda Senocak, Hyeonggon Ryu, Junsik Kim, Tae-Hyun Oh, Hanspeter Pfister, Joon Son Chung

However, prior arts and existing benchmarks do not account for a more important aspect of the problem, cross-modal semantic understanding, which is essential for genuine sound source localization.

Cross-Modal Retrieval Retrieval

Active anomaly detection based on deep one-class classification

no code implementations18 Sep 2023 Minkyung Kim, Junsik Kim, Jongmin Yu, Jun Kyun Choi

In an active learning framework, a model queries samples to be labeled by experts and re-trains the model with the labeled data samples.

Active Learning One-Class Classification

An Iterative Method for Unsupervised Robust Anomaly Detection Under Data Contamination

no code implementations18 Sep 2023 Minkyung Kim, Jongmin Yu, Junsik Kim, Tae-Hyun Oh, Jun Kyun Choi

Therefore, it has been a common practice to learn normality under the assumption that anomalous data are absent in a training dataset, which we call normality assumption.

One-Class Classification

ENInst: Enhancing Weakly-supervised Low-shot Instance Segmentation

no code implementations20 Feb 2023 Moon Ye-Bin, Dongmin Choi, Yongjin Kwon, Junsik Kim, Tae-Hyun Oh

We address a weakly-supervised low-shot instance segmentation, an annotation-efficient training method to deal with novel classes effectively.

Instance Segmentation Semantic Segmentation

Unsupervised Deep One-Class Classification with Adaptive Threshold based on Training Dynamics

no code implementations13 Feb 2023 Minkyung Kim, Junsik Kim, Jongmin Yu, Jun Kyun Choi

One-class classification has been a prevailing method in building deep anomaly detection models under the assumption that a dataset consisting of normal samples is available.

One-Class Classification Outlier Detection

Audio-Visual Fusion Layers for Event Type Aware Video Recognition

no code implementations12 Feb 2022 Arda Senocak, Junsik Kim, Tae-Hyun Oh, Hyeonggon Ryu, DIngzeyu Li, In So Kweon

Human brain is continuously inundated with the multisensory information and their complex interactions coming from the outside world at any given moment.

Multi-Task Learning Video Recognition +1

Learning Sound Localization Better From Semantically Similar Samples

no code implementations7 Feb 2022 Arda Senocak, Hyeonggon Ryu, Junsik Kim, In So Kweon

Thus, these semantically correlated pairs, "hard positives", are mistakenly grouped as negatives.

Contrastive Learning

Normality-Calibrated Autoencoder for Unsupervised Anomaly Detection on Data Contamination

1 code implementation28 Oct 2021 Jongmin Yu, Hyeontaek Oh, Minkyung Kim, Junsik Kim

In this paper, we propose Normality-Calibrated Autoencoder (NCAE), which can boost anomaly detection performance on the contaminated datasets without any prior information or explicit abnormal samples in the training phase.

Unsupervised Anomaly Detection

FoxInst: A Frustratingly Simple Baseline for Weakly Few-shot Instance Segmentation

no code implementations29 Sep 2021 Dongmin Choi, Moon Ye-Bin, Junsik Kim, Tae-Hyun Oh

We propose the first weakly-supervised few-shot instance segmentation task and a frustratingly simple but strong baseline model, FoxInst.

Instance Segmentation Semantic Segmentation

Restoration of Video Frames from a Single Blurred Image with Motion Understanding

no code implementations19 Apr 2021 Dawit Mureja Argaw, Junsik Kim, Francois Rameau, Chaoning Zhang, In So Kweon

We formulate video restoration from a single blurred image as an inverse problem by setting clean image sequence and their respective motion as latent factors, and the blurred image as an observation.

Video Restoration

Motion-blurred Video Interpolation and Extrapolation

no code implementations4 Mar 2021 Dawit Mureja Argaw, Junsik Kim, Francois Rameau, In So Kweon

Abrupt motion of camera or objects in a scene result in a blurry video, and therefore recovering high quality video requires two types of enhancements: visual enhancement and temporal upsampling.

Deblurring Optical Flow Estimation

Optical Flow Estimation from a Single Motion-blurred Image

no code implementations4 Mar 2021 Dawit Mureja Argaw, Junsik Kim, Francois Rameau, Jae Won Cho, In So Kweon

A flow estimator network is then used to estimate optical flow from the decoded features in a coarse-to-fine manner.

Deblurring Optical Flow Estimation +1

Learning to Localize Sound Source in Visual Scenes

no code implementations CVPR 2018 Arda Senocak, Tae-Hyun Oh, Junsik Kim, Ming-Hsuan Yang, In So Kweon

We show that even with a few supervision, false conclusion is able to be corrected and the source of sound in a visual scene can be localized effectively.

VPGNet: Vanishing Point Guided Network for Lane and Road Marking Detection and Recognition

3 code implementations ICCV 2017 Seokju Lee, Junsik Kim, Jae Shin Yoon, Seunghak Shin, Oleksandr Bailo, Namil Kim, Tae-Hee Lee, Hyun Seok Hong, Seung-Hoon Han, In So Kweon

In this paper, we propose a unified end-to-end trainable multi-task network that jointly handles lane and road marking detection and recognition that is guided by a vanishing point under adverse weather conditions.

Lane Detection

Globally Optimal Manhattan Frame Estimation in Real-Time

no code implementations CVPR 2016 Kyungdon Joo, Tae-Hyun Oh, Junsik Kim, In So Kweon

Given a set of surface normals, we pose a Manhattan Frame (MF) estimation problem as a consensus set maximization that maximizes the number of inliers over the rotation search space.

Video Stabilization

Robust and Globally Optimal Manhattan Frame Estimation in Near Real Time

no code implementations12 May 2016 Kyungdon Joo, Tae-Hyun Oh, Junsik Kim, In So Kweon

Most man-made environments, such as urban and indoor scenes, consist of a set of parallel and orthogonal planar structures.

Clustering Video Stabilization

Cannot find the paper you are looking for? You can Submit a new open access paper.