Search Results for author: Suhwan Cho

Found 33 papers, 16 papers with code

GenCLIP: Generalizing CLIP Prompts for Zero-shot Anomaly Detection

no code implementations21 Apr 2025 Donghyeong Kim, Chaewon Park, Suhwan Cho, Hyeonjeong Lim, Minseok Kang, Jungho Lee, Sangyoun Lee

Zero-shot anomaly detection (ZSAD) aims to identify anomalies in unseen categories by leveraging CLIP's zero-shot capabilities to match text prompts with visual features.

Anomaly Detection Specificity +1

CoMoGaussian: Continuous Motion-Aware Gaussian Splatting from Motion-Blurred Images

1 code implementation7 Mar 2025 Jungho Lee, Donghyeong Kim, Dogyoon Lee, Suhwan Cho, Minhyeok Lee, Wonjoon Lee, Taeoh Kim, Dongyoon Wee, Sangyoun Lee

3D Gaussian Splatting (3DGS) has gained significant attention for their high-quality novel view rendering, motivating research to address real-world challenges.

3DGS 3D Scene Reconstruction

CoCoGaussian: Leveraging Circle of Confusion for Gaussian Splatting from Defocused Images

no code implementations20 Dec 2024 Jungho Lee, Suhwan Cho, Taeoh Kim, Ho-Deok Jang, Minhyeok Lee, Geonho Cha, Dongyoon Wee, Dogyoon Lee, Sangyoun Lee

While conventional methods depend on sharp images for accurate scene reconstruction, real-world scenarios are often affected by defocus blur due to finite depth of field, making it essential to account for realistic 3D scene representation.

3DGS

Elevating Flow-Guided Video Inpainting with Reference Generation

1 code implementation12 Dec 2024 Suhwan Cho, Seoung Wug Oh, Sangyoun Lee, Joon-Young Lee

Powered by a strong generative model, our method not only significantly enhances frame-level quality for object removal but also synthesizes new content in the missing areas based on user-provided text prompts.

2k Video Inpainting

Effective SAM Combination for Open-Vocabulary Semantic Segmentation

no code implementations22 Nov 2024 Minhyeok Lee, Suhwan Cho, Jungho Lee, Sunghun Yang, Heeseung Choi, Ig-Jae Kim, Sangyoun Lee

Open-vocabulary semantic segmentation aims to assign pixel-level labels to images across an unlimited range of classes.

Decoder Language Modeling +4

Transforming Static Images Using Generative Models for Video Salient Object Detection

1 code implementation21 Nov 2024 Suhwan Cho, Minhyeok Lee, Jungho Lee, Sangyoun Lee

This ability allows the model to generate plausible optical flows, preserving semantic integrity while reflecting the independent motion of scene elements.

 Ranked #1 on Video Salient Object Detection on DAVSOD-easy35 (using extra training data)

object-detection Salient Object Detection +2

Video Diffusion Models are Strong Video Inpainter

no code implementations21 Aug 2024 Minhyeok Lee, Suhwan Cho, Chajin Shin, Jungho Lee, Sunghun Yang, Sangyoun Lee

However, it has limitations such as the inaccuracy of optical flow prediction and the propagation of noise over time.

Optical Flow Estimation Video Inpainting

Improving Unsupervised Video Object Segmentation via Fake Flow Generation

1 code implementation16 Jul 2024 Suhwan Cho, Minhyeok Lee, Jungho Lee, Donghyeong Kim, Seunghoon Lee, Sungmin Woo, Sangyoun Lee

Unsupervised video object segmentation (VOS), also known as video salient object detection, aims to detect the most prominent object in a video at the pixel level.

Object object-detection +6

CRiM-GS: Continuous Rigid Motion-Aware Gaussian Splatting from Motion-Blurred Images

no code implementations4 Jul 2024 Jungho Lee, Donghyeong Kim, Dogyoon Lee, Suhwan Cho, Minhyeok Lee, Sangyoun Lee

3D Gaussian Splatting (3DGS) has gained significant attention for their high-quality novel view rendering, motivating research to address real-world challenges.

3DGS 3D Scene Reconstruction

Synchronizing Vision and Language: Bidirectional Token-Masking AutoEncoder for Referring Image Segmentation

no code implementations29 Nov 2023 Minhyeok Lee, Dogyoon Lee, Jungho Lee, Suhwan Cho, Heeseung Choi, Ig-Jae Kim, Sangyoun Lee

While these methods match language features with image features to effectively identify likely target objects, they often struggle to correctly understand contextual information in complex and ambiguous sentences and scenes.

Image Segmentation Semantic Segmentation

Treating Motion as Option with Output Selection for Unsupervised Video Object Segmentation

1 code implementation26 Sep 2023 Suhwan Cho, Minhyeok Lee, Jungho Lee, MyeongAh Cho, Sangyoun Lee

Unsupervised video object segmentation (VOS) is a task that aims to detect the most salient object in a video without external guidance about the object.

Object Optical Flow Estimation +3

Adaptive Graph Convolution Module for Salient Object Detection

no code implementations17 Mar 2023 Yongwoo Lee, Minhyeok Lee, Suhwan Cho, Sangyoun Lee

Salient object detection (SOD) is a task that involves identifying and segmenting the most visually prominent object in an image.

Object object-detection +2

Tsanet: Temporal and Scale Alignment for Unsupervised Video Object Segmentation

no code implementations8 Mar 2023 Seunghoon Lee, Suhwan Cho, Dogyoon Lee, Minhyeok Lee, Sangyoun Lee

In recent works, two approaches for UVOS have been discussed that can be divided into: appearance and appearance-motion-based methods, which have limitations respectively.

Decoder Object +4

One-Shot Video Inpainting

no code implementations28 Feb 2023 Sangjin Lee, Suhwan Cho, Sangyoun Lee

Usually, a video sequence and object segmentation masks for all frames are required as the input for this task.

Object Segmentation +4

Two-stream Decoder Feature Normality Estimating Network for Industrial Anomaly Detection

no code implementations20 Feb 2023 Chaewon Park, Minhyeok Lee, Suhwan Cho, Donghyeong Kim, Sangyoun Lee

Image reconstruction-based anomaly detection has recently been in the spotlight because of the difficulty of constructing anomaly datasets.

Anomaly Detection Decoder +2

Occluded Person Re-Identification via Relational Adaptive Feature Correction Learning

no code implementations9 Dec 2022 Minjung Kim, MyeongAh Cho, Heansung Lee, Suhwan Cho, Sangyoun Lee

Occluded person re-identification (Re-ID) in images captured by multiple cameras is challenging because the target person is occluded by pedestrians or objects, especially in crowded scenes.

Occluded Person Re-Identification

Leveraging Spatio-Temporal Dependency for Skeleton-Based Action Recognition

1 code implementation ICCV 2023 Jungho Lee, Minhyeok Lee, Suhwan Cho, Sungmin Woo, Sungjun Jang, Sangyoun Lee

In this paper, we propose the Spatio-Temporal Curve Network (STC-Net) to effectively leverage the spatio-temporal dependency of the human skeleton.

Action Recognition Skeleton Based Action Recognition

Boundary-aware Camouflaged Object Detection via Deformable Point Sampling

no code implementations22 Nov 2022 Minhyeok Lee, Suhwan Cho, Chaewon Park, Dogyoon Lee, Jungho Lee, Sangyoun Lee

The proposed DPS-Net utilizes a Deformable Point Sampling transformer (DPS transformer) that can effectively capture sparse local boundary information of significant object boundaries in COD using a deformable point sampling method.

Object object-detection +2

FAPM: Fast Adaptive Patch Memory for Real-time Industrial Anomaly Detection

1 code implementation14 Nov 2022 Donghyeong Kim, Chaewon Park, Suhwan Cho, Sangyoun Lee

Feature embedding-based methods have shown exceptional performance in detecting industrial anomalies by comparing features of target images with normal images.

Ranked #46 on Anomaly Detection on MVTec AD (using extra training data)

Anomaly Detection

Unsupervised Video Object Segmentation via Prototype Memory Network

1 code implementation8 Sep 2022 Minhyeok Lee, Suhwan Cho, Seunghoon Lee, Chaewon Park, Sangyoun Lee

The proposed model effectively extracts the RGB and motion information by extracting superpixel-based component prototypes from the input RGB images and optical flow maps.

Object Optical Flow Estimation +4

Pixel-Level Equalized Matching for Video Object Segmentation

no code implementations4 Sep 2022 Suhwan Cho, Woo Jin Kim, MyeongAh Cho, Seunghoon Lee, Minhyeok Lee, Chaewon Park, Sangyoun Lee

Feature similarity matching, which transfers the information of the reference frame to the query frame, is a key component in semi-supervised video object segmentation.

Object Semantic Segmentation +2

SPSN: Superpixel Prototype Sampling Network for RGB-D Salient Object Detection

1 code implementation16 Jul 2022 Minhyeok Lee, Chaewon Park, Suhwan Cho, Sangyoun Lee

However, despite advances in deep learning-based methods, RGB-D SOD is still challenging due to the large domain gap between an RGB image and the depth map and low-quality depth maps.

object-detection RGB-D Salient Object Detection +2

Pixel-Level Bijective Matching for Video Object Segmentation

1 code implementation4 Oct 2021 Suhwan Cho, Heansung Lee, Minjung Kim, Sungjun Jang, Sangyoun Lee

Before finding the best matches for the query frame pixels, the optimal matches for the reference frame pixels are first considered to prevent each reference frame pixel from being overly referenced.

Object Semantic Segmentation +2

Multi-object tracking with self-supervised associating network

no code implementations26 Oct 2020 Tae-young Chung, Heansung Lee, Myeong Ah Cho, Suhwan Cho, Sangyoun Lee

So in this paper, we propose a novel self-supervised learning method using a lot of short videos which has no human labeling, and improve the tracking performance through the re-identification network trained in the self-supervised manner to solve the lack of training data problem.

Multi-Object Tracking Object +1

Unsupervised Video Anomaly Detection via Normalizing Flows with Implicit Latent Features

no code implementations15 Oct 2020 MyeongAh Cho, Taeoh Kim, Woo Jin Kim, Suhwan Cho, Sangyoun Lee

For the complex distribution of normal scenes, we suggest normal density estimation of ITAE features through normalizing flow (NF)-based generative models to learn the tractable likelihoods and identify anomalies using out of distribution detection.

Anomaly Detection Decoder +4

PMVOS: Pixel-Level Matching-Based Video Object Segmentation

no code implementations18 Sep 2020 Suhwan Cho, Heansung Lee, Sungmin Woo, Sungjun Jang, Sangyoun Lee

Semi-supervised video object segmentation (VOS) aims to segment arbitrary target objects in video when the ground truth segmentation mask of the initial frame is provided.

Object One-shot visual object segmentation +3

CRVOS: Clue Refining Network for Video Object Segmentation

1 code implementation10 Feb 2020 Suhwan Cho, MyeongAh Cho, Tae-young Chung, Heansung Lee, Sangyoun Lee

The encoder-decoder based methods for semi-supervised video object segmentation (Semi-VOS) have received extensive attention due to their superior performances.

Decoder Object +5

Cannot find the paper you are looking for? You can Submit a new open access paper.