no code implementations • 21 Apr 2025 • Donghyeong Kim, Chaewon Park, Suhwan Cho, Hyeonjeong Lim, Minseok Kang, Jungho Lee, Sangyoun Lee
Zero-shot anomaly detection (ZSAD) aims to identify anomalies in unseen categories by leveraging CLIP's zero-shot capabilities to match text prompts with visual features.
1 code implementation • 7 Mar 2025 • Jungho Lee, Donghyeong Kim, Dogyoon Lee, Suhwan Cho, Minhyeok Lee, Wonjoon Lee, Taeoh Kim, Dongyoon Wee, Sangyoun Lee
3D Gaussian Splatting (3DGS) has gained significant attention for their high-quality novel view rendering, motivating research to address real-world challenges.
1 code implementation • 5 Mar 2025 • Suhwan Cho, Seunghoon Lee, Minhyeok Lee, Jungho Lee, Sangyoun Lee
This reference is then utilized by a dedicated propagation module to track and segment the object across the entire video.
Ranked #1 on
Referring Video Object Segmentation
on Ref-DAVIS17
no code implementations • 20 Dec 2024 • Jungho Lee, Suhwan Cho, Taeoh Kim, Ho-Deok Jang, Minhyeok Lee, Geonho Cha, Dongyoon Wee, Dogyoon Lee, Sangyoun Lee
While conventional methods depend on sharp images for accurate scene reconstruction, real-world scenarios are often affected by defocus blur due to finite depth of field, making it essential to account for realistic 3D scene representation.
1 code implementation • 12 Dec 2024 • Suhwan Cho, Seoung Wug Oh, Sangyoun Lee, Joon-Young Lee
Powered by a strong generative model, our method not only significantly enhances frame-level quality for object removal but also synthesizes new content in the missing areas based on user-provided text prompts.
Ranked #1 on
Video Inpainting
on HQVI (2K)
no code implementations • 2 Dec 2024 • Sunghun Yang, Minhyeok Lee, Suhwan Cho, Jungho Lee, Sangyoun Lee
For static area, the Masked Static (MS) module enhances temporal consistency by focusing on stable regions.
no code implementations • 22 Nov 2024 • Minhyeok Lee, Suhwan Cho, Jungho Lee, Sunghun Yang, Heeseung Choi, Ig-Jae Kim, Sangyoun Lee
Open-vocabulary semantic segmentation aims to assign pixel-level labels to images across an unlimited range of classes.
1 code implementation • 21 Nov 2024 • Suhwan Cho, Minhyeok Lee, Jungho Lee, Sangyoun Lee
This ability allows the model to generate plausible optical flows, preserving semantic integrity while reflecting the independent motion of scene elements.
Ranked #1 on
Video Salient Object Detection
on DAVSOD-easy35
(using extra training data)
no code implementations • 21 Aug 2024 • Minhyeok Lee, Suhwan Cho, Chajin Shin, Jungho Lee, Sunghun Yang, Sangyoun Lee
However, it has limitations such as the inaccuracy of optical flow prediction and the propagation of noise over time.
1 code implementation • 16 Jul 2024 • Suhwan Cho, Minhyeok Lee, Jungho Lee, Donghyeong Kim, Seunghoon Lee, Sungmin Woo, Sangyoun Lee
Unsupervised video object segmentation (VOS), also known as video salient object detection, aims to detect the most prominent object in a video at the pixel level.
Ranked #1 on
Unsupervised Video Object Segmentation
on FBMS test
no code implementations • 4 Jul 2024 • Jungho Lee, Donghyeong Kim, Dogyoon Lee, Suhwan Cho, Minhyeok Lee, Sangyoun Lee
3D Gaussian Splatting (3DGS) has gained significant attention for their high-quality novel view rendering, motivating research to address real-world challenges.
no code implementations • 29 Nov 2023 • Minhyeok Lee, Dogyoon Lee, Jungho Lee, Suhwan Cho, Heeseung Choi, Ig-Jae Kim, Sangyoun Lee
While these methods match language features with image features to effectively identify likely target objects, they often struggle to correctly understand contextual information in complex and ambiguous sentences and scenes.
1 code implementation • 26 Sep 2023 • Suhwan Cho, Minhyeok Lee, Jungho Lee, MyeongAh Cho, Sangyoun Lee
Unsupervised video object segmentation (VOS) is a task that aims to detect the most salient object in a video without external guidance about the object.
Ranked #3 on
Unsupervised Video Object Segmentation
on FBMS test
no code implementations • 17 Mar 2023 • Yongwoo Lee, Minhyeok Lee, Suhwan Cho, Sangyoun Lee
Salient object detection (SOD) is a task that involves identifying and segmenting the most visually prominent object in an image.
1 code implementation • CVPR 2024 • Minhyeok Lee, Suhwan Cho, Dogyoon Lee, Chaewon Park, Jungho Lee, Sangyoun Lee
Unsupervised video object segmentation aims to segment the most prominent object in a video sequence.
no code implementations • 8 Mar 2023 • Seunghoon Lee, Suhwan Cho, Dogyoon Lee, Minhyeok Lee, Sangyoun Lee
In recent works, two approaches for UVOS have been discussed that can be divided into: appearance and appearance-motion-based methods, which have limitations respectively.
no code implementations • 28 Feb 2023 • Sangjin Lee, Suhwan Cho, Sangyoun Lee
Usually, a video sequence and object segmentation masks for all frames are required as the input for this task.
no code implementations • 20 Feb 2023 • Chaewon Park, Minhyeok Lee, Suhwan Cho, Donghyeong Kim, Sangyoun Lee
Image reconstruction-based anomaly detection has recently been in the spotlight because of the difficulty of constructing anomaly datasets.
no code implementations • 9 Dec 2022 • Minjung Kim, MyeongAh Cho, Heansung Lee, Suhwan Cho, Sangyoun Lee
Occluded person re-identification (Re-ID) in images captured by multiple cameras is challenging because the target person is occluded by pedestrians or objects, especially in crowded scenes.
1 code implementation • ICCV 2023 • Jungho Lee, Minhyeok Lee, Suhwan Cho, Sungmin Woo, Sungjun Jang, Sangyoun Lee
In this paper, we propose the Spatio-Temporal Curve Network (STC-Net) to effectively leverage the spatio-temporal dependency of the human skeleton.
1 code implementation • CVPR 2024 • Suhwan Cho, Minhyeok Lee, Seunghoon Lee, Dogyoon Lee, Heeseung Choi, Ig-Jae Kim, Sangyoun Lee
Unsupervised video object segmentation (VOS) aims to detect and segment the most salient object in videos.
Ranked #2 on
Unsupervised Video Object Segmentation
on FBMS test
no code implementations • 22 Nov 2022 • Minhyeok Lee, Suhwan Cho, Chaewon Park, Dogyoon Lee, Jungho Lee, Sangyoun Lee
The proposed DPS-Net utilizes a Deformable Point Sampling transformer (DPS transformer) that can effectively capture sparse local boundary information of significant object boundaries in COD using a deformable point sampling method.
1 code implementation • 14 Nov 2022 • Donghyeong Kim, Chaewon Park, Suhwan Cho, Sangyoun Lee
Feature embedding-based methods have shown exceptional performance in detecting industrial anomalies by comparing features of target images with normal images.
Ranked #46 on
Anomaly Detection
on MVTec AD
(using extra training data)
1 code implementation • 8 Sep 2022 • Minhyeok Lee, Suhwan Cho, Seunghoon Lee, Chaewon Park, Sangyoun Lee
The proposed model effectively extracts the RGB and motion information by extracting superpixel-based component prototypes from the input RGB images and optical flow maps.
Ranked #9 on
Unsupervised Video Object Segmentation
on FBMS test
no code implementations • 4 Sep 2022 • Suhwan Cho, Woo Jin Kim, MyeongAh Cho, Seunghoon Lee, Minhyeok Lee, Chaewon Park, Sangyoun Lee
Feature similarity matching, which transfers the information of the reference frame to the query frame, is a key component in semi-supervised video object segmentation.
2 code implementations • 4 Sep 2022 • Suhwan Cho, Minhyeok Lee, Seunghoon Lee, Chaewon Park, Donghyeong Kim, Sangyoun Lee
Unsupervised video object segmentation (VOS) aims to detect the most salient object in a video sequence at the pixel level.
Ranked #6 on
Unsupervised Video Object Segmentation
on FBMS test
1 code implementation • 16 Jul 2022 • Minhyeok Lee, Chaewon Park, Suhwan Cho, Sangyoun Lee
However, despite advances in deep learning-based methods, RGB-D SOD is still challenging due to the large domain gap between an RGB image and the depth map and low-quality depth maps.
Ranked #4 on
RGB-D Salient Object Detection
on NJU2K
1 code implementation • 14 Jul 2022 • Suhwan Cho, Heansung Lee, Minhyeok Lee, Chaewon Park, Sungjun Jang, Minjung Kim, Sangyoun Lee
Semi-supervised video object segmentation (VOS) aims to densely track certain designated objects in videos.
1 code implementation • 4 Oct 2021 • Suhwan Cho, Heansung Lee, Minjung Kim, Sungjun Jang, Sangyoun Lee
Before finding the best matches for the query frame pixels, the optimal matches for the reference frame pixels are first considered to prevent each reference frame pixel from being overly referenced.
no code implementations • 26 Oct 2020 • Tae-young Chung, Heansung Lee, Myeong Ah Cho, Suhwan Cho, Sangyoun Lee
So in this paper, we propose a novel self-supervised learning method using a lot of short videos which has no human labeling, and improve the tracking performance through the re-identification network trained in the self-supervised manner to solve the lack of training data problem.
no code implementations • 15 Oct 2020 • MyeongAh Cho, Taeoh Kim, Woo Jin Kim, Suhwan Cho, Sangyoun Lee
For the complex distribution of normal scenes, we suggest normal density estimation of ITAE features through normalizing flow (NF)-based generative models to learn the tractable likelihoods and identify anomalies using out of distribution detection.
no code implementations • 18 Sep 2020 • Suhwan Cho, Heansung Lee, Sungmin Woo, Sungjun Jang, Sangyoun Lee
Semi-supervised video object segmentation (VOS) aims to segment arbitrary target objects in video when the ground truth segmentation mask of the initial frame is provided.
1 code implementation • 10 Feb 2020 • Suhwan Cho, MyeongAh Cho, Tae-young Chung, Heansung Lee, Sangyoun Lee
The encoder-decoder based methods for semi-supervised video object segmentation (Semi-VOS) have received extensive attention due to their superior performances.
Ranked #60 on
Semi-Supervised Video Object Segmentation
on DAVIS 2016