Search Results for author: Sangyoun Lee

Found 51 papers, 21 papers with code

SMURF: Continuous Dynamics for Motion-Deblurring Radiance Fields

1 code implementation • 12 Mar 2024 • Jungho Lee, Dogyoon Lee, Minhyeok Lee, Donghyung Kim, Sangyoun Lee

Neural radiance fields (NeRF) has attracted considerable attention for their exceptional ability in synthesizing novel views with high fidelity.

Deblurring

Paper
Code

FIMP: Future Interaction Modeling for Multi-Agent Motion Prediction

no code implementations • 29 Jan 2024 • Sungmin Woo, Minjung Kim, Donghyeong Kim, Sungjun Jang, Sangyoun Lee

Multi-agent motion prediction is a crucial concern in autonomous driving, yet it remains a challenge owing to the ambiguous intentions of dynamic agents and their intricate interactions.

Motion Forecasting motion prediction

Paper
Add Code

Synchronizing Vision and Language: Bidirectional Token-Masking AutoEncoder for Referring Image Segmentation

no code implementations • 29 Nov 2023 • Minhyeok Lee, Dogyoon Lee, Jungho Lee, Suhwan Cho, Heeseung Choi, Ig-Jae Kim, Sangyoun Lee

While these methods match language features with image features to effectively identify likely target objects, they often struggle to correctly understand contextual information in complex and ambiguous sentences and scenes.

Image Segmentation Semantic Segmentation

Paper
Add Code

Treating Motion as Option with Output Selection for Unsupervised Video Object Segmentation

1 code implementation • 26 Sep 2023 • Suhwan Cho, Minhyeok Lee, Jungho Lee, MyeongAh Cho, Sangyoun Lee

Unsupervised video object segmentation (VOS) is a task that aims to detect the most salient object in a video without external guidance about the object.

Object Optical Flow Estimation +3

Paper
Code

Adaptive Graph Convolution Module for Salient Object Detection

no code implementations • 17 Mar 2023 • Yongwoo Lee, Minhyeok Lee, Suhwan Cho, Sangyoun Lee

Salient object detection (SOD) is a task that involves identifying and segmenting the most visually prominent object in an image.

Object object-detection +2

Paper
Add Code

Guided Slot Attention for Unsupervised Video Object Segmentation

1 code implementation • 15 Mar 2023 • Minhyeok Lee, Suhwan Cho, Dogyoon Lee, Chaewon Park, Jungho Lee, Sangyoun Lee

Unsupervised video object segmentation aims to segment the most prominent object in a video sequence.

Object Semantic Segmentation +2

Paper
Code

Tsanet: Temporal and Scale Alignment for Unsupervised Video Object Segmentation

no code implementations • 8 Mar 2023 • Seunghoon Lee, Suhwan Cho, Dogyoon Lee, Minhyeok Lee, Sangyoun Lee

In recent works, two approaches for UVOS have been discussed that can be divided into: appearance and appearance-motion-based methods, which have limitations respectively.

Object Optical Flow Estimation +3

Paper
Add Code

One-Shot Video Inpainting

no code implementations • 28 Feb 2023 • Sangjin Lee, Suhwan Cho, Sangyoun Lee

Usually, a video sequence and object segmentation masks for all frames are required as the input for this task.

Object Segmentation +4

Paper
Add Code

Two-stream Decoder Feature Normality Estimating Network for Industrial Anomaly Detection

no code implementations • 20 Feb 2023 • Chaewon Park, Minhyeok Lee, Suhwan Cho, Donghyeong Kim, Sangyoun Lee

Image reconstruction-based anomaly detection has recently been in the spotlight because of the difficulty of constructing anomaly datasets.

Anomaly Detection Image Reconstruction +1

Paper
Add Code

Look Around for Anomalies: Weakly-Supervised Anomaly Detection via Context-Motion Relational Learning

no code implementations • CVPR 2023 • MyeongAh Cho, Minjung Kim, Sangwon Hwang, Chaewon Park, Kyungjae Lee, Sangyoun Lee

Furthermore, as the relationship between context and motion is important in order to identify the anomalies in complex and diverse scenes, we propose a Context--Motion Interrelation Module (CoMo), which models the relationship between the appearance of the surroundings and motion, rather than utilizing only temporal dependencies or motion information.

Relational Reasoning Supervised Anomaly Detection +2

Paper
Add Code

Feature Disentanglement Learning with Switching and Aggregation for Video-based Person Re-Identification

no code implementations • 16 Dec 2022 • Minjung Kim, MyeongAh Cho, Sangyoun Lee

In video person re-identification (Re-ID), the network must consistently extract features of the target person from successive frames.

Disentanglement Video-Based Person Re-Identification

Paper
Add Code

Leveraging Spatio-Temporal Dependency for Skeleton-Based Action Recognition

1 code implementation • ICCV 2023 • Jungho Lee, Minhyeok Lee, Suhwan Cho, Sungmin Woo, Sungjun Jang, Sangyoun Lee

In this paper, we propose the Spatio-Temporal Curve Network (STC-Net) to effectively leverage the spatio-temporal dependency of the human skeleton.

Action Recognition Skeleton Based Action Recognition

Paper
Code

Occluded Person Re-Identification via Relational Adaptive Feature Correction Learning

no code implementations • 9 Dec 2022 • Minjung Kim, MyeongAh Cho, Heansung Lee, Suhwan Cho, Sangyoun Lee

Occluded person re-identification (Re-ID) in images captured by multiple cameras is challenging because the target person is occluded by pedestrians or objects, especially in crowded scenes.

Person Re-Identification

Paper
Add Code

DP-NeRF: Deblurred Neural Radiance Field with Physical Scene Priors

1 code implementation • CVPR 2023 • Dogyoon Lee, Minhyeok Lee, Chajin Shin, Sangyoun Lee

The few studies that have investigated NeRF for blurred images have not considered geometric and appearance consistency in 3D space, which is one of the most important factors in 3D reconstruction.

3D Reconstruction Novel View Synthesis

Paper
Code

Boundary-aware Camouflaged Object Detection via Deformable Point Sampling

no code implementations • 22 Nov 2022 • Minhyeok Lee, Suhwan Cho, Chaewon Park, Dogyoon Lee, Jungho Lee, Sangyoun Lee

The proposed DPS-Net utilizes a Deformable Point Sampling transformer (DPS transformer) that can effectively capture sparse local boundary information of significant object boundaries in COD using a deformable point sampling method.

Object object-detection +2

Paper
Add Code

Dual Prototype Attention for Unsupervised Video Object Segmentation

2 code implementations • 22 Nov 2022 • Suhwan Cho, Minhyeok Lee, Seunghoon Lee, Dogyoon Lee, Heeseung Choi, Ig-Jae Kim, Sangyoun Lee

Unsupervised video object segmentation (VOS) aims to detect and segment the most salient object in videos.

Ranked #2 on Unsupervised Video Object Segmentation on FBMS test

Object Semantic Segmentation +2

Paper
Code

FAPM: Fast Adaptive Patch Memory for Real-time Industrial Anomaly Detection

1 code implementation • 14 Nov 2022 • Donghyeong Kim, Chaewon Park, Suhwan Cho, Sangyoun Lee

Feature embedding-based methods have shown exceptional performance in detecting industrial anomalies by comparing features of target images with normal images.

Ranked #29 on Anomaly Detection on MVTec AD

Anomaly Detection

Paper
Code

Unsupervised Video Object Segmentation via Prototype Memory Network

1 code implementation • 8 Sep 2022 • Minhyeok Lee, Suhwan Cho, Seunghoon Lee, Chaewon Park, Sangyoun Lee

The proposed model effectively extracts the RGB and motion information by extracting superpixel-based component prototypes from the input RGB images and optical flow maps.

Ranked #5 on Unsupervised Video Object Segmentation on FBMS test

Object Optical Flow Estimation +4

Paper
Code

Treating Motion as Option to Reduce Motion Dependency in Unsupervised Video Object Segmentation

2 code implementations • 4 Sep 2022 • Suhwan Cho, Minhyeok Lee, Seunghoon Lee, Chaewon Park, Donghyeong Kim, Sangyoun Lee

Unsupervised video object segmentation (VOS) aims to detect the most salient object in a video sequence at the pixel level.

Ranked #2 on Unsupervised Video Object Segmentation on YouTube-Objects

Optical Flow Estimation Semantic Segmentation +2

Paper
Code

Pixel-Level Equalized Matching for Video Object Segmentation

no code implementations • 4 Sep 2022 • Suhwan Cho, Woo Jin Kim, MyeongAh Cho, Seunghoon Lee, Minhyeok Lee, Chaewon Park, Sangyoun Lee

Feature similarity matching, which transfers the information of the reference frame to the query frame, is a key component in semi-supervised video object segmentation.

Object Semantic Segmentation +2

Paper
Add Code

Hierarchically Decomposed Graph Convolutional Networks for Skeleton-Based Action Recognition

1 code implementation • ICCV 2023 • Jungho Lee, Minhyeok Lee, Dogyoon Lee, Sangyoun Lee

Graph convolutional networks (GCNs) are the most commonly used methods for skeleton-based action recognition and have achieved remarkable performance.

Ranked #4 on Skeleton Based Action Recognition on NTU RGB+D 120

Action Recognition Skeleton Based Action Recognition

108

Paper
Code

Expanded Adaptive Scaling Normalization for End to End Image Compression

1 code implementation • 5 Aug 2022 • Chajin Shin, Hyeongmin Lee, Hanbin Son, Sangjin Lee, Dogyoon Lee, Sangyoun Lee

Then, we increase the receptive field to make the adaptive rescaling module consider the spatial correlation.

Image Compression

Paper
Code

NIR-to-VIS Face Recognition via Embedding Relations and Coordinates of the Pairwise Features

no code implementations • 4 Aug 2022 • MyeongAh Cho, Tae-young Chun, g Taeoh Kim, Sangyoun Lee

With the proposed module, we achieve 14. 81% rank-1 accuracy and 15. 47% verification rate of 0. 1% FAR improvements compare to two baseline models.

Face Recognition Relation

Paper
Add Code

N-RPN: Hard Example Learning for Region Proposal Networks

no code implementations • 3 Aug 2022 • MyeongAh Cho, Tae-young Chung, Hyeongmin Lee, Sangyoun Lee

The region proposal task is to generate a set of candidate regions that contain an object.

Region Proposal

Paper
Add Code

SPSN: Superpixel Prototype Sampling Network for RGB-D Salient Object Detection

1 code implementation • 16 Jul 2022 • Minhyeok Lee, Chaewon Park, Suhwan Cho, Sangyoun Lee

However, despite advances in deep learning-based methods, RGB-D SOD is still challenging due to the large domain gap between an RGB image and the depth map and low-quality depth maps.

Ranked #3 on RGB-D Salient Object Detection on NJU2K

object-detection RGB-D Salient Object Detection +2

Paper
Code

Tackling Background Distraction in Video Object Segmentation

1 code implementation • 14 Jul 2022 • Suhwan Cho, Heansung Lee, Minhyeok Lee, Chaewon Park, Sungjun Jang, Minjung Kim, Sangyoun Lee

Semi-supervised video object segmentation (VOS) aims to densely track certain designated objects in videos.

Ranked #2 on Semi-Supervised Video Object Segmentation on DAVIS (no YouTube-VOS training)

Object Semantic Segmentation +2

Paper
Code

Exploring Temporally Dynamic Data Augmentation for Video Recognition

no code implementations • 30 Jun 2022 • Taeoh Kim, Jinhyung Kim, Minho Shim, Sangdoo Yun, Myunggu Kang, Dongyoon Wee, Sangyoun Lee

The magnitude of augmentation operations on each frame is changed by an effective mechanism, Fourier Sampling that parameterizes diverse, smooth, and realistic temporal variations.

Action Segmentation Image Augmentation +3

Paper
Add Code

Exploring Discontinuity for Video Frame Interpolation

1 code implementation • CVPR 2023 • Sangjin Lee, Hyeongmin Lee, Chajin Shin, Hanbin Son, Sangyoun Lee

Lastly, we propose loss functions to give supervisions of the discontinuous motion areas which can be applied along with FTM and D-map.

Data Augmentation Video Frame Interpolation

Paper
Code

RandomSEMO: Normality Learning Of Moving Objects For Video Anomaly Detection

no code implementations • 13 Feb 2022 • Chaewon Park, Minhyeok Lee, MyeongAh Cho, Sangyoun Lee

Moreover, MOLoss urges the model to focus on learning normal objects captured within RandomSEMO by amplifying the loss on the pixels near the moving objects.

Anomaly Detection Superpixels +1

Paper
Add Code

Saliency Detection via Global Context Enhanced Feature Fusion and Edge Weighted Loss

no code implementations • 13 Oct 2021 • Chaewon Park, Minhyeok Lee, MyeongAh Cho, Sangyoun Lee

1) Indiscriminately integrating the encoder feature, which contains spatial information for multiple objects, and the decoder feature, which contains global information of the salient object, is likely to convey unnecessary details of non-salient objects to the decoder, hindering saliency detection.

Ranked #1 on RGB Salient Object Detection on PASCAL-S

Object object-detection +3

Paper
Add Code

Pixel-Level Bijective Matching for Video Object Segmentation

1 code implementation • 4 Oct 2021 • Suhwan Cho, Heansung Lee, Minjung Kim, Sungjun Jang, Sangyoun Lee

Before finding the best matches for the query frame pixels, the optimal matches for the reference frame pixels are first considered to prevent each reference frame pixel from being overly referenced.

Ranked #14 on Semi-Supervised Video Object Segmentation on DAVIS (no YouTube-VOS training)

Object Semantic Segmentation +2

Paper
Code

MKConv: Multidimensional Feature Representation for Point Cloud Analysis

no code implementations • 27 Jul 2021 • Sungmin Woo, Dogyoon Lee, Sangwon Hwang, Woojin Kim, Sangyoun Lee

In this paper, we present Multidimensional Kernel Convolution (MKConv), a novel convolution operator that learns to transform the point feature representation from a vector to a multidimensional matrix.

Ranked #13 on 3D Part Segmentation on ShapeNet-Part

3D Part Segmentation 3D Point Cloud Classification

Paper
Add Code

FastAno: Fast Anomaly Detection via Spatio-temporal Patch Transformation

1 code implementation • 16 Jun 2021 • Chaewon Park, MyeongAh Cho, Minhyeok Lee, Sangyoun Lee

Video anomaly detection has gained significant attention due to the increasing requirements of automatic monitoring for surveillance videos.

Ranked #5 on Anomaly Detection In Surveillance Videos on UCSD Peds2

Anomaly Detection In Surveillance Videos Optical Flow Estimation +1

Paper
Code

EdgeConv with Attention Module for Monocular Depth Estimation

no code implementations • 16 Jun 2021 • Minhyeok Lee, Sangwon Hwang, Chaewon Park, Sangyoun Lee

Monocular depth estimation is an especially important task in robotics and autonomous driving, where 3D structural information is essential.

Autonomous Driving Monocular Depth Estimation

Paper
Add Code

Robust Lane Detection via Expanded Self Attention

1 code implementation • 14 Feb 2021 • Minhyeok Lee, Junhyeop Lee, Dogyoon Lee, Woojin Kim, Sangwon Hwang, Sangyoun Lee

Modern deep learning methods achieve high performance in lane detection, but it is still difficult to accurately detect lanes in challenging situations such as congested roads and extreme lighting conditions.

Ranked #42 on Lane Detection on CULane

Lane Detection

Paper
Code

Regularization Strategy for Point Cloud via Rigidly Mixed Sample

1 code implementation • CVPR 2021 • Dogyoon Lee, Jaeha Lee, Junhyeop Lee, Hyeongmin Lee, Minhyeok Lee, Sungmin Woo, Sangyoun Lee

Data augmentation is an effective regularization strategy to alleviate the overfitting, which is an inherent drawback of the deep neural networks.

Ranked #3 on 3D Point Cloud Classification on ModelNet40-C

3D Object Classification Data Augmentation +1

Paper
Code

Test-Time Adaptation for Out-of-distributed Image Inpainting

no code implementations • 2 Feb 2021 • Chajin Shin, Taeoh Kim, Sangjin Lee, Sangyoun Lee

From this test-time adaptation, our network can exploit externally learned image priors from the pre-trained features as well as the internal prior of the test image explicitly.

Image Inpainting Test-time Adaptation +1

Paper
Add Code

A NIR-to-VIS face recognition via part adaptive and relation attention module

no code implementations • 1 Feb 2021 • Rushuang Xu, MyeongAh Cho, Sangyoun Lee

In the face recognition application scenario, we need to process facial images captured in various conditions, such as at night by near-infrared (NIR) surveillance cameras.

Face Recognition Heterogeneous Face Recognition +1

Paper
Add Code

Multi-object tracking with self-supervised associating network

no code implementations • 26 Oct 2020 • Tae-young Chung, Heansung Lee, Myeong Ah Cho, Suhwan Cho, Sangyoun Lee

So in this paper, we propose a novel self-supervised learning method using a lot of short videos which has no human labeling, and improve the tracking performance through the re-identification network trained in the self-supervised manner to solve the lack of training data problem.

Multi-Object Tracking Object +1

Paper
Add Code

Unsupervised Video Anomaly Detection via Normalizing Flows with Implicit Latent Features

no code implementations • 15 Oct 2020 • MyeongAh Cho, Taeoh Kim, Woo Jin Kim, Suhwan Cho, Sangyoun Lee

For the complex distribution of normal scenes, we suggest normal density estimation of ITAE features through normalizing flow (NF)-based generative models to learn the tractable likelihoods and identify anomalies using out of distribution detection.

Anomaly Detection Density Estimation +3

Paper
Add Code

Smoother Network Tuning and Interpolation for Continuous-level Image Processing

no code implementations • 5 Oct 2020 • Hyeongmin Lee, Taeoh Kim, Hanbin Son, Sangwook Baek, Minsu Cheon, Sangyoun Lee

Extensive results for various image processing tasks indicate that the performance of FTN is comparable in multiple continuous levels, and is significantly smoother and lighter than that of other frameworks.

Paper
Add Code

Enhanced Standard Compatible Image Compression Framework based on Auxiliary Codec Networks

no code implementations • 30 Sep 2020 • Hanbin Son, Taeoh Kim, Hyeongmin Lee, Sangyoun Lee

The postprocessing network increases the quality of decoded images using an example-based learning.

Image Compression

Paper
Add Code

PMVOS: Pixel-Level Matching-Based Video Object Segmentation

no code implementations • 18 Sep 2020 • Suhwan Cho, Heansung Lee, Sungmin Woo, Sungjun Jang, Sangyoun Lee

Semi-supervised video object segmentation (VOS) aims to segment arbitrary target objects in video when the ground truth segmentation mask of the initial frame is provided.

Object One-shot visual object segmentation +3

Paper
Add Code

Learning Temporally Invariant and Localizable Features via Data Augmentation for Video Recognition

1 code implementation • 13 Aug 2020 • Taeoh Kim, Hyeongmin Lee, MyeongAh Cho, Ho Seong Lee, Dong Heon Cho, Sangyoun Lee

Based on our novel temporal data augmentation algorithms, video recognition performances are improved using only a limited amount of training data compared to the spatial-only data augmentation algorithms, including the 1st Visual Inductive Priors (VIPriors) for data-efficient action recognition challenge.

Action Recognition Data Augmentation +1

Paper
Code

Extrapolative-Interpolative Cycle-Consistency Learning for Video Frame Extrapolation

no code implementations • 27 May 2020 • Sangjin Lee, Hyeongmin Lee, Taeoh Kim, Sangyoun Lee

Unlike previous studies that usually have been focused on the design of modules or construction of networks, we propose a novel Extrapolative-Interpolative Cycle (EIC) loss using pre-trained frame interpolation module to improve extrapolation performance.

Paper
Add Code

False Positive Removal for 3D Vehicle Detection with Penetrated Point Classifier

no code implementations • 27 May 2020 • Sungmin Woo, Sangwon Hwang, Woojin Kim, Junhyeop Lee, Dogyoon Lee, Sangyoun Lee

Recently, researchers have been leveraging LiDAR point cloud for higher accuracy in 3D vehicle detection.

Paper
Add Code

Regularized Adaptation for Stable and Efficient Continuous-Level Learning on Image Processing Networks

no code implementations • 11 Mar 2020 • Hyeongmin Lee, Taeoh Kim, Hanbin Son, Sangwook Baek, Minsu Cheon, Sangyoun Lee

In this paper, we propose a novel continuous-level learning framework using a Filter Transition Network (FTN) which is a non-linear module that easily adapt to new levels, and is regularized to prevent undesirable side-effects.

Paper
Add Code

Relational Deep Feature Learning for Heterogeneous Face Recognition

no code implementations • 2 Mar 2020 • MyeongAh Cho, Taeoh Kim, Ig-Jae Kim, Kyungjae Lee, Sangyoun Lee

Due to the lack of databases, HFR methods usually exploit the pre-trained features on a large-scale visual database that contain general facial information.

Face Recognition Heterogeneous Face Recognition

Paper
Add Code

CRVOS: Clue Refining Network for Video Object Segmentation

1 code implementation • 10 Feb 2020 • Suhwan Cho, MyeongAh Cho, Tae-young Chung, Heansung Lee, Sangyoun Lee

The encoder-decoder based methods for semi-supervised video object segmentation (Semi-VOS) have received extensive attention due to their superior performances.

Ranked #60 on Semi-Supervised Video Object Segmentation on DAVIS 2016