Search Results for author: Doyeon Kim

Found 17 papers, 8 papers with code

MF-PAM: Accurate Pitch Estimation through Periodicity Analysis and Multi-level Feature Fusion

1 code implementation • 16 Jun 2023 • Woo-Jin Chung, Doyeon Kim, Soo-Whan Chung, Hong-Goo Kang

We introduce Multi-level feature Fusion-based Periodicity Analysis Model (MF-PAM), a novel deep learning-based pitch estimation model that accurately estimates pitch trajectory in noisy and reverberant acoustic environments.

Audio Signal Processing

Paper
Code

HD-DEMUCS: General Speech Restoration with Heterogeneous Decoders

no code implementations • 2 Jun 2023 • Doyeon Kim, Soo-Whan Chung, Hyewon Han, Youna Ji, Hong-Goo Kang

This paper introduces an end-to-end neural speech restoration model, HD-DEMUCS, demonstrating efficacy across multiple distortion environments.

Paper
Add Code

Context-Preserving Two-Stage Video Domain Translation for Portrait Stylization

no code implementations • 30 May 2023 • Doyeon Kim, Eunji Ko, Hyunsu Kim, Yunji Kim, Junho Kim, Dongchan Min, Junmo Kim, Sung Ju Hwang

Portrait stylization, which translates a real human face image into an artistically stylized image, has attracted considerable interest and many prior works have shown impressive quality in recent years.

Translation

Paper
Add Code

Fix the Noise: Disentangling Source Feature for Controllable Domain Translation

1 code implementation • CVPR 2023 • Dongyeun Lee, Jae Young Lee, Doyeon Kim, Jaehyun Choi, Jaejun Yoo, Junmo Kim

This allows our method to smoothly control the degree to which it preserves source features while generating images from an entirely new domain using only a single model.

Transfer Learning Translation

169

Paper
Code

Learning Audio-Text Agreement for Open-vocabulary Keyword Spotting

1 code implementation • 30 Jun 2022 • Hyeon-Kyeong Shin, Hyewon Han, Doyeon Kim, Soo-Whan Chung, Hong-Goo Kang

In this paper, we propose a novel end-to-end user-defined keyword spotting method that utilizes linguistically corresponding patterns between speech and text sequences.

Keyword Spotting

Paper
Code

Fix the Noise: Disentangling Source Feature for Transfer Learning of StyleGAN

1 code implementation • 29 Apr 2022 • Dongyeun Lee, Jae Young Lee, Doyeon Kim, Jaehyun Choi, Junmo Kim

Owing to the disentangled feature space, our method can smoothly control the degree of the source features in a single model.

Transfer Learning

169

Paper
Code

Feature Structure Distillation with Centered Kernel Alignment in BERT Transferring

1 code implementation • 1 Apr 2022 • Hee-Jun Jung, Doyeon Kim, Seung-Hoon Na, Kangil Kim

To resolve it in transferring, we investigate distillation of structures of representations specified to three types: intra-feature, local inter-feature, global inter-feature structures.

Knowledge Distillation Language Modelling

Paper
Code

Phase Continuity: Learning Derivatives of Phase Spectrum for Speech Enhancement

no code implementations • 24 Feb 2022 • Doyeon Kim, Hyewon Han, Hyeon-Kyeong Shin, Soo-Whan Chung, Hong-Goo Kang

Modern neural speech enhancement models usually include various forms of phase information in their training loss terms, either explicitly or implicitly.

Speech Enhancement

Paper
Add Code

FrePGAN: Robust Deepfake Detection Using Frequency-level Perturbations

no code implementations • 7 Feb 2022 • Yonghyun Jeong, Doyeon Kim, Youngmin Ro, Jongwon Choi

For experiments, we design new test scenarios varying from the training settings in GAN models, color manipulations, and object categories.

DeepFake Detection Face Swapping

Paper
Add Code

Global-Local Path Networks for Monocular Depth Estimation with Vertical CutDepth

3 code implementations • 19 Jan 2022 • Doyeon Kim, Woonghyun Ka, Pyungwhan Ahn, Donggyu Joo, Sehwan Chun, Junmo Kim

Depth estimation from a single image is an important task that can be applied to various fields in computer vision, and has grown rapidly with the development of convolutional neural networks.

Ranked #26 on Monocular Depth Estimation on KITTI Eigen split

Monocular Depth Estimation

124,527

Paper
Code

Progressive Seed Generation Auto-encoder for Unsupervised Point Cloud Learning

no code implementations • ICCV 2021 • JuYoung Yang, Pyunghwan Ahn, Doyeon Kim, Haeil Lee, Junmo Kim

With the development of 3D scanning technologies, 3D vision tasks have become a popular research area.

Ranked #8 on 3D Point Cloud Linear Classification on ModelNet40

3D Point Cloud Linear Classification Point cloud reconstruction

Paper
Add Code

A Worker-Task Specialization Model for Crowdsourcing: Efficient Inference and Fundamental Limits

1 code implementation • 19 Nov 2021 • Doyeon Kim, Jeonghwan Lee, Hye Won Chung

Inferring correct labels from multiple noisy answers on data, however, has been a challenging problem, since the quality of the answers varies widely across tasks and workers.

Paper
Code

Self-supervised GAN Detector

no code implementations • 12 Nov 2021 • Yonghyun Jeong, Doyeon Kim, Pyounggeon Kim, Youngmin Ro, Jongwon Choi

Although the recent advancement in generative models brings diverse advantages to society, it can also be abused with malicious purposes, such as fraud, defamation, and fake news.

Paper
Add Code

MToFNet: Object Anti-Spoofing with Mobile Time-of-Flight Data

no code implementations • 6 Oct 2021 • Yonghyun Jeong, Doyeon Kim, Jaehyeon Lee, Minki Hong, Solbi Hwang, Jongwon Choi

When images are recaptured on display screens, various patterns differing by the screens as known as the moir\'e patterns can be also captured in spoof images.

Paper
Add Code

FICGAN: Facial Identity Controllable GAN for De-identification

no code implementations • 2 Oct 2021 • Yonghyun Jeong, Jooyoung Choi, Sungwon Kim, Youngmin Ro, Tae-Hyun Oh, Doyeon Kim, Heonseok Ha, Sungroh Yoon

In this work, we present Facial Identity Controllable GAN (FICGAN) for not only generating high-quality de-identified face images with ensured privacy protection, but also detailed controllability on attribute preservation for enhanced data utility.

Attribute De-identification

Paper
Add Code

BiHPF: Bilateral High-Pass Filters for Robust Deepfake Detection

no code implementations • 16 Aug 2021 • Yonghyun Jeong, Doyeon Kim, Seungjai Min, Seongho Joe, Youngjune Gwon, Jongwon Choi

The advancement in numerous generative models has a two-fold effect: a simple and easy generation of realistic synthesized images, but also an increased risk of malicious abuse of those images.

DeepFake Detection Face Swapping +1

Paper
Add Code

TricubeNet: 2D Kernel-Based Object Representation for Weakly-Occluded Oriented Object Detection

1 code implementation • 23 Apr 2021 • Beomyoung Kim, Janghyeon Lee, Sihaeng Lee, Doyeon Kim, Junmo Kim

We present a novel approach for oriented object detection, named TricubeNet, which localizes oriented objects using visual cues ($i. e.,$ heatmap) instead of oriented box offsets regression.

Ranked #1 on One-stage Anchor-free Oriented Object Detection on SKU110K-R

Object object-detection +3

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.