Search Results for author: Doyeon Kim

Found 17 papers, 8 papers with code

MF-PAM: Accurate Pitch Estimation through Periodicity Analysis and Multi-level Feature Fusion

1 code implementation16 Jun 2023 Woo-Jin Chung, Doyeon Kim, Soo-Whan Chung, Hong-Goo Kang

We introduce Multi-level feature Fusion-based Periodicity Analysis Model (MF-PAM), a novel deep learning-based pitch estimation model that accurately estimates pitch trajectory in noisy and reverberant acoustic environments.

Audio Signal Processing

HD-DEMUCS: General Speech Restoration with Heterogeneous Decoders

no code implementations2 Jun 2023 Doyeon Kim, Soo-Whan Chung, Hyewon Han, Youna Ji, Hong-Goo Kang

This paper introduces an end-to-end neural speech restoration model, HD-DEMUCS, demonstrating efficacy across multiple distortion environments.

Context-Preserving Two-Stage Video Domain Translation for Portrait Stylization

no code implementations30 May 2023 Doyeon Kim, Eunji Ko, Hyunsu Kim, Yunji Kim, Junho Kim, Dongchan Min, Junmo Kim, Sung Ju Hwang

Portrait stylization, which translates a real human face image into an artistically stylized image, has attracted considerable interest and many prior works have shown impressive quality in recent years.

Translation

Fix the Noise: Disentangling Source Feature for Controllable Domain Translation

1 code implementation CVPR 2023 Dongyeun Lee, Jae Young Lee, Doyeon Kim, Jaehyun Choi, Jaejun Yoo, Junmo Kim

This allows our method to smoothly control the degree to which it preserves source features while generating images from an entirely new domain using only a single model.

Transfer Learning Translation

Learning Audio-Text Agreement for Open-vocabulary Keyword Spotting

1 code implementation30 Jun 2022 Hyeon-Kyeong Shin, Hyewon Han, Doyeon Kim, Soo-Whan Chung, Hong-Goo Kang

In this paper, we propose a novel end-to-end user-defined keyword spotting method that utilizes linguistically corresponding patterns between speech and text sequences.

Keyword Spotting

Fix the Noise: Disentangling Source Feature for Transfer Learning of StyleGAN

1 code implementation29 Apr 2022 Dongyeun Lee, Jae Young Lee, Doyeon Kim, Jaehyun Choi, Junmo Kim

Owing to the disentangled feature space, our method can smoothly control the degree of the source features in a single model.

Transfer Learning

Feature Structure Distillation with Centered Kernel Alignment in BERT Transferring

1 code implementation1 Apr 2022 Hee-Jun Jung, Doyeon Kim, Seung-Hoon Na, Kangil Kim

To resolve it in transferring, we investigate distillation of structures of representations specified to three types: intra-feature, local inter-feature, global inter-feature structures.

Knowledge Distillation Language Modelling

Phase Continuity: Learning Derivatives of Phase Spectrum for Speech Enhancement

no code implementations24 Feb 2022 Doyeon Kim, Hyewon Han, Hyeon-Kyeong Shin, Soo-Whan Chung, Hong-Goo Kang

Modern neural speech enhancement models usually include various forms of phase information in their training loss terms, either explicitly or implicitly.

Speech Enhancement

FrePGAN: Robust Deepfake Detection Using Frequency-level Perturbations

no code implementations7 Feb 2022 Yonghyun Jeong, Doyeon Kim, Youngmin Ro, Jongwon Choi

For experiments, we design new test scenarios varying from the training settings in GAN models, color manipulations, and object categories.

DeepFake Detection Face Swapping

Global-Local Path Networks for Monocular Depth Estimation with Vertical CutDepth

3 code implementations19 Jan 2022 Doyeon Kim, Woonghyun Ka, Pyungwhan Ahn, Donggyu Joo, Sehwan Chun, Junmo Kim

Depth estimation from a single image is an important task that can be applied to various fields in computer vision, and has grown rapidly with the development of convolutional neural networks.

Monocular Depth Estimation

A Worker-Task Specialization Model for Crowdsourcing: Efficient Inference and Fundamental Limits

1 code implementation19 Nov 2021 Doyeon Kim, Jeonghwan Lee, Hye Won Chung

Inferring correct labels from multiple noisy answers on data, however, has been a challenging problem, since the quality of the answers varies widely across tasks and workers.

Self-supervised GAN Detector

no code implementations12 Nov 2021 Yonghyun Jeong, Doyeon Kim, Pyounggeon Kim, Youngmin Ro, Jongwon Choi

Although the recent advancement in generative models brings diverse advantages to society, it can also be abused with malicious purposes, such as fraud, defamation, and fake news.

MToFNet: Object Anti-Spoofing with Mobile Time-of-Flight Data

no code implementations6 Oct 2021 Yonghyun Jeong, Doyeon Kim, Jaehyeon Lee, Minki Hong, Solbi Hwang, Jongwon Choi

When images are recaptured on display screens, various patterns differing by the screens as known as the moir\'e patterns can be also captured in spoof images.

FICGAN: Facial Identity Controllable GAN for De-identification

no code implementations2 Oct 2021 Yonghyun Jeong, Jooyoung Choi, Sungwon Kim, Youngmin Ro, Tae-Hyun Oh, Doyeon Kim, Heonseok Ha, Sungroh Yoon

In this work, we present Facial Identity Controllable GAN (FICGAN) for not only generating high-quality de-identified face images with ensured privacy protection, but also detailed controllability on attribute preservation for enhanced data utility.

Attribute De-identification

BiHPF: Bilateral High-Pass Filters for Robust Deepfake Detection

no code implementations16 Aug 2021 Yonghyun Jeong, Doyeon Kim, Seungjai Min, Seongho Joe, Youngjune Gwon, Jongwon Choi

The advancement in numerous generative models has a two-fold effect: a simple and easy generation of realistic synthesized images, but also an increased risk of malicious abuse of those images.

DeepFake Detection Face Swapping +1

TricubeNet: 2D Kernel-Based Object Representation for Weakly-Occluded Oriented Object Detection

1 code implementation23 Apr 2021 Beomyoung Kim, Janghyeon Lee, Sihaeng Lee, Doyeon Kim, Junmo Kim

We present a novel approach for oriented object detection, named TricubeNet, which localizes oriented objects using visual cues ($i. e.,$ heatmap) instead of oriented box offsets regression.

Object object-detection +3

Cannot find the paper you are looking for? You can Submit a new open access paper.