Search Results for author: Chin-Yun Yu

Found 8 papers, 8 papers with code

Differentiable All-pole Filters for Time-varying Audio Systems

7 code implementations • 11 Apr 2024 • Chin-Yun Yu, Christopher Mitcheltree, Alistair Carson, Stefan Bilbao, Joshua D. Reiss, György Fazekas

Infinite impulse response filters are an essential building block of many time-varying audio systems, such as audio effects and synthesisers.

Audio Effects Modeling Audio Synthesis

144

Paper
Code

Zero-Shot Duet Singing Voices Separation with Diffusion Models

1 code implementation • 13 Nov 2023 • Chin-Yun Yu, Emilian Postolache, Emanuele Rodolà, György Fazekas

In this paper, we examine this problem in the context of duet singing voices separation, and propose a method to enforce the coherency of singer identity by splitting the mixture into overlapping segments and performing posterior sampling in an auto-regressive manner, conditioning on the previous segment.

Paper
Code

TorchAudio 2.1: Advancing speech recognition, self-supervised learning, and audio processing components for PyTorch

1 code implementation • 27 Oct 2023 • Jeff Hwang, Moto Hira, Caroline Chen, Xiaohui Zhang, Zhaoheng Ni, Guangzhi Sun, Pingchuan Ma, Ruizhe Huang, Vineel Pratap, Yuekai Zhang, Anurag Kumar, Chin-Yun Yu, Chuang Zhu, Chunxi Liu, Jacob Kahn, Mirco Ravanelli, Peng Sun, Shinji Watanabe, Yangyang Shi, Yumeng Tao, Robin Scheibler, Samuele Cornell, Sean Kim, Stavros Petridis

TorchAudio is an open-source audio and speech processing library built for PyTorch.

Self-Supervised Learning Speech Enhancement +2

2,379

Paper
Code

Singing Voice Synthesis Using Differentiable LPC and Glottal-Flow-Inspired Wavetables

2 code implementations • 29 Jun 2023 • Chin-Yun Yu, György Fazekas

This paper introduces GlOttal-flow LPC Filter (GOLF), a novel method for singing voice synthesis (SVS) that exploits the physical characteristics of the human voice using differentiable digital signal processing.

Singing Voice Synthesis

Paper
Code

Conditioning and Sampling in Variational Diffusion Models for Speech Super-Resolution

1 code implementation • 27 Oct 2022 • Chin-Yun Yu, Sung-Lin Yeh, György Fazekas, Hao Tang

Moreover, by coupling the proposed sampling method with an unconditional DM, i. e., a DM with no auxiliary inputs to its noise predictor, we can generalize it to a wide range of SR setups.

Super-Resolution

Paper
Code

Danna-Sep: Unite to separate them all

1 code implementation • 7 Dec 2021 • Chin-Yun Yu, Kin-Wai Cheuk

Deep learning-based music source separation has gained a lot of interest in the last decades.

Music Source Separation

Paper
Code

Music Demixing Challenge 2021

1 code implementation • 31 Aug 2021 • Yuki Mitsufuji, Giorgio Fabbro, Stefan Uhlich, Fabian-Robert Stöter, Alexandre Défossez, Minseok Kim, Woosung Choi, Chin-Yun Yu, Kin-Wai Cheuk

The main differences compared with the past challenges are 1) the competition is designed to more easily allow machine learning practitioners from other disciplines to participate, 2) evaluation is done on a hidden test set created by music professionals dedicated exclusively to the challenge to assure the transparency of the challenge, i. e., the test set is not accessible from anyone except the challenge organizers, and 3) the dataset provides a wider range of music genres and involved a greater number of mixing engineers.

Music Source Separation

Paper
Code

Multi-layered Cepstrum for Instantaneous Frequency Estimation

1 code implementation • 1 Feb 2019 • Chin-Yun Yu, Li Su

We propose the multi-layered cepstrum (MLC) method to estimate multiple fundamental frequencies (MF0) of a signal under challenging contamination such as high-pass filter noise.

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.