Search Results for author: Xiaoyu Bie

Found 8 papers, 3 papers with code

Using Random Codebooks for Audio Neural AutoEncoders

no code implementations25 Sep 2024 Benoît Giniès, Xiaoyu Bie, Olivier Fercoq, Gaël Richard

Latent representation learning has been an active field of study for decades in numerous applications.

Audio Compression Quantization +1

Learning Source Disentanglement in Neural Audio Codec

no code implementations17 Sep 2024 Xiaoyu Bie, Xubo Liu, Gaël Richard

Neural audio codecs have significantly advanced audio compression by efficiently converting continuous audio signals into discrete tokens.

Audio Compression Audio Generation +2

WaveTransfer: A Flexible End-to-end Multi-instrument Timbre Transfer with Diffusion

no code implementations6 Sep 2024 Teysir Baoueb, Xiaoyu Bie, Hicham Janati, Gael Richard

As diffusion-based deep generative models gain prevalence, researchers are actively investigating their potential applications across various domains, including music synthesis and style alteration.

Denoising Scheduling

Speech Modeling with a Hierarchical Transformer Dynamical VAE

no code implementations7 Mar 2023 Xiaoyu Lin, Xiaoyu Bie, Simon Leglaive, Laurent Girin, Xavier Alameda-Pineda

The dynamical variational autoencoders (DVAEs) are a family of latent-variable deep generative models that extends the VAE to model a sequence of observed data and a corresponding sequence of latent vectors.

Speech Enhancement

Unsupervised Speech Enhancement using Dynamical Variational Auto-Encoders

1 code implementation23 Jun 2021 Xiaoyu Bie, Simon Leglaive, Xavier Alameda-Pineda, Laurent Girin

We propose an unsupervised speech enhancement algorithm that combines a DVAE speech prior pre-trained on clean speech signals with a noise model based on nonnegative matrix factorization, and we derive a variational expectation-maximization (VEM) algorithm to perform speech enhancement.

Representation Learning Speech Enhancement +2

Multi-Person Extreme Motion Prediction

1 code implementation CVPR 2022 Wen Guo, Xiaoyu Bie, Xavier Alameda-Pineda, Francesc Moreno-Noguer

In this paper, we explore this problem when dealing with humans performing collaborative tasks, we seek to predict the future motion of two interacted persons given two sequences of their past skeletons.

Human motion prediction motion prediction +2

Dynamical Variational Autoencoders: A Comprehensive Review

1 code implementation28 Aug 2020 Laurent Girin, Simon Leglaive, Xiaoyu Bie, Julien Diard, Thomas Hueber, Xavier Alameda-Pineda

Recently, a series of papers have presented different extensions of the VAE to process sequential data, which model not only the latent space but also the temporal dependencies within a sequence of data vectors and corresponding latent vectors, relying on recurrent neural networks or state-space models.

3D Human Dynamics Resynthesis +3

Cannot find the paper you are looking for? You can Submit a new open access paper.