Search Results for author: Xiaoyu Bie

Found 5 papers, 3 papers with code

Speech Modeling with a Hierarchical Transformer Dynamical VAE

no code implementations • 7 Mar 2023 • Xiaoyu Lin, Xiaoyu Bie, Simon Leglaive, Laurent Girin, Xavier Alameda-Pineda

The dynamical variational autoencoders (DVAEs) are a family of latent-variable deep generative models that extends the VAE to model a sequence of observed data and a corresponding sequence of latent vectors.

Speech Enhancement

Paper
Add Code

HiT-DVAE: Human Motion Generation via Hierarchical Transformer Dynamical VAE

no code implementations • 4 Apr 2022 • Xiaoyu Bie, Wen Guo, Simon Leglaive, Lauren Girin, Francesc Moreno-Noguer, Xavier Alameda-Pineda

Studies on the automatic processing of 3D human pose data have flourished in the recent past.

motion prediction

Paper
Add Code

Unsupervised Speech Enhancement using Dynamical Variational Auto-Encoders

1 code implementation • 23 Jun 2021 • Xiaoyu Bie, Simon Leglaive, Xavier Alameda-Pineda, Laurent Girin

We propose an unsupervised speech enhancement algorithm that combines a DVAE speech prior pre-trained on clean speech signals with a noise model based on nonnegative matrix factorization, and we derive a variational expectation-maximization (VEM) algorithm to perform speech enhancement.

Representation Learning Speech Enhancement +2

Paper
Code

Multi-Person Extreme Motion Prediction

1 code implementation • CVPR 2022 • Wen Guo, Xiaoyu Bie, Xavier Alameda-Pineda, Francesc Moreno-Noguer

In this paper, we explore this problem when dealing with humans performing collaborative tasks, we seek to predict the future motion of two interacted persons given two sequences of their past skeletons.

Ranked #2 on Multi-Person Pose forecasting on Expi - unseen actions split

Human motion prediction motion prediction +2

Paper
Code

Dynamical Variational Autoencoders: A Comprehensive Review

1 code implementation • 28 Aug 2020 • Laurent Girin, Simon Leglaive, Xiaoyu Bie, Julien Diard, Thomas Hueber, Xavier Alameda-Pineda

Recently, a series of papers have presented different extensions of the VAE to process sequential data, which model not only the latent space but also the temporal dependencies within a sequence of data vectors and corresponding latent vectors, relying on recurrent neural networks or state-space models.

3D Human Dynamics Resynthesis +2

195

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.