Search Results for author: Yanpei Shi

Found 9 papers, 1 paper with code

T-vectors: Weakly Supervised Speaker Identification Using Hierarchical Transformer Model

no code implementations • 29 Oct 2020 • Yanpei Shi, Mingjie Chen, Qiang Huang, Thomas Hain

The use of a memory mechanism could reach 10.6% and 7.7% relative improvement compared with not using a memory mechanism.

Speaker Identification

Towards Low-Resource StarGAN Voice Conversion using Weight Adaptive Instance Normalization

1 code implementation • 22 Oct 2020 • Mingjie Chen, Yanpei Shi, Thomas Hain

In this work, we aim at improving the data efficiency of the model and achieving a many-to-many non-parallel StarGAN-based voice conversion for a relatively large number of speakers with limited training samples.

Sound • Audio and Speech Processing

Weakly Supervised Training of Hierarchical Attention Networks for Speaker Identification

no code implementations • 15 May 2020 • Yanpei Shi, Qiang Huang, Thomas Hain

To evaluate the effectiveness of the proposed approach, artificial datasets based on Switchboard Cellular Part 1 (SWBC) and VoxCeleb1 are constructed in two conditions: one where speakers' voices overlap and one where they do not.

Speaker Identification

Speaker Re-identification with Speaker Dependent Speech Enhancement

no code implementations • 15 May 2020 • Yanpei Shi, Qiang Huang, Thomas Hain

The results show that the proposed approach using speaker-dependent speech enhancement can yield better speaker recognition and speech enhancement performance than two baselines in various noise conditions.

Speaker Recognition • Speech Enhancement

Robust Speaker Recognition Using Speech Enhancement And Attention Model

no code implementations • 14 Jan 2020 • Yanpei Shi, Qiang Huang, Thomas Hain

Instead of individually processing speech enhancement and speaker recognition, the two modules are integrated into one framework by a joint optimisation using deep neural networks.

Speaker Identification • Speaker Recognition • +1

Supervised Speaker Embedding De-Mixing in Two-Speaker Environment

no code implementations • 14 Jan 2020 • Yanpei Shi, Thomas Hain

The proposed approach separates different speaker properties from a two-speaker signal in embedding space.

Speaker Identification • Vocal Bursts Valence Prediction

H-VECTORS: Utterance-level Speaker Embedding Using A Hierarchical Attention Model

no code implementations • 17 Oct 2019 • Yanpei Shi, Qiang Huang, Thomas Hain

In the proposed approach, a frame-level encoder and attention are applied to segments of an input utterance to generate individual segment vectors.

Speaker Identification

Contextual Joint Factor Acoustic Embeddings

no code implementations • 16 Oct 2019 • Yanpei Shi, Thomas Hain

To evaluate the effectiveness of our approaches compared to prior work, two tasks, phone classification and speaker recognition, are conducted and tested on different TIMIT data sets.

Classification • General Classification • +1

Improving Noise Robustness In Speaker Identification Using A Two-Stage Attention Model

no code implementations • 24 Sep 2019 • Yanpei Shi, Qiang Huang, Thomas Hain

While the use of deep neural networks has significantly boosted speaker recognition performance, it is still challenging to separate speakers in poor acoustic environments.

Speaker Identification • Speaker Recognition
