Search Results for author: Hongbin Suo

Found 6 papers, 0 papers with code

Task-Agnostic Structured Pruning of Speech Representation Models

no code implementations • 2 Jun 2023 • Haoyu Wang, Siyuan Wang, Wei-Qiang Zhang, Hongbin Suo, Yulong Wan

Self-supervised pre-trained models such as Wav2vec2, Hubert, and WavLM have been shown to significantly improve many speech tasks.

Model Compression

Paper
Add Code

Multilingual Zero Resource Speech Recognition Base on Self-Supervise Pre-Trained Acoustic Models

no code implementations • 13 Oct 2022 • Haoyu Wang, Wei-Qiang Zhang, Hongbin Suo, Yulong Wan

Labeled audio data is insufficient to build satisfying speech recognition systems for most of the languages in the world.

Language Modelling speech-recognition +1

Paper
Add Code

Reformulating Speaker Diarization as Community Detection With Emphasis On Topological Structure

no code implementations • 26 Apr 2022 • Siqi Zheng, Hongbin Suo

In this paper we propose to view clustering-based diarization as a community detection problem.

Clustering Community Detection +2

Paper
Add Code

Graph Convolutional Network Based Semi-Supervised Learning on Multi-Speaker Meeting Data

no code implementations • 25 Apr 2022 • Fuchuan Tong, Siqi Zheng, Min Zhang, Yafeng Chen, Hongbin Suo, Qingyang Hong, Lin Li

In this work, we present a GCN-based approach for semi-supervised learning.

Clustering Speaker Recognition

Paper
Add Code

BeamTransformer: Microphone Array-based Overlapping Speech Detection

no code implementations • 9 Sep 2021 • Siqi Zheng, Shiliang Zhang, Weilong Huang, Qian Chen, Hongbin Suo, Ming Lei, Jinwei Feng, Zhijie Yan

We propose BeamTransformer, an efficient architecture to leverage beamformer's edge in spatial filtering and transformer's capability in context sequence modeling.

Paper
Add Code

A Real-time Speaker Diarization System Based on Spatial Spectrum

no code implementations • 20 Jul 2021 • Siqi Zheng, Weilong Huang, Xianliang Wang, Hongbin Suo, Jinwei Feng, Zhijie Yan

In this paper we describe a speaker diarization system that enables localization and identification of all speakers present in a conversation or meeting.

speaker-diarization Speaker Diarization +1

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.