Search Results for author: Nian Shao

Found 4 papers, 4 papers with code

Frame-wise streaming end-to-end speaker diarization with non-autoregressive self-attention-based attractors

1 code implementation25 Sep 2023 Di Liang, Nian Shao, Xiaofei Li

This work proposes a frame-wise online/streaming end-to-end neural diarization (FS-EEND) method in a frame-in-frame-out fashion.

speaker-diarization Speaker Diarization

Fine-tune the pretrained ATST model for sound event detection

1 code implementation15 Sep 2023 Nian Shao, Xian Li, Xiaofei Li

In this work, we study the fine-tuning method of the pretrained models for SED.

 Ranked #1 on Sound Event Detection on DESED (using extra training data)

Event Detection Self-Supervised Learning +1

Self-supervised Audio Teacher-Student Transformer for Both Clip-level and Frame-level Tasks

2 code implementations7 Jun 2023 Xian Li, Nian Shao, Xiaofei Li

In order to tackle both clip-level and frame-level tasks, this paper proposes Audio Teacher-Student Transformer (ATST), with a clip-level version (named ATST-Clip) and a frame-level version (named ATST-Frame), responsible for learning clip-level and frame-level representations, respectively.

Audio Classification Audio Tagging +8

RCT: Random Consistency Training for Semi-supervised Sound Event Detection

2 code implementations21 Oct 2021 Nian Shao, Erfan Loweimi, Xiaofei Li

Sound event detection (SED), as a core module of acoustic environmental analysis, suffers from the problem of data deficiency.

Data Augmentation Event Detection +1

Cannot find the paper you are looking for? You can Submit a new open access paper.