Search Results for author: Ya Jiang

Found 5 papers, 0 papers with code

The USTC-NERCSLIP Systems for the CHiME-8 MMCSG Challenge

no code implementations8 Oct 2024 Ya Jiang, Hongbo Lan, Jun Du, Qing Wang, Shutong Niu

In the two-person conversation scenario with one wearing smart glasses, transcribing and displaying the speaker's content in real-time is an intriguing application, providing a priori information for subsequent tasks such as translation and comprehension.

speech-recognition Speech Recognition

Multi-Bit Distortion-Free Watermarking for Large Language Models

no code implementations26 Feb 2024 Massieh Kordi Boroujeny, Ya Jiang, Kai Zeng, Brian Mark

Methods for watermarking large language models have been proposed that distinguish AI-generated text from human-generated text by slightly altering the model output distribution, but they also distort the quality of the text, exposing the watermark to adversarial detection.

Decoder

Deep Learning Based Audio-Visual Multi-Speaker DOA Estimation Using Permutation-Free Loss Function

no code implementations26 Oct 2022 Qing Wang, Hang Chen, Ya Jiang, Zhe Wang, Yuyang Wang, Jun Du, Chin-Hui Lee

In this paper, we propose a deep learning based multi-speaker direction of arrival (DOA) estimation with audio and visual signals by using permutation-free loss function.

Active Speaker Detection Sound Source Localization

Cannot find the paper you are looking for? You can Submit a new open access paper.