Search Results for author: Xinyuan Zhou

Found 5 papers, 1 papers with code

The USTC-NELSLIP Offline Speech Translation Systems for IWSLT 2022

no code implementations IWSLT (ACL) 2022 Weitai Zhang, Zhongyi Ye, Haitao Tang, Xiaoxi Li, Xinyuan Zhou, Jing Yang, Jianwei Cui, Dan Liu, Junhua Liu, LiRong Dai

This paper describes USTC-NELSLIP’s submissions to the IWSLT 2022 Offline Speech Translation task, including speech translation of talks from English to German, English to Chinese and English to Japanese.

Translation

DiffS2UT: A Semantic Preserving Diffusion Model for Textless Direct Speech-to-Speech Translation

no code implementations26 Oct 2023 Yongxin Zhu, Zhujin Gao, Xinyuan Zhou, Zhongyi Ye, Linli Xu

While Diffusion Generative Models have achieved great success on image generation tasks, how to efficiently and effectively incorporate them into speech generation especially translation tasks remains a non-trivial problem.

Image Generation Speech-to-Speech Translation +1

Data-Centric Financial Large Language Models

no code implementations7 Oct 2023 Zhixuan Chu, Huaiyu Guo, Xinyuan Zhou, Yijia Wang, Fei Yu, Hong Chen, Wanqing Xu, Xin Lu, Qing Cui, Longfei Li, Jun Zhou, Sheng Li

Large language models (LLMs) show promise for natural language tasks but struggle when applied directly to complex domains like finance.

CNN-based Discriminative Training for Domain Compensation in Acoustic Event Detection with Frame-wise Classifier

no code implementations26 Mar 2021 Tiantian Tang, Xinyuan Zhou, Yanhua Long, Yijie Li, Jiaen Liang

Domain mismatch is a noteworthy issue in acoustic event detection tasks, as the target domain data is difficult to access in most real applications.

Event Detection

Multi-channel target speech extraction with channel decorrelation and target speaker adaptation

1 code implementation19 Oct 2020 Jiangyu Han, Xinyuan Zhou, Yanhua Long, Yijie Li

In this work, we propose two methods for exploiting the multi-channel spatial information to extract the target speech.

Speech Extraction Audio and Speech Processing

Cannot find the paper you are looking for? You can Submit a new open access paper.