no code implementations • 24 Oct 2023 • Yuanfeng Song, Yuanqin He, Xuefang Zhao, Hanlin Gu, Di Jiang, Haijun Yang, Lixin Fan, Qiang Yang
The rise of Large Language Models (LLMs) has shifted the community from single-task-oriented natural language processing (NLP) research to a holistic, end-to-end multi-task learning paradigm.
no code implementations • 29 Jul 2023 • Yuanfeng Song, Xuefang Zhao, Raymond Chi-Wing Wong
Since this task has not been studied in the literature, we first build a benchmark dataset named Dial-NVBench, consisting of dialogue sessions, each with a sequence of queries from a user and responses from the system.
no code implementations • 4 Jan 2022 • Yuanfeng Song, Raymond Chi-Wing Wong, Xuefang Zhao, Di Jiang
We first identify a new task named Speech-to-SQL, which aims to understand the information conveyed by human speech and directly translate it into structured query language (SQL) statements.
Automatic Speech Recognition (ASR)
no code implementations • 25 Oct 2019 • Yuanfeng Song, Di Jiang, Xuefang Zhao, Qian Xu, Raymond Chi-Wing Wong, Lixin Fan, Qiang Yang
Modern Automatic Speech Recognition (ASR) systems primarily rely on scores from an Acoustic Model (AM) and a Language Model (LM) to rescore the N-best lists.
Automatic Speech Recognition (ASR)
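The N-best rescoring mentioned in the abstract above can be illustrated with a minimal Python sketch. It assumes the common formulation in which each hypothesis carries an acoustic model log-score and a language model log-score, combined as a weighted sum. The `Hypothesis` class, the `rescore_nbest` function, the `lm_weight` parameter, and all scores are hypothetical and chosen for illustration. This is not the paper's method.

```python
from dataclasses import dataclass


@dataclass
class Hypothesis:
    text: str
    am_score: float  # acoustic model log-score (illustrative value)
    lm_score: float  # language model log-score (illustrative value)


def rescore_nbest(nbest, lm_weight=0.5):
    """Sort hypotheses by am_score + lm_weight * lm_score, best first.

    The weighted sum is an assumed combination rule, not one taken
    from the paper.
    """
    return sorted(
        nbest,
        key=lambda h: h.am_score + lm_weight * h.lm_score,
        reverse=True,
    )


# A toy 2-best list: the second hypothesis fits the audio slightly
# better, but the language model finds it far less plausible.
nbest = [
    Hypothesis("recognize speech", am_score=-12.0, lm_score=-4.0),
    Hypothesis("wreck a nice beach", am_score=-11.5, lm_score=-9.0),
]

best_with_lm = rescore_nbest(nbest, lm_weight=0.5)[0]
best_am_only = rescore_nbest(nbest, lm_weight=0.0)[0]
```

In this toy list, the language model score changes the top hypothesis: with `lm_weight=0.0` the ranking follows the acoustic score alone, while `lm_weight=0.5` promotes the more fluent transcript.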