no code implementations • IWSLT 2017 • Eunah Cho, Jan Niehues, Alex Waibel
Experiments show that generalizing rare and unknown words greatly improves the punctuation insertion performance, reaching up to 8. 8 points of improvement in F-score when applied to the out-of-domain test scenario.
no code implementations • IWSLT 2016 • Eunah Cho, Jan Niehues, Thanh-Le Ha, Matthias Sperber, Mohammed Mediani, Alex Waibel
In addition, we investigated methods to combine NMT systems that encode the input as well as the output differently.
no code implementations • IWSLT 2016 • Eunah Cho, Jan Niehues, Thanh-Le Ha, Alex Waibel
In this paper, we investigate a multilingual approach for speech disfluency removal.
no code implementations • EMNLP (NLP4ConvAI) 2021 • Eunah Cho, Ziyan Jiang, Jie Hao, Zheng Chen, Saurabh Gupta, Xing Fan, Chenlei Guo
Query rewrite (QR) is an emerging component in conversational AI systems, reducing user defect.
no code implementations • NAACL 2022 • Dingcheng Li, Zheng Chen, Eunah Cho, Jie Hao, Xiaohu Liu, Fan Xing, Chenlei Guo, Yang Liu
Seq2seq language generation models that are trained offline with multiple domains in a sequential fashion often suffer from catastrophic forgetting.
no code implementations • 15 Nov 2023 • Minqian Liu, Ying Shen, Zhiyang Xu, Yixin Cao, Eunah Cho, Vaibhav Kumar, Reza Ghanadan, Lifu Huang
Natural Language Generation (NLG) typically involves evaluating the generated text in various aspects (e. g., consistency and naturalness) to obtain a comprehensive assessment.
no code implementations • 28 Aug 2023 • Yancheng Wang, Ziyan Jiang, Zheng Chen, Fan Yang, Yingxue Zhou, Eunah Cho, Xing Fan, Xiaojiang Huang, Yanbin Lu, Yingzhen Yang
While the recommendation system (RS) has advanced significantly through deep learning, current RS approaches usually train and fine-tune models on task-specific datasets, limiting their generalizability to new recommendation tasks and their ability to leverage external knowledge due to model scale and data size constraints.
no code implementations • 23 May 2023 • Zheng Chen, Ziyan Jiang, Fan Yang, Eunah Cho, Xing Fan, Xiaojiang Huang, Yanbin Lu, Aram Galstyan
This paper presents our "Collaborative Query Rewriting" approach, which specifically addresses the task of rewriting new user interactions that have not been previously observed in the user's history.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +9
no code implementations • 12 May 2023 • Fan Yang, Zheng Chen, Ziyan Jiang, Eunah Cho, Xiaojiang Huang, Yanbin Lu
Then we adopt a LLM-based ranking model to generate recommended items.
no code implementations • 21 Feb 2023 • Jinglun Cai, Mingda Li, Ziyan Jiang, Eunah Cho, Zheng Chen, Yang Liu, Xing Fan, Chenlei Guo
Query Rewriting (QR) plays a critical role in large-scale dialogue systems for reducing frictions.
no code implementations • 22 Mar 2020 • Thai Son Nguyen, Jan Niehues, Eunah Cho, Thanh-Le Ha, Kevin Kilgour, Markus Muller, Matthias Sperber, Sebastian Stueker, Alex Waibel
User studies have shown that reducing the latency of our simultaneous lecture translation system should be the most important goal.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +2
3 code implementations • AACL (lifelongnlp) 2020 • Varun Kumar, Ashutosh Choudhary, Eunah Cho
Language model based pre-trained models such as BERT have provided significant gains across different NLP tasks.
no code implementations • WS 2019 • Zimeng Qiu, Eunah Cho, Xiaochun Ma, William Campbell
Semi-supervised learning is an efficient method to augment training data automatically from unlabeled data.
no code implementations • 9 Oct 2019 • Eunah Cho, He Xie, John P. Lalor, Varun Kumar, William M. Campbell
In addition, methods optimizing diversity can reduce training data in many cases to 50% with little impact on performance.
Natural Language Understanding Task-Oriented Dialogue Systems
no code implementations • WS 2019 • Eunah Cho, He Xie, William M. Campbell
Semi-supervised learning is an efficient way to improve performance for natural language processing systems.
no code implementations • WS 2017 • Jan Niehues, Eunah Cho
Linguistic resources such as part-of-speech (POS) tags have been extensively used in statistical machine translation (SMT) frameworks and have yielded better performances.
no code implementations • WS 2017 • Jan Niehues, Eunah Cho, Thanh-Le Ha, Alex Waibel
By separating the search space and the modeling using $n$-best list reranking, we analyze the influence of both parts of an NMT system independently.
no code implementations • COLING 2016 • Jan Niehues, Eunah Cho, Thanh-Le Ha, Alex Waibel
We analyzed the influence of the quality of the initial system on the final result.
no code implementations • NAACL 2016 • Markus M{\"u}ller, Thai Son Nguyen, Jan Niehues, Eunah Cho, Bastian Kr{\"u}ger, Thanh-Le Ha, Kevin Kilgour, Matthias Sperber, Mohammed Mediani, Sebastian St{\"u}ker, Alex Waibel
no code implementations • LREC 2014 • Eunah Cho, Sarah F{\"u}nfer, Sebastian St{\"u}ker, Alex Waibel
With the increasing number of applications handling spontaneous speech, the needs to process spoken languages become stronger.
no code implementations • LREC 2012 • Sebastian St{\"u}ker, Florian Kraft, Christian Mohr, Teresa Herrmann, Eunah Cho, Alex Waibel
Academic lectures offer valuable content, but often do not reach their full potential audience due to the language barrier.