no code implementations • COLING 2022 • Zhengyuan Liu, Shikang Ni, Ai Ti Aw, Nancy F. Chen
In this work, we introduce a joint paraphrasing task of creole translation and text normalization of Singlish messages, which can shed light on how to process other language varieties and dialects.
no code implementations • 10 Sep 2024 • Wenyu Zhang, Shuo Sun, Bin Wang, Xunlong Zou, Zhuohan Liu, Yingxu He, Geyu Lin, Nancy F. Chen, Ai Ti Aw
The rapid advancements in large language models (LLMs) have significantly enhanced natural language processing capabilities, facilitating the development of AudioLLMs that process and understand speech and audio inputs alongside text.
no code implementations • 26 Aug 2024 • Kuluhan Binici, Abhinav Ramesh Kashyap, Viktor Schlegel, Andy T. Liu, Vijay Prakash Dwivedi, Thanh-Tung Nguyen, Xiaoxue Gao, Nancy F. Chen, Stefan Winkler
Experimental results show that LLMs can effectively model ASR noise, and incorporating this noisy data into the training process significantly improves the robustness and accuracy of medical dialogue summarization systems.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +3
1 code implementation • 16 Aug 2024 • Do Xuan Long, Hai Nguyen Ngoc, Tiviatis Sim, Hieu Dao, Shafiq Joty, Kenji Kawaguchi, Nancy F. Chen, Min-Yen Kan
We present the first systematic evaluation examining format bias in performance of large language models (LLMs).
1 code implementation • 13 Aug 2024 • Perry Lam, Huayun Zhang, Nancy F. Chen, Berrak Sisman, Dorien Herremans
We attain 25. 3% hanzi CER and 13. 0% pinyin CER with the JETS model.
no code implementations • 2 Jul 2024 • Xiaoxue Gao, Yiming Chen, Xianghu Yue, Yu Tsao, Nancy F. Chen
In this paper, we propose TTSlow, a novel adversarial approach specifically tailored to slow down the speech generation process in TTS systems.
1 code implementation • 23 Jun 2024 • Bin Wang, Xunlong Zou, Geyu Lin, Shuo Sun, Zhuohan Liu, Wenyu Zhang, Zhengyuan Liu, AiTi Aw, Nancy F. Chen
We introduce AudioBench, a universal benchmark designed to evaluate Audio Large Language Models (AudioLLMs).
no code implementations • 24 May 2024 • Minzhi Li, Zhengyuan Liu, Shumin Deng, Shafiq Joty, Nancy F. Chen, Min-Yen Kan
The acceleration of Large Language Models (LLMs) research has opened up new possibilities for evaluating generated texts.
2 code implementations • 6 May 2024 • Bin Wang, Geyu Lin, Zhengyuan Liu, Chengwei Wei, Nancy F. Chen
Large language models (LLMs) have rapidly evolved as the foundation of various natural language processing (NLP) applications.
1 code implementation • 18 Apr 2024 • Geyu Lin, Bin Wang, Zhengyuan Liu, Nancy F. Chen
This performance discrepancy mainly stems from the imbalanced distribution of training data across languages during pre-training and instruction tuning stages.
no code implementations • 15 Apr 2024 • Bin Wang, Chengwei Wei, Zhengyuan Liu, Geyu Lin, Nancy F. Chen
As the rapidly advancing domain of natural language processing (NLP), large language models (LLMs) have emerged as powerful tools for interpreting human commands and generating text across various tasks.
Automatic Speech Recognition Optical Character Recognition +3
no code implementations • 10 Apr 2024 • Zhengyuan Liu, Stella Xin Yin, Geyu Lin, Nancy F. Chen
Intelligent Tutoring Systems (ITSs) can provide personalized and self-paced learning experience.
no code implementations • 4 Apr 2024 • Zhengyuan Liu, Stella Xin Yin, Carolyn Lee, Nancy F. Chen
Intelligent tutoring systems (ITSs) that imitate human tutors and aim to provide immediate and customized instructions or feedback to learners have shown their effectiveness in education.
no code implementations • 17 Mar 2024 • Taha Aksu, Nancy F. Chen
Current metrics for evaluating Dialogue State Tracking (DST) systems exhibit three primary limitations.
2 code implementations • 1 Feb 2024 • Fangkai Jiao, Chengwei Qin, Zhengyuan Liu, Nancy F. Chen, Shafiq Joty
Large Language Models (LLMs) have demonstrated significant potential in handling complex reasoning tasks through step-by-step rationale generation.
1 code implementation • 31 Jan 2024 • Florian Le Bronnec, Song Duong, Mathieu Ravaut, Alexandre Allauzen, Nancy F. Chen, Vincent Guigue, Alberto Lumbreras, Laure Soulier, Patrick Gallinari
State-space models are a low-complexity alternative to transformers for encoding long sequences and capturing long-term dependencies.
no code implementations • 15 Dec 2023 • Zhengyuan Liu, Nancy F. Chen
In this work, we investigate the attention head selection and manipulation strategy for feature injection from a network pruning perspective, and conduct a case study on dialogue summarization.
1 code implementation • 5 Dec 2023 • Xuan Long Do, Yiran Zhao, Hannah Brown, Yuxi Xie, James Xu Zhao, Nancy F. Chen, Kenji Kawaguchi, Michael Shieh, Junxian He
We propose a new method, Adversarial In-Context Learning (adv-ICL), to optimize prompt for in-context learning (ICL) by employing one LLM as a generator, another as a discriminator, and a third as a prompt modifier.
no code implementations • 14 Nov 2023 • Xuan Long Do, Kenji Kawaguchi, Min-Yen Kan, Nancy F. Chen
Aligning language models (LMs) with human opinion is challenging yet vital to enhance their grasp of human values, preferences, and beliefs.
1 code implementation • 13 Nov 2023 • Vernon Toh Yan Han, Ratish Puduppully, Nancy F. Chen
Our findings indicate that our approach, which incorporates unit consistency, currently slightly underperforms compared to an approach that does not.
1 code implementation • 8 Nov 2023 • Zhengyuan Liu, Hai Leong Chieu, Nancy F. Chen
Data collection from manual labeling provides domain-specific and task-aligned supervision for data-driven approaches, and a critical mass of well-annotated resources is required to achieve reasonable performance in natural language processing tasks.
1 code implementation • 24 Oct 2023 • Minzhi Li, Taiwei Shi, Caleb Ziems, Min-Yen Kan, Nancy F. Chen, Zhengyuan Liu, Diyi Yang
Annotated data plays a critical role in Natural Language Processing (NLP) in training models and evaluating their performance.
1 code implementation • 21 Oct 2023 • Minh Nguyen, Nancy F. Chen
NLP models excel on tasks with clean inputs, but are less accurate with noisy inputs.
1 code implementation • 17 Oct 2023 • Bin Wang, Zhengyuan Liu, Nancy F. Chen
With the advancement of instruction-finetuned language models, we introduce instruction-tuning to dialogues to expand the capability set of dialogue summarization models.
Ranked #1 on Text Summarization on DialogSum
1 code implementation • 16 Oct 2023 • Mathieu Ravaut, Aixin Sun, Nancy F. Chen, Shafiq Joty
In this paper, we conduct the first comprehensive study on context utilization and position bias in summarization.
no code implementations • 5 Oct 2023 • Litton J Kurisinkel, Nancy F. Chen
Multi-document summarization is a challenging task due to its inherent subjective bias, highlighted by the low inter-annotator ROUGE-1 score of 0. 4 among DUC-2004 reference summaries.
1 code implementation • 9 Sep 2023 • Bin Wang, Zhengyuan Liu, Xin Huang, Fangkai Jiao, Yang Ding, AiTi Aw, Nancy F. Chen
We present SeaEval, a benchmark for multilingual foundation models.
1 code implementation • 7 Jun 2023 • Taha Aksu, Min-Yen Kan, Nancy F. Chen
A challenge in the Dialogue State Tracking (DST) field is adapting models to new domains without using any supervised data, zero-shot domain adaptation.
no code implementations • 5 Jun 2023 • Jeremy H. M. Wong, Huayun Zhang, Nancy F. Chen
The standard Gaussian Process (GP) only considers a single output sample per input in the training set.
1 code implementation • 31 May 2023 • Zhengyuan Liu, Yong Keong Yap, Hai Leong Chieu, Nancy F. Chen
Stance detection determines whether the author of a piece of text is in favor of, against, or neutral towards a specified target, and can be used to gain valuable insights into social media.
2 code implementations • 23 May 2023 • Fangkai Jiao, Zhiyang Teng, Bosheng Ding, Zhengyuan Liu, Nancy F. Chen, Shafiq Joty
Existing efforts to improve logical reasoning ability of language models have predominantly relied on supervised fine-tuning, hindering generalization to new domains and/or tasks.
1 code implementation • 22 May 2023 • Ratish Puduppully, Anoop Kunchukuttan, Raj Dabre, Ai Ti Aw, Nancy F. Chen
This study investigates machine translation between related languages i. e., languages within the same family that share linguistic characteristics such as word order and lexical similarity.
no code implementations • 10 May 2023 • Zhiqiang Hu, Roy Ka-Wei Lee, Nancy F. Chen
Adapting a large language model for multiple-attribute text style transfer via fine-tuning can be challenging due to the significant amount of computational resources and labeled data required for the specific task.
1 code implementation • 4 May 2023 • Xuan Long Do, Bowei Zou, Shafiq Joty, Anh Tai Tran, Liangming Pan, Nancy F. Chen, Ai Ti Aw
In addition, we propose Conv-Distinct, a novel evaluation metric for CQG, to evaluate the diversity of the generated conversation from a context.
no code implementations • 14 Nov 2022 • Perry Lam, Huayun Zhang, Nancy F. Chen, Berrak Sisman, Dorien Herremans
Text-to-speech (TTS) models have achieved remarkable naturalness in recent years, yet like most deep neural models, they have more parameters than necessary.
no code implementations • 24 Oct 2022 • Zhiqiang Hu, Roy Kaa-Wei Lee, Nancy F. Chen
We hope by releasing the impolite dialogue corpus and establishing the benchmark evaluations, more researchers are encouraged to investigate this new challenging research task.
1 code implementation • 17 Oct 2022 • Mathieu Ravaut, Shafiq Joty, Nancy F. Chen
To bypass this limitation, we propose a new paradigm in second-stage abstractive summarization called SummaFusion that fuses several summary candidates to produce a novel abstractive second-stage summary.
no code implementations • 22 Sep 2022 • Perry Lam, Huayun Zhang, Nancy F. Chen, Berrak Sisman
In this work, we seek to answer the question: what are the characteristics of selected sparse techniques on the performance and model complexity?
1 code implementation • COLING 2022 • Xuan Long Do, Bowei Zou, Liangming Pan, Nancy F. Chen, Shafiq Joty, Ai Ti Aw
While previous studies mainly focus on how to model the flow and alignment of the conversation, there has been no thorough study to date on which parts of the context and history are necessary for the model.
1 code implementation • 1 Aug 2022 • Ratish Puduppully, Parag Jain, Nancy F. Chen, Mark Steedman
In Multi-Document Summarization (MDS), the input can be modeled as a set of documents, and the output is its summary.
no code implementations • 24 Jul 2022 • Gia H. Ngo, Minh Nguyen, Nancy F. Chen, Mert R. Sabuncu
In this paper, we present Text2Brain, an easy to use tool for synthesizing brain activation maps from open-ended text queries.
1 code implementation • NAACL 2022 • Hung Le, Nancy F. Chen, Steven C. H. Hoi
Specifically, we introduce a novel dialogue state tracking task to track the information of visual objects that are mentioned in video-grounded dialogues.
no code implementations • 6 Jun 2022 • Zhengyuan Liu, Pavitra Krishnaswamy, Nancy F. Chen
There is growing interest in the automated extraction of relevant information from clinical dialogues.
1 code implementation • Findings (NAACL) 2022 • Zhengyuan Liu, Nancy F. Chen
To take advantage of both supervised and unsupervised paradigms and tackle the challenges, in this work, we propose a semi-supervised framework for text style transfer.
1 code implementation • ACL 2022 • Mathieu Ravaut, Shafiq Joty, Nancy F. Chen
Sequence-to-sequence neural networks have recently achieved great success in abstractive summarization, especially through fine-tuning large pre-trained language models on the downstream dataset.
Ranked #2 on Document Summarization on CNN / Daily Mail
no code implementations • 18 Feb 2022 • Yuling Gu, Nancy F. Chen
In tense and lax vowel pairs, we also consistently observe that the distinction is less conspicuous for Singaporean children compared to the other speaker groups.
2 code implementations • 29 Jan 2022 • Yizheng Huang, Nana Hou, Nancy F. Chen
Catastrophic forgetting is a thorny challenge when updating keyword spotting (KWS) models after deployment.
1 code implementation • CODI 2021 • Zhengyuan Liu, Ke Shi, Nancy F. Chen
While previous work significantly improves the performance of RST discourse parsing, they are not readily applicable to practical use cases: (1) EDU segmentation is not integrated into most existing tree parsing frameworks, thus it is not straightforward to apply such models on newly-coming data.
1 code implementation • CODI 2021 • Zhengyuan Liu, Nancy F. Chen
While multi-party conversations are often less structured than monologues and documents, they are implicitly organized by semantic level correlations across the interactive turns, and dialogue discourse analysis can be applied to predict the dependency structure and relations between the elementary discourse units, and provide feature-rich structural information for downstream tasks.
Ranked #4 on Discourse Parsing on STAC
2 code implementations • 28 Sep 2021 • Gia H. Ngo, Minh Nguyen, Nancy F. Chen, Mert R. Sabuncu
In this work, we propose Text2Brain, a neural network approach for coordinate-based meta-analysis of neuroimaging studies to synthesize brain activation maps from open-ended text queries.
1 code implementation • EMNLP 2021 • Zhengyuan Liu, Nancy F. Chen
The conditional sequences are modulated to decide what types of information or what perspective to focus on when forming summaries to tackle the under-constrained problem in summarization tasks.
no code implementations • 31 Aug 2021 • Zhengyuan Liu, Nancy F. Chen
In this work, we first analyze the linguistic characteristics of meeting transcripts on a representative corpus, and find that the sentences comprising the summary correlate with the meeting agenda.
no code implementations • 8 Jul 2021 • Huayun Zhang, Ke Shi, Nancy F. Chen
While speech evaluation on English has been popular, automatic speech scoring on low resource languages remains challenging.
no code implementations • 16 Jun 2021 • Hung Le, Nancy F. Chen, Steven C. H. Hoi
Video-grounded dialogue systems aim to integrate video understanding and dialogue understanding to generate responses that are relevant to both the dialogue and video context.
1 code implementation • SIGDIAL (ACL) 2021 • Zhengyuan Liu, Ke Shi, Nancy F. Chen
Summarizing conversations via neural approaches has been gaining research traction lately, yet it is still challenging to obtain practical solutions.
no code implementations • 16 Apr 2021 • Hung Le, Nancy F. Chen, Steven C. H. Hoi
Neural module networks (NMN) have achieved success in image-grounded tasks such as Visual Question Answering (VQA) on synthetic images.
no code implementations • ICLR 2021 • Hung Le, Nancy F. Chen, Steven C. H. Hoi
PDC model then learns to predict reasoning paths over this semantic graph.
no code implementations • Findings (ACL) 2022 • Taha Aksu, Zhengyuan Liu, Min-Yen Kan, Nancy F. Chen
Augmentation of task-oriented dialogues has followed standard methods used for plain-text such as back-translation, word-level manipulation, and paraphrasing despite its richly annotated structure.
no code implementations • 23 Feb 2021 • Richeng Duan, Nancy F. Chen
Acoustic modeling for child speech is challenging due to the high acoustic variability caused by physiological differences in the vocal tract.
no code implementations • 1 Jan 2021 • Hung Le, Nancy F. Chen, Steven Hoi
Neural module networks (NMN) have achieved success in image-grounded tasks such as question answering (QA) on synthetic images.
no code implementations • 21 Dec 2020 • Ke Shi, Zhengyuan Liu, Nancy F. Chen
Document-level discourse parsing, in accordance with the Rhetorical Structure Theory (RST), remains notoriously challenging.
1 code implementation • COLING 2020 • Zhengyuan Liu, Ke Shi, Nancy F. Chen
Text discourse parsing plays an important role in understanding information flow and argumentative structure in natural language.
1 code implementation • EMNLP 2020 • Hung Le, Doyen Sahoo, Nancy F. Chen, Steven C. H. Hoi
Video-grounded dialogues are very challenging due to (i) the complexity of videos which contain both spatial and temporal variations, and (ii) the complexity of user utterances which query different segments and/or different objects in videos over multiple dialogue turns.
1 code implementation • 27 Aug 2020 • Minh Nguyen, Gia H. Ngo, Nancy F. Chen
Spell check is a useful application which processes noisy human-generated text.
no code implementations • Findings of the Association for Computational Linguistics 2020 • Zhengyuan Liu, Ke Shi, Nancy F. Chen
In this paper, we propose a neural framework that can flexibly control summary generation by introducing a set of sub-aspect functions (i. e. importance, diversity, position).
1 code implementation • EMNLP 2020 • Hung Le, Doyen Sahoo, Chenghao Liu, Nancy F. Chen, Steven C. H. Hoi
Building an end-to-end conversational agent for multi-domain task-oriented dialogues has been an open challenge for two main reasons.
no code implementations • 25 Feb 2020 • Hung Le, Nancy F. Chen
Audio-Visual Scene-Aware Dialog (AVSD) is an extension from Video Question Answering (QA) whereby the dialogue agent is required to generate natural language responses to address user queries and carry on conversations.
1 code implementation • 20 Dec 2019 • Minh Nguyen, Gia H. Ngo, Nancy F. Chen
Using recursive neural network imposes a prior on the mapping from logographs to embeddings since the network must read in the sub-units in logographs according to the order specified by the recursive structures.
no code implementations • 16 Oct 2019 • Jiewen Wu, Luis Fernando D'Haro, Nancy F. Chen, Pavitra Krishnaswamy, Rafael E. Banchs
We propose an architecture to jointly learn word and label embeddings for slot filling in spoken language understanding.
no code implementations • 3 Oct 2019 • Zhengyuan Liu, Angela Ng, Sheldon Lee, Ai Ti Aw, Nancy F. Chen
Such linguistic characteristics of dialogue topics make sentence-level extractive summarization approaches used in spoken documents ill-suited for summarizing conversations.
1 code implementation • ACL 2019 • Hung Le, Doyen Sahoo, Nancy F. Chen, Steven C. H. Hoi
Developing Video-Grounded Dialogue Systems (VGDS), where a dialogue is conducted based on visual and audio aspects of a given video, is significantly more challenging than traditional image or text-grounded dialogue systems because (1) feature space of videos span across multiple picture frames, making it difficult to obtain semantic information; and (2) a dialogue agent must perceive and process information from different modalities (audio, video, caption, etc.)
Ranked #4 on Response Generation on SIMMC2.0
no code implementations • NAACL 2019 • Zhengyuan Liu, Hazel Lim, Nur Farah Ain Binte Suhaimi, Shao Chuen Tong, Sharon Ong, Angela Ng, Sheldon Lee, Michael R. Macdonald, Savitha Ramasamy, Pavitra Krishnaswamy, Wai Leng Chow, Nancy F. Chen
Data for human-human spoken dialogues for research and development are currently very limited in quantity, variety, and sources; such data are even scarcer in healthcare.
no code implementations • 7 Oct 2018 • Gia H. Ngo, Minh Nguyen, Nancy F. Chen
The problem is compounded by the limited linguistic resources available when converting foreign words to transliterated words in the target language.
1 code implementation • EMNLP 2018 • Minh Nguyen, Gia H. Ngo, Nancy F. Chen
Graphemes of most languages encode pronunciation, though some are more explicit than others.