no code implementations • COLING 2022 • Nina Zhou, Ai Ti Aw, Zhuo Han Liu, Cher heng Tan, Yonghan Ting, Wen Xiang Chen, Jordan sim zheng Ting
Clinical data annotation has been one of the major obstacles for applying machine learning approaches in clinical NLP.
no code implementations • Findings (EMNLP) 2021 • Yeqiu Li, Bowei Zou, Zhifeng Li, Ai Ti Aw, Yu Hong, Qiaoming Zhu
However, the current reasoning models suffer from the noises in the retrieved knowledge.
no code implementations • NAACL (BEA) 2022 • Bowei Zou, Pengfei Li, Liangming Pan, Ai Ti Aw
In field of teaching, true/false questioning is an important educational method for assessing students’ general understanding of learning materials.
no code implementations • COLING 2022 • Zhengyuan Liu, Shikang Ni, Ai Ti Aw, Nancy F. Chen
In this work, we introduce a joint paraphrasing task of creole translation and text normalization of Singlish messages, which can shed light on how to process other language varieties and dialects.
no code implementations • 7 Jun 2025 • Wenyu Zhang, Yingxu He, Geyu Lin, Zhuohan Liu, Shuo Sun, Bin Wang, Xunlong Zou, Jeremy H. M. Wong, Qiongqiong Wang, Hardik B. Sailor, Nancy F. Chen, Ai Ti Aw
Audio Large Language Models (AudioLLMs) have achieved strong results in semantic tasks like speech recognition and translation, but remain limited in modeling paralinguistic cues such as emotion.
no code implementations • 29 May 2025 • Longyin Zhang, Bowei Zou, Ai Ti Aw
The inherent nature of social media posts, characterized by the freedom of language use with a disjointed array of diverse opinions and topics, poses significant challenges to downstream NLP tasks such as comment clustering, comment summarization, and social media opinion analysis.
no code implementations • 19 May 2025 • Qiongqiong Wang, Hardik B. Sailor, Tianchi Liu, Ai Ti Aw
Current speech-LLMs exhibit limited capability in contextual reasoning alongside paralinguistic understanding, primarily due to the lack of Question-Answer (QA) datasets that cover both aspects.
no code implementations • 27 Jan 2025 • Xin Huang, Tarun Kumar Vangani, Zhengyuan Liu, Bowei Zou, Ai Ti Aw
Large language models have shown impressive multilingual capabilities through pretraining on diverse corpora.
no code implementations • 21 Dec 2024 • Xin Huang, Tarun Kumar Vangani, Minh Duc Pham, Xunlong Zou, Bin Wang, Zhengyuan Liu, Ai Ti Aw
Multilingual large language models (MLLMs) have shown impressive capabilities across a variety of languages.
no code implementations • 16 Dec 2024 • Muhammad Huzaifah, Geyu Lin, Tianchi Liu, Hardik B. Sailor, Kye Min Tan, Tarun K. Vangani, Qiongqiong Wang, Jeremy H. M. Wong, Nancy F. Chen, Ai Ti Aw
This technical report describes the MERaLiON-SpeechEncoder, a foundation model designed to support a wide range of downstream speech applications.
no code implementations • 13 Dec 2024 • Yingxu He, Zhuohan Liu, Shuo Sun, Bin Wang, Wenyu Zhang, Xunlong Zou, Nancy F. Chen, Ai Ti Aw
We introduce MERaLiON-AudioLLM (Multimodal Empathetic Reasoning and Learning in One Network), the first speech-text model tailored for Singapore's multilingual and multicultural landscape.
no code implementations • 10 Sep 2024 • Wenyu Zhang, Shuo Sun, Bin Wang, Xunlong Zou, Zhuohan Liu, Yingxu He, Geyu Lin, Nancy F. Chen, Ai Ti Aw
The rapid advancements in large language models (LLMs) have significantly enhanced natural language processing capabilities, facilitating the development of AudioLLMs that process and understand speech and audio inputs alongside text.
no code implementations • 10 Dec 2023 • Xin Tan, Bowei Zou, Ai Ti Aw
Universal fact-checking systems for real-world claims face significant challenges in gathering valid and sufficient real-time evidence and making reasoned decisions.
1 code implementation • 22 May 2023 • Ratish Puduppully, Anoop Kunchukuttan, Raj Dabre, Ai Ti Aw, Nancy F. Chen
This study investigates machine translation between related languages i. e., languages within the same family that share linguistic characteristics such as word order and lexical similarity.
1 code implementation • 4 May 2023 • Xuan Long Do, Bowei Zou, Shafiq Joty, Anh Tai Tran, Liangming Pan, Nancy F. Chen, Ai Ti Aw
In addition, we propose Conv-Distinct, a novel evaluation metric for CQG, to evaluate the diversity of the generated conversation from a context.
1 code implementation • COLING 2022 • Xuan Long Do, Bowei Zou, Liangming Pan, Nancy F. Chen, Shafiq Joty, Ai Ti Aw
While previous studies mainly focus on how to model the flow and alignment of the conversation, there has been no thorough study to date on which parts of the context and history are necessary for the model.
1 code implementation • 31 May 2022 • Xuan-Phi Nguyen, Shafiq Joty, Wu Kui, Ai Ti Aw
Numerous recent work on unsupervised machine translation (UMT) implies that competent unsupervised translations of low-resource and unrelated languages, such as Nepali or Sinhala, are only possible if the model is trained in a massive multilingual environment, where these low-resource languages are mixed with high-resource counterparts.
no code implementations • 28 Feb 2022 • Weiwen Xu, Bowei Zou, Wai Lam, Ai Ti Aw
Recent techniques in Question Answering (QA) have gained remarkable performance improvement with some QA models even surpassed human performance.
no code implementations • NAACL 2021 • Litton J Kurisinkel, Ai Ti Aw, Nancy F Chen
Neural models for text generation are often designed in an end-to-end fashion, typically with zero control over intermediate computations, limiting their practical usability in downstream applications.
1 code implementation • NAACL 2021 • Weiwen Xu, Ai Ti Aw, Yang Ding, Kui Wu, Shafiq Joty
Neural Machine Translation (NMT) has achieved significant breakthrough in performance but is known to suffer vulnerability to input perturbations.
no code implementations • COLING 2020 • Zhengyuan Liu, Pavitra Krishnaswamy, Ai Ti Aw, Nancy Chen
While neural approaches have achieved significant improvement in machine comprehension tasks, models often work as a black-box, resulting in lower interpretability, which requires special attention in domains such as healthcare or education.
1 code implementation • 3 Jun 2020 • Xuan-Phi Nguyen, Shafiq Joty, Thanh-Tung Nguyen, Wu Kui, Ai Ti Aw
Recent unsupervised machine translation (UMT) systems usually employ three main principles: initialization, language modeling and iterative back-translation, though they may apply them differently.
2 code implementations • NeurIPS 2020 • Xuan-Phi Nguyen, Shafiq Joty, Wu Kui, Ai Ti Aw
Our method achieves state-of-the-art BLEU scores of 30. 7 and 43. 7 in the WMT'14 English-German and English-French translation tasks, respectively.
Ranked #9 on
Machine Translation
on WMT2014 English-German
no code implementations • WS 2019 • Chenglei Si, Kui Wu, Ai Ti Aw, Min-Yen Kan
We conducted tests with both sentiment and non-sentiment bearing contexts to examine the effectiveness of our methods.
no code implementations • IJCNLP 2019 • Wenqiang Lei, Weiwen Xu, Ai Ti Aw, Yuanxin Xiang, Tat Seng Chua
While achieving great fluency, current machine translation (MT) techniques are bottle-necked by adequacy issues.
no code implementations • 3 Oct 2019 • Zhengyuan Liu, Angela Ng, Sheldon Lee, Ai Ti Aw, Nancy F. Chen
Such linguistic characteristics of dialogue topics make sentence-level extractive summarization approaches used in spoken documents ill-suited for summarizing conversations.
no code implementations • WS 2018 • Zhongwei Li, Xuancong Wang, Ai Ti Aw, Eng Siong Chng, Haizhou Li
Customized translation need pay spe-cial attention to the target domain ter-minology especially the named-entities for the domain.