no code implementations • 24 May 2025 • Chen Yang, Ruping Xu, Ruizhe Li, Bin Cao, Jing Fan
We evaluate ExIde using 12 state-of-the-art (SOTA) LLMs on the BPRF dataset, benchmarking performance on both rule extraction and dependency classification tasks of current LLMs.
no code implementations • 22 May 2025 • Ruizhe Li, Chen Chen, Yuchen Hu, Yanjun Gao, Xi Wang, Emine Yilmaz
Retrieval-Augmented Generation (RAG) leverages large language models (LLMs) combined with external contexts to enhance the accuracy and reliability of generated responses.
no code implementations • 4 Apr 2025 • Xianyuan Liu, Jiayang Zhang, Shuo Zhou, Thijs L. van der Plas, Avish Vijayaraghavan, Anastasiia Grishina, Mengdie Zhuang, Daniel Schofield, Christopher Tomlinson, YuHan Wang, Ruizhe Li, Louisa van Zeeland, Sina Tabakhi, Cyndie Demeocq, Xiang Li, Arunav Das, Orlando Timmerman, Thomas Baldwin-McDonald, Jinge Wu, Peizhen Bai, Zahraa Al Sahili, Omnia Alwazzan, Thao N. Do, Mohammod N. I. Suvon, Angeline Wang, Lucia Cipolina-Kun, Luigi A. Moretti, Lucas Farndale, Nitisha Jain, Natalia Efremova, Yan Ge, Marta Varela, Hak-Keung Lam, Oya Celiktutan, Ben R. Evans, Alejandro Coca-Castro, Honghan Wu, Zahraa S. Abdallah, Chen Chen, Valentin Danchev, Nataliya Tkachenko, Lei Lu, Tingting Zhu, Gregory G. Slabaugh, Roger K. Moore, William K. Cheung, Peter H. Charlton, Haiping Lu
Multimodal artificial intelligence (AI) integrates diverse types of data via machine learning to improve understanding, prediction, and decision-making across disciplines such as healthcare, science, and engineering.
1 code implementation • 22 Dec 2024 • Zhengqian Wu, Ruizhe Li, Zijun Xu, Zhongyuan Wang, Chunxia Xiao, Chao Liang
However, existing DVU datasets rarely organize questions according to these story topics, making them difficult to comprehensively assess VideoQA models' DVU capability of complex storylines.
no code implementations • 28 Oct 2024 • Yiyang Guo, Ruizhe Li, Mude Hui, Hanzhong Guo, Chen Zhang, Chuangjian Cai, Le Wan, Shangfei Wang
Invisible watermarking is essential for safeguarding digital content, enabling copyright protection and content authentication.
1 code implementation • 2 Oct 2024 • Yuxuan Zhang, Ruizhe Li
This approach reduces inference time to less than twice that of single LoRA inference by leveraging parallel computation.
no code implementations • 13 Aug 2024 • Gangyi Zhang, Chongming Gao, Hang Pan, Runzhe Teng, Ruizhe Li
Existing Conversational Recommender Systems (CRS) predominantly utilize user simulators for training and evaluating recommendation policies.
no code implementations • 31 Jul 2024 • Xi Wang, Procheta Sen, Ruizhe Li, Emine Yilmaz
Despite the success of integrating large language models into the development of conversational systems, many studies have shown the effectiveness of retrieving and augmenting external knowledge for informative responses.
no code implementations • 26 Jun 2024 • Jiazhou Ji, Ruizhe Li, Shujun Li, Jie Guo, Weidong Qiu, Zheng Huang, Chiyu Chen, Xiaoyu Jiang, Xinru Lu
Instead, we introduce a novel ternary text classification scheme, adding an "undecided" category for texts that could be attributed to either source, and we show that this new category is crucial to understand how to make the detection result more explainable to lay users.
no code implementations • 21 Jun 2024 • Jinge Wu, Zhaolong Wu, Ruizhe Li, Abul Hasan, Yunsoo Kim, Jason P. Y. Cheung, Teng Zhang, Honghan Wu
This study proposes an approach for error correction in radiology reports, leveraging large language models (LLMs) and retrieval-augmented generation (RAG) techniques.
no code implementations • 21 May 2024 • Xinyi Wang, Grazziela Figueredo, Ruizhe Li, Wei Emma Zhang, Weitong Chen, Xin Chen
The aim is to provide comprehensive and rich information for researchers interested in automatic clinical report generation and medical image analysis, especially when using multimodal inputs, and assist them in developing new algorithms to advance the field.
1 code implementation • 16 May 2024 • Ruizhe Li, Grazziela Figueredo, Dorothee Auer, Christian Wagner, Xin Chen
To address this challenge, this paper proposes a mask-guided encoder-decoder DCNN-based image registration method, named as MrRegNet.
no code implementations • 16 May 2024 • Yuchen Hu, Chen Chen, Chengwei Qin, Qiushi Zhu, Eng Siong Chng, Ruizhe Li
Recent advances in large language models (LLMs) have promoted generative error correction (GER) for automatic speech recognition (ASR), which aims to predict the ground-truth transcription from the decoded N-best hypotheses.
Automatic Speech Recognition
Automatic Speech Recognition (ASR)
+3
no code implementations • 16 May 2024 • Chen Chen, Ruizhe Li, Yuchen Hu, YuanYuan Chen, Chengwei Qin, Qiang Zhang
Experimental results show that HESIT effectively alleviates catastrophic forgetting by exemplar selection, and achieves state-of-the-art performance on the largest CL benchmark of ToDs in terms of all metrics.
1 code implementation • 6 May 2024 • Ruizhe Li, Yanjun Gao
By updating these vectors within MLP and recalibrating attention patterns to neutralise the preference for the first choice 'A', we effectively mitigate the anchored bias.
no code implementations • 3 Mar 2024 • Jiangbo Pei, Ruizhe Li, Aidong Men, Yang Liu, Xiahai Zhuang, Qingchao Chen
This paper introduces Zoo-MSFDA, a more general setting that allows each source domain to offer a zoo of multiple source models with different architectures.
1 code implementation • 10 Feb 2024 • Yuchen Hu, Chen Chen, Chao-Han Huck Yang, Ruizhe Li, Dong Zhang, Zhehuai Chen, Eng Siong Chng
Leveraging the rich linguistic knowledge and strong reasoning abilities of LLMs, our new paradigm can integrate the rich information in N-best candidates to generate a higher-quality translation result.
Ranked #1 on
Machine Translation
on FLoRes-200
1 code implementation • 8 Feb 2024 • Chen Chen, Ruizhe Li, Yuchen Hu, Sabato Marco Siniscalchi, Pin-Yu Chen, EnSiong Chng, Chao-Han Huck Yang
Recent studies have successfully shown that large language models (LLMs) can be successfully used for generative error correction (GER) on top of the automatic speech recognition (ASR) output.
Ranked #4 on
Speech Recognition
on WSJ eval92
(using extra training data)
Audio-Visual Speech Recognition
Automatic Speech Recognition
+3
1 code implementation • 19 Jan 2024 • Yuchen Hu, Chen Chen, Chao-Han Huck Yang, Ruizhe Li, Chao Zhang, Pin-Yu Chen, EnSiong Chng
To this end, we propose to extract a language-space noise embedding from the N-best list to represent the noise conditions of source speech, which can promote the denoising process in GER.
Automatic Speech Recognition
Automatic Speech Recognition (ASR)
+6
no code implementations • 28 Aug 2023 • Yanjun Gao, Ruizhe Li, John Caskey, Dmitriy Dligach, Timothy Miller, Matthew M. Churpek, Majid Afshar
In this paper, we outline an innovative approach for augmenting the proficiency of LLMs in the realm of automated diagnosis generation, achieved through the incorporation of a medical knowledge graph (KG) and a novel graph model: Dr. Knows, inspired by the clinical diagnostic reasoning process.
1 code implementation • 16 Jul 2023 • Yuchen Hu, Chen Chen, Ruizhe Li, Qiushi Zhu, Eng Siong Chng
In this paper, we propose a noise-aware speech enhancement (NASE) approach that extracts noise-specific information to guide the reverse process in diffusion model.
1 code implementation • 18 Jun 2023 • Yuchen Hu, Ruizhe Li, Chen Chen, Chengwei Qin, Qiushi Zhu, Eng Siong Chng
In this work, we investigate the noise-invariant visual modality to strengthen robustness of AVSR, which can adapt to any testing noises while without dependence on noisy training data, a. k. a., unsupervised noise adaptation.
1 code implementation • 18 Jun 2023 • Yuchen Hu, Chen Chen, Ruizhe Li, Heqing Zou, Eng Siong Chng
In this paper, we aim to learn the shared representations across modalities to bridge their gap.
1 code implementation • 16 May 2023 • Yuchen Hu, Ruizhe Li, Chen Chen, Heqing Zou, Qiushi Zhu, Eng Siong Chng
However, most existing AVSR approaches simply fuse the audio and visual features by concatenation, without explicit interactions to capture the deep correlations between them, which results in sub-optimal multimodal representations for downstream speech recognition task.
Audio-Visual Speech Recognition
Automatic Speech Recognition
+3
1 code implementation • 22 Feb 2023 • Yuchen Hu, Chen Chen, Ruizhe Li, Qiushi Zhu, Eng Siong Chng
In this paper, we propose a simple yet effective approach called gradient remedy (GR) to solve interference between task gradients in noise-robust speech recognition, from perspectives of both angle and magnitude.
Automatic Speech Recognition
Automatic Speech Recognition (ASR)
+4
1 code implementation • 14 Oct 2022 • Ruizhe Li, Xin Chen
The final trained model was also evaluated on an independent test set by the CMRxMotion organisers, which achieved the classification accuracy of 72. 5% and Cohen's Kappa of 0. 6309 (ranked top 1 in this grand challenge).
no code implementations • 7 Oct 2021 • Ruizhe Li, Xutan Peng, Chenghua Lin
In this paper, we provide the first focused study on the discontinuities (aka.
no code implementations • 29 Sep 2021 • Ruizhe Li, Xutan Peng, Chenghua Lin
In this paper, we provide the first focused study on the discontinuities (aka.
1 code implementation • INLG (ACL) 2021 • Chengkun Zeng, Guanyi Chen, Chenghua Lin, Ruizhe Li, Zhigang Chen
Understanding speaker's feelings and producing appropriate responses with emotion connection is a key communicative skill for empathetic dialogue systems.
1 code implementation • 3 Aug 2021 • Ruizhe Li, Matteo Bastiani, Dorothee Auer, Christian Wagner, Xin Chen
The proposed method was evaluated on a public brain MRI data set for age estimation.
1 code implementation • COLING 2020 • Ruizhe Li, Xiao Li, Guanyi Chen, Chenghua Lin
The Variational Autoencoder (VAE) is a popular and powerful model applied to text modelling to generate diverse sentences.
no code implementations • EMNLP 2020 • Xiao Li, Guanyi Chen, Chenghua Lin, Ruizhe Li
We propose DGST, a novel and simple Dual-Generator network architecture for text Style Transfer.
1 code implementation • 28 Apr 2020 • Mina Jafari, Ruizhe Li, Yue Xing, Dorothee Auer, Susan Francis, Jonathan Garibaldi, Xin Chen
In this paper, we present a generic deep convolutional neural network (DCNN) for multi-class image segmentation.
1 code implementation • 16 Apr 2020 • Ruizhe Li, Dorothee Auer, Christian Wagner, Xin Chen
To address this problem, we propose a generic semi-supervised learning framework for image segmentation based on a deep convolutional neural network (DCNN).
1 code implementation • WS 2019 • Ruizhe Li, Xiao Li, Chenghua Lin, Matthew Collinson, Rui Mao
Variational Autoencoder (VAE) is a powerful method for learning representations of high-dimensional data.
2 code implementations • ICML 2020 • Xiao Li, Chenghua Lin, Ruizhe Li, Chaozheng Wang, Frank Guerin
We demonstrate the utility of our method for attribute manipulation in autoencoders trained across varied domains, using both human evaluation and automated methods.
Ranked #7 on
Image Generation
on CelebA 256x256
(FID metric)
no code implementations • CONLL 2019 • Ruizhe Li, Chenghua Lin, Matthew Collinson, Xiao Li, Guanyi Chen
Recognising dialogue acts (DA) is important for many natural language processing tasks such as dialogue generation and intention recognition.
Ranked #4 on
Dialogue Act Classification
on Switchboard corpus
no code implementations • SEMEVAL 2018 • Rui Mao, Guanyi Chen, Ruizhe Li, Chenghua Lin
This paper describes the system that we submitted for SemEval-2018 task 10: capturing discriminative attributes.