no code implementations • 14 Apr 2025 • Xun Zhu, Fanbin Mo, Zheng Zhang, Jiaxi Wang, Yiming Shi, Ming Wu, Chuang Zhang, Miao Li, Ji Wu
In this paper, we introduce the image-centric multi-annotation X-ray dataset (IMAX), the first attempt to enhance the multi-task learning capabilities of medical multi-modal large language models (MLLMs) from the data construction level.
1 code implementation • 6 Apr 2025 • Yiming Shi, Shaoshuai Yang, Xun Zhu, Haoyu Wang, Miao Li, Ji Wu
To support reproducibility and future research, we release a modular codebase, MedM-VL, and two pre-trained models: MedM-VL-2D for 2D medical image analysis and MedM-VL-CT-Chest for 3D CT-based applications.
no code implementations • 6 Mar 2025 • Yuyou Zhang, Miao Li, William Han, Yihang Yao, Zhepeng Cen, Ding Zhao
Large Language Models (LLMs) are vulnerable to jailbreak attacks that exploit weaknesses in traditional safety alignment, which often relies on rigid refusal heuristics or representation engineering to block harmful outputs.
no code implementations • 25 Feb 2025 • Yihang Yao, Zhepeng Cen, Miao Li, William Han, Yuyou Zhang, Emerson Liu, Zuxin Liu, Chuang Gan, Ding Zhao
Large Language Models (LLMs) have demonstrated strong reasoning capabilities across various tasks.
no code implementations • 17 Feb 2025 • Xun Zhu, Zheng Zhang, Xi Chen, Yiming Shi, Miao Li, Ji Wu
With the rapid advancements in multi-modal large language models (MLLMs), connectors play a pivotal role in bridging diverse modalities and enhancing model performance.
no code implementations • 27 Jan 2025 • Miao Li, Jey Han Lau, Eduard Hovy, Mirella Lapata
Opinion summarization plays a key role in deriving meaningful insights from large-scale online reviews.
no code implementations • 31 Dec 2024 • Phuc Nguyen, Miao Li, Alexandra Morgan, Rima Arnaout, Ramy Arnaout
We show that the correlation score, Jaccard score, earth-mover's score, and Kullback-Leibler (relative-entropy) score all suffer grade inflation.
no code implementations • 21 Nov 2024 • Ke Zhao, Huayang Huang, Miao Li, Yu Wu
Language-conditioned robotic learning has significantly enhanced robot adaptability by enabling a single model to execute diverse tasks in response to verbal commands.
no code implementations • 19 Nov 2024 • Yiming Shi, Xun Zhu, Ying Hu, Chenyi Guo, Miao Li, Ji Wu
To the best of our knowledge, Med-2E3 is the first MLLM to integrate both 3D and 2D features for 3D medical image analysis.
1 code implementation • 26 Sep 2024 • Xun Zhu, Ying Hu, Fanbin Mo, Miao Li, Ji Wu
To mitigate the tug-of-war problem of multi-modal multi-task optimization in MLLMs, recent advances primarily focus on improving the LLM components, while neglecting the connector that bridges the gap between modalities.
1 code implementation • 26 Sep 2024 • Jian Gao, Xiao Zhang, Ji Wu, Miao Li
Causal language models acquire a vast amount of knowledge from general text corpora during pretraining, but the efficiency of knowledge learning is known to be unsatisfactory, especially when learning from knowledge-dense and small-sized corpora.
1 code implementation • 21 Sep 2024 • Xiao Zhang, Miao Li, Ji Wu
Pretrained language models can encode a large amount of knowledge and utilize it for various reasoning tasks, yet they can still struggle to learn novel factual knowledge effectively from finetuning on limited textual demonstrations.
2 code implementations • 18 Aug 2024 • Boyang Li, Xinyi Ying, Ruojing Li, Yongxian Liu, Yangsi Shi, Miao Li
In this paper, we briefly summarize the first competition on resource-limited infrared small target detection (namely, LimitIRSTD).
1 code implementation • 20 Jun 2024 • Xinyi Ying, Chao Xiao, Ruojing Li, Xu He, Boyang Li, Zhaoxu Li, Yingqian Wang, Mingyuan Hu, Qingyu Xu, Zaiping Lin, Miao Li, Shilin Zhou, Wei An, Weidong Sheng, Li Liu
Based on the proposed RGBT-Tiny dataset and SAFit measure, extensive evaluations have been conducted, including 23 recent state-of-the-art algorithms that cover four different types (i.e., visible generic detection, visible SOD, thermal SOD and RGBT object detection).
1 code implementation • 4 Jun 2024 • Xiao Zhang, Miao Li, Ji Wu
In this fashion, conditional finetuning achieves selective learning from a corpus, learning knowledge useful for downstream tasks while avoiding learning useless corpus statistics like topic biases.
2 code implementations • 20 May 2024 • Junlong Jia, Ying Hu, Xi Weng, Yiming Shi, Miao Li, Xingjian Zhang, Baichuan Zhou, Ziyu Liu, Jie Luo, Lei Huang, Ji Wu
We present TinyLLaVA Factory, an open-source modular codebase for small-scale large multimodal models (LMMs) with a focus on simplicity of code implementations, extensibility of new features, and reproducibility of training results.
1 code implementation • 29 Feb 2024 • Miao Li, Ming-Bin Chen, Bo Tang, Shengbin Hou, Pengyu Wang, Haiying Deng, Zhiyu Li, Feiyu Xiong, Keming Mao, Peng Cheng, Yi Luo
We present NewsBench, a novel evaluation framework to systematically assess the editorial capabilities of Large Language Models (LLMs) in Chinese journalism.
1 code implementation • 28 Feb 2024 • Miao Li, Jey Han Lau, Eduard Hovy
Modern natural language generation systems with Large Language Models (LLMs) exhibit the capability to generate a plausible summary of multiple documents; however, it is uncertain if they truly possess the capability of information consolidation to generate summaries, especially on documents with opinionated information.
no code implementations • 28 Aug 2023 • Yemin Li, Zhongcheng Liu, Xiaoying Lou, Mirigual Kurban, Miao Li, Jie Yang, Kaiwei Che, Jiankun Wang, Max Q.-H. Meng, Yan Huang, Qin Guo, Pinjin Hu
A total of 5105 images of 154 intestinal segments from 87 patients undergoing EC treatment at a center in China between March 2022 and March 2023 are scored according to the Geboes score.
no code implementations • 25 Jul 2023 • Xutian Deng, Junnan Jiang, Wen Cheng, Miao Li
As medical ultrasound becomes a prevailing examination approach, robotic ultrasound systems can facilitate the scanning process and relieve professional sonographers of repetitive and tedious work.
no code implementations • 15 Jun 2023 • Miao Li, Wenhao Ding, Ding Zhao
The prominence of embodied Artificial Intelligence (AI), which empowers robots to navigate, perceive, and engage within virtual environments, has attracted significant attention, owing to the remarkable advances in computer vision and large language models.
1 code implementation • 2 Jun 2023 • Yuxuan Zhou, Ziyu Jin, Meiwei Li, Miao Li, Xien Liu, Xinxin You, Ji Wu
The NLI4CT task aims to entail hypotheses based on Clinical Trial Reports (CTRs) and retrieve the corresponding evidence supporting the justification.
1 code implementation • 2 May 2023 • Miao Li, Eduard Hovy, Jey Han Lau
We present PeerSum, a novel dataset for generating meta-reviews of scientific papers.
1 code implementation • 15 Mar 2023 • Zhuohan Xie, Miao Li, Trevor Cohn, Jey Han Lau
Numerous evaluation metrics have been developed for natural language generation tasks, but their effectiveness in evaluating stories is limited as they are not specifically tailored to assess intricate aspects of storytelling, such as fluency and interestingness.
1 code implementation • 12 Mar 2023 • Miao Li, Jianzhong Qi, Jey Han Lau
We propose HGSUM, an MDS model that extends an encoder-decoder architecture to incorporate a heterogeneous graph representing different semantic units (e.g., words and sentences) of the documents.
1 code implementation • 12 May 2022 • Junjia Liu, Yiting Chen, Zhipeng Dong, Shixiong Wang, Sylvain Calinon, Miao Li, Fei Chen
This letter describes an approach to achieving the well-known Chinese cooking art of stir-fry on a bimanual robot system.
1 code implementation • 3 Mar 2022 • Miao Li, Jianzhong Qi, Jey Han Lau
We present PeerSum, a new MDS dataset using peer reviews of scientific publications.
no code implementations • 9 Nov 2021 • Xutian Deng, Ziwei Lei, Yi Wang, Miao Li
Finally, the robustness of the proposed framework is validated through experiments on real data from sonographers.
no code implementations • 2 Nov 2021 • Xutian Deng, Yiting Chen, Fei Chen, Miao Li
Medical ultrasound has become a routine examination approach and is widely adopted for different medical applications, so it is desirable to have a robotic ultrasound system that performs the ultrasound scanning autonomously.
1 code implementation • 1 Jun 2021 • Boyang Li, Chao Xiao, Longguang Wang, Yingqian Wang, Zaiping Lin, Miao Li, Wei An, Yulan Guo
With the repeated interaction in DNIM, infrared small targets in deep layers can be maintained.
no code implementations • CVPR 2021 • Yu Wang, Rui Zhang, Shuo Zhang, Miao Li, Yangyang Xia, Xishan Zhang, Shaoli Liu
The directions of the weights and of the gradients can be divided into domain-specific and domain-invariant parts, and the goal of domain adaptation is to concentrate on the domain-invariant direction while eliminating the disturbance from the domain-specific one.
1 code implementation • The VLDB Journal 2022 • Rui Zhang, Bayu Distiawan Trisedy, Miao Li, Yong Jiang, Jianzhong Qi
In the last few years, the interest in knowledge bases has grown exponentially in both the research community and the industry due to their essential role in AI applications.
1 code implementation • 5 Jul 2020 • Yong Lee, Shaohua Zhang, Miao Li, Xiaoyu He
Unwanted nonlinear gamma distortion frequently occurs in a great diversity of images during the procedures of image acquisition, processing, and/or display.
no code implementations • 16 Jun 2020 • Miao Li, Haoqi Xiong, Yunbo Cao
This paper introduces our group's work on the Dialog System Technology Challenge 8 (DSTC8): the SPPD system for the schema-guided dialogue state tracking challenge.
Dialogue State Tracking
Multi-domain Dialogue State Tracking
no code implementations • IJCNLP 2019 • Hongyin Tang, Miao Li, Beihong Jin
This model captures structural features by a sequential variational autoencoder component and leverages a topic modeling component based on Gaussian distribution to enhance the recognition of text semantics.
no code implementations • 27 Apr 2015 • Ji Wu, Miao Li, Chin-Hui Lee
A Song-On-Demand task, with a total of 38117 songs and 12 attributes corresponding to each song, is used to test the performance of the proposed approach.
Automatic Speech Recognition
Automatic Speech Recognition (ASR)