Search Results for author: Jiacheng Ruan

Found 20 papers, 16 papers with code

EIAD: Explainable Industrial Anomaly Detection Via Multi-Modal Large Language Models

no code implementations18 Mar 2025 Zongyun Zhang, Jiacheng Ruan, Xian Gao, Ting Liu, Yuzhuo Fu

Additionally, we contribute to the first multi-modal industrial anomaly detection training dataset, named Defect Detection Question Answering (DDQA), encompassing a wide range of defect types and industrial scenarios.

Anomaly Detection Defect Detection +1

ReviewAgents: Bridging the Gap Between Human and AI-Generated Paper Reviews

no code implementations11 Mar 2025 Xian Gao, Jiacheng Ruan, Jingsheng Gao, Ting Liu, Yuzhuo Fu

In this paper, we address this challenge by proposing ReviewAgents, a framework that leverages large language models (LLMs) to generate academic paper reviews.

Comment Generation

VLRMBench: A Comprehensive and Challenging Benchmark for Vision-Language Reward Models

1 code implementation10 Mar 2025 Jiacheng Ruan, Wenzhen Yuan, Xian Gao, Ye Guo, Daoxin Zhang, Zhe Xu, Yao Hu, Ting Liu, Yuzhuo Fu

Specifically, process RMs evaluate each reasoning step, outcome RMs focus on the assessment of reasoning results, and critique RMs perform error analysis on the entire reasoning process, followed by corrections.

Binary Classification Hallucination +1

FTII-Bench: A Comprehensive Multimodal Benchmark for Flow Text with Image Insertion

1 code implementation16 Oct 2024 Jiacheng Ruan, Yebin Yang, Zehao Lin, Yuchen Feng, Feiyu Xiong, Zeyun Tang, Zhiyu Li

Based on this, we introduce the Flow Text with Image Insertion Benchmark (FTII-Bench), which includes 318 high-quality Chinese image-text news articles and 307 high-quality English image-text news articles, covering 10 different news domains.

Articles Image Comprehension

Understanding Robustness of Parameter-Efficient Tuning for Image Classification

1 code implementation13 Oct 2024 Jiacheng Ruan, Xian Gao, Suncheng Xiang, Mingye Xie, Ting Liu, Yuzhuo Fu

Parameter-efficient tuning (PET) techniques calibrate the model's predictions on downstream tasks by freezing the pre-trained models and introducing a small number of learnable parameters.

image-classification Image Classification

MM-CamObj: A Comprehensive Multimodal Dataset for Camouflaged Object Scenarios

1 code implementation24 Sep 2024 Jiacheng Ruan, Wenzhen Yuan, Zehao Lin, Ning Liao, Zhiyu Li, Feiyu Xiong, Ting Liu, Yuzhuo Fu

CamObj-Instruct is collected for fine-tuning the LVLMs with improved instruction-following capabilities, and it includes 11, 363 images and 68, 849 conversations with diverse instructions.

Instruction Following

LLaMA-MoE: Building Mixture-of-Experts from LLaMA with Continual Pre-training

2 code implementations24 Jun 2024 Tong Zhu, Xiaoye Qu, Daize Dong, Jiacheng Ruan, Jingqi Tong, Conghui He, Yu Cheng

Motivated by this limit, we investigate building MoE models from existing dense large language models.

Mixture-of-Experts

Dynamic Data Mixing Maximizes Instruction Tuning for Mixture-of-Experts

1 code implementation17 Jun 2024 Tong Zhu, Daize Dong, Xiaoye Qu, Jiacheng Ruan, Wenliang Chen, Yu Cheng

Mixture-of-Experts (MoE) models have shown remarkable capability in instruction tuning, especially when the number of tasks scales.

Mixture-of-Experts

iDAT: inverse Distillation Adapter-Tuning

1 code implementation23 Mar 2024 Jiacheng Ruan, Jingsheng Gao, Mingye Xie, Daize Dong, Suncheng Xiang, Ting Liu, Yuzhuo Fu

Adapter-Tuning (AT) method involves freezing a pre-trained model and introducing trainable adapter modules to acquire downstream knowledge, thereby calibrating the model for better adaptation to downstream tasks.

image-classification Image Classification +1

VM-UNet: Vision Mamba UNet for Medical Image Segmentation

4 code implementations4 Feb 2024 Jiacheng Ruan, Jincheng Li, Suncheng Xiang

To our best knowledge, this is the first medical image segmentation model constructed based on the pure SSM-based model.

Image Segmentation Mamba +1

Oceanship: A Large-Scale Dataset for Underwater Audio Target Recognition

1 code implementation4 Jan 2024 Zeyu Li, Suncheng Xiang, Tong Yu, Jingsheng Gao, Jiacheng Ruan, Yanping Hu, Ting Liu, Yuzhuo Fu

While audio retrieval tasks are well-established in general audio classification, they have not been explored in the context of underwater audio recognition.

Attribute Audio Classification +3

Learning Multi-axis Representation in Frequency Domain for Medical Image Segmentation

1 code implementation28 Dec 2023 Jiacheng Ruan, Jingsheng Gao, Mingye Xie, Suncheng Xiang

Specifically, our block performs a Fourier transform on the three axes of the input features and assigns the external weight in the frequency domain, which is generated by our External Weights Generator.

Image Segmentation Medical Image Segmentation +1

LAMM: Label Alignment for Multi-Modal Prompt Learning

1 code implementation13 Dec 2023 Jingsheng Gao, Jiacheng Ruan, Suncheng Xiang, Zefang Yu, Ke Ji, Mingye Xie, Ting Liu, Yuzhuo Fu

We conduct experiments on 11 downstream vision datasets and demonstrate that our method significantly improves the performance of existing multi-modal prompt learning models in few-shot scenarios, exhibiting an average accuracy improvement of 2. 31(\%) compared to the state-of-the-art methods on 16 shots.

Continual Learning Prompt Learning

GIST: Improving Parameter Efficient Fine Tuning via Knowledge Interaction

1 code implementation12 Dec 2023 Jiacheng Ruan, Jingsheng Gao, Mingye Xie, Suncheng Xiang, Zefang Yu, Ting Liu, Yuzhuo Fu

2) They neglect the interaction between the intrinsic task-agnostic knowledge of pre-trained models and the task-specific knowledge in downstream tasks.

parameter-efficient fine-tuning

EGE-UNet: an Efficient Group Enhanced UNet for skin lesion segmentation

1 code implementation17 Jul 2023 Jiacheng Ruan, Mingye Xie, Jingsheng Gao, Ting Liu, Yuzhuo Fu

Moreover, to our best knowledge, this is the first model with a parameter count limited to just 50KB.

Decoder Image Segmentation +3

Learning Robust Visual-Semantic Embedding for Generalizable Person Re-identification

1 code implementation19 Apr 2023 Suncheng Xiang, Jingsheng Gao, Mengyuan Guan, Jiacheng Ruan, Chengfeng Zhou, Ting Liu, Dahong Qian, Yuzhuo Fu

In this paper, we propose a Multi-Modal Equivalent Transformer called MMET for more robust visual-semantic embedding learning on visual, textual and visual-textual tasks respectively.

Generalizable Person Re-identification Representation Learning

MALUNet: A Multi-Attention and Light-weight UNet for Skin Lesion Segmentation

1 code implementation3 Nov 2022 Jiacheng Ruan, Suncheng Xiang, Mingye Xie, Ting Liu, Yuzhuo Fu

To address this challenge, we propose a light-weight model to achieve competitive performances for skin lesion segmentation at the lowest cost of parameters and computational complexity so far.

Image Segmentation Lesion Segmentation +3

MEW-UNet: Multi-axis representation learning in frequency domain for medical image segmentation

1 code implementation25 Oct 2022 Jiacheng Ruan, Mingye Xie, Suncheng Xiang, Ting Liu, Yuzhuo Fu

Specifically, our block performs a Fourier transform on the three axes of the input feature and assigns the external weight in the frequency domain, which is generated by our Weights Generator.

Image Segmentation Medical Image Segmentation +2

Adaptive Generation Model: A New Ensemble Method

no code implementations14 Sep 2020 Jiacheng Ruan, Jiahao Li

As a common method in Machine Learning, Ensemble Method is used to train multiple models from a data set and obtain better results through certain combination strategies.

BIG-bench Machine Learning Ensemble Learning +1

Cannot find the paper you are looking for? You can Submit a new open access paper.