Search Results for author: Mingjie Li

Found 48 papers, 21 papers with code

Implicit Euler Skip Connections: Enhancing Adversarial Robustness via Numerical Stability

no code implementations ICML 2020 Mingjie Li, Lingshen He, Zhouchen Lin

By viewing ResNet as an explicit Euler discretization of an ordinary differential equation (ODE), for the first time, we find that the adversarial robustness of ResNet is connected to the numerical stability of the corresponding dynamic system.

Adversarial Attack Adversarial Robustness

Towards Interpretable Counterfactual Generation via Multimodal Autoregression

no code implementations29 Mar 2025 Chenglong Ma, Yuanfeng Ji, Jin Ye, Lu Zhang, Ying Chen, Tianbin Li, Mingjie Li, Junjun He, Hongming Shan

We further introduce ProgEmu, an autoregressive model that unifies the generation of counterfactual images and textual interpretations.

counterfactual Decision Making +2

SaLoRA: Safety-Alignment Preserved Low-Rank Adaptation

no code implementations3 Jan 2025 Mingjie Li, Wai Man Si, Michael Backes, Yang Zhang, Yisen Wang

As advancements in large language models (LLMs) continue and the demand for personalized models increases, parameter-efficient fine-tuning (PEFT) methods (e. g., LoRA) will become essential due to their efficiency in reducing computation costs.

parameter-efficient fine-tuning Safety Alignment

HC-LLM: Historical-Constrained Large Language Models for Radiology Report Generation

1 code implementation15 Dec 2024 Tengfei Liu, Jiapu Wang, Yongli Hu, Mingjie Li, Junfei Yi, Xiaojun Chang, Junbin Gao, BaoCai Yin

Specifically, our approach extracts both time-shared and time-specific features from longitudinal chest X-rays and diagnostic reports to capture disease progression.

Diagnostic In-Context Learning

MADE: Graph Backdoor Defense with Masked Unlearning

no code implementations26 Nov 2024 Xiao Lin, Mingjie Li, Yisen Wang

Graph Neural Networks (GNNs) have garnered significant attention from researchers due to their outstanding performance in handling graph-related tasks, such as social network analysis, protein design, and so on.

backdoor defense Drug Discovery +2

Artificial Intelligence-Enhanced Couinaud Segmentation for Precision Liver Cancer Therapy

no code implementations5 Nov 2024 Liang Qiu, Wenhao Chi, Xiaohan Xing, Praveenbalaji Rajendran, Mingjie Li, Yuming Jiang, Oscar Pastor-Serrano, Sen yang, Xiyue Wang, Yuanfeng Ji, Qiang Wen

Precision therapy for liver cancer necessitates accurately delineating liver sub-regions to protect healthy tissue while targeting tumors, which is essential for reducing recurrence and improving survival rates.

Data Augmentation Segmentation

On the Adversarial Transferability of Generalized "Skip Connections"

1 code implementation11 Oct 2024 Yisen Wang, Yichuan Mo, Dongxian Wu, Mingjie Li, Xingjun Ma, Zhouchen Lin

Specifically, in ResNet-like models (with skip connections), we find that using more gradients from the skip connections rather than the residual modules according to a decay factor during backpropagation allows one to craft adversarial examples with high transferability.

Neural Architecture Search

TERD: A Unified Framework for Safeguarding Diffusion Models Against Backdoors

1 code implementation9 Sep 2024 Yichuan Mo, Hui Huang, Mingjie Li, Ang Li, Yisen Wang

Additionally, with the reversed trigger, we propose backdoor detection from the noise space, introducing the first backdoor input detection approach for diffusion models and a novel model detection algorithm that calculates the KL divergence between reversed and benign distributions.

backdoor defense Image Generation

Medical Report Generation Is A Multi-label Classification Problem

no code implementations30 Aug 2024 Yijian Fan, Zhenbang Yang, Rui Liu, Mingjie Li, Xiaojun Chang

However, in this paper, we propose a novel perspective: rethinking medical report generation as a multi-label classification problem.

Medical Report Generation Multi-Label Classification +1

Image Super-Resolution with Taylor Expansion Approximation and Large Field Reception

no code implementations1 Aug 2024 Jiancong Feng, Yuan-Gen Wang, Mingjie Li, Fengchuang Xing

This softmax makes the matrix multiplication between Query and Key inseparable, posing a great challenge in simplifying computational complexity.

Blind Super-Resolution Image Super-Resolution

Contrastive Learning with Counterfactual Explanations for Radiology Report Generation

no code implementations19 Jul 2024 Mingjie Li, Haokun Lin, Liang Qiu, Xiaodan Liang, Ling Chen, Abdulmotaleb Elsaddik, Xiaojun Chang

By leveraging this concept, CoFE can learn non-spurious visual representations by contrasting the representations between factual and counterfactual images.

Anatomy Contrastive Learning +5

CLIPVQA:Video Quality Assessment via CLIP

1 code implementation6 Jul 2024 Fengchuang Xing, Mingjie Li, Yuan-Gen Wang, Guopu Zhu, Xiaochun Cao

To utilize the quality language descriptions of videos for supervision, we develop a CLIP-based encoder for language embedding, which is then fully aggregated with the generated content information via a cross-attention module for producing video-language representation.

Video Quality Assessment Visual Question Answering (VQA)

PID: Prompt-Independent Data Protection Against Latent Diffusion Models

1 code implementation14 Jun 2024 Ang Li, Yichuan Mo, Mingjie Li, Yisen Wang

Drawing on these insights, we propose a simple yet effective method called \textbf{Prompt-Independent Defense (PID)} to safeguard privacy against LDMs.

Teaching with Uncertainty: Unleashing the Potential of Knowledge Distillation in Object Detection

no code implementations11 Jun 2024 Junfei Yi, Jianxu Mao, Tengfei Liu, Mingjie Li, Hanyu Gu, HUI ZHANG, Xiaojun Chang, Yaonan Wang

In this paper, we propose a novel feature-based distillation paradigm with knowledge uncertainty for object detection, termed "Uncertainty Estimation-Discriminative Knowledge Extraction-Knowledge Transfer (UET)", which can seamlessly integrate with existing distillation methods.

Knowledge Distillation object-detection +2

Predicting Genetic Mutation from Whole Slide Images via Biomedical-Linguistic Knowledge Enhanced Multi-label Classification

1 code implementation5 Jun 2024 Gexin Huang, Chenfei Wu, Mingjie Li, Xiaojun Chang, Ling Chen, Ying Sun, Shen Zhao, Xiaodan Liang, Liang Lin

(b) A knowledge association module that fuses linguistic and biomedical knowledge into gene priors by transformer-based graph representation learning, capturing the intrinsic relationships between different genes' mutations.

Binary Classification Graph Representation Learning +3

Black-box Adversarial Attacks Against Image Quality Assessment Models

no code implementations27 Feb 2024 Yu Ran, Ao-Xiang Zhang, Mingjie Li, Weixuan Tang, Yuan-Gen Wang

Specifically, we first formulate the attack problem as maximizing the deviation between the estimated quality scores of original and perturbed images, while restricting the perturbed image distortions for visual quality preservation.

NR-IQA

EMBRE: Entity-aware Masking for Biomedical Relation Extraction

no code implementations15 Jan 2024 Mingjie Li, Karin Verspoor

Information extraction techniques, including named entity recognition (NER) and relation extraction (RE), are crucial in many domains to support making sense of vast amounts of unstructured text data by identifying and connecting relevant information.

named-entity-recognition Named Entity Recognition +3

Mask Propagation for Efficient Video Semantic Segmentation

1 code implementation NeurIPS 2023 Yuetian Weng, Mingfei Han, Haoyu He, Mingjie Li, Lina Yao, Xiaojun Chang, Bohan Zhuang

By reusing predictions from key frames, we circumvent the need to process a large volume of video frames individually with resource-intensive segmentors, alleviating temporal redundancy and significantly reducing computational costs.

Semantic Segmentation Video Semantic Segmentation

Explaining How a Neural Network Play the Go Game and Let People Learn

no code implementations15 Oct 2023 Huilin Zhou, Huijie Tang, Mingjie Li, Hao Zhang, Zhenyu Liu, Quanshi Zhang

The AI model has surpassed human players in the game of Go, and it is widely believed that the AI model has encoded new knowledge about the Go game beyond human players.

Game of Go

Technical Note: Defining and Quantifying AND-OR Interactions for Faithful and Concise Explanation of DNNs

1 code implementation26 Apr 2023 Mingjie Li, Quanshi Zhang

For faithfulness, we prove the uniqueness of the AND (OR) interaction in quantifying the effect of the AND (OR) relationship between input variables.

Can the Inference Logic of Large Language Models be Disentangled into Symbolic Concepts?

no code implementations3 Apr 2023 Wen Shen, Lei Cheng, Yuxiao Yang, Mingjie Li, Quanshi Zhang

In this paper, we explain the inference logic of large language models (LLMs) as a set of symbolic concepts.

Sentence

Dynamic Graph Enhanced Contrastive Learning for Chest X-ray Report Generation

1 code implementation CVPR 2023 Mingjie Li, Bingqian Lin, Zicong Chen, Haokun Lin, Xiaodan Liang, Xiaojun Chang

To address the limitation, we propose a knowledge graph with Dynamic structure and nodes to facilitate medical report generation with Contrastive Learning, named DCL.

Contrastive Learning Decoder +3

Does a Neural Network Really Encode Symbolic Concepts?

1 code implementation25 Feb 2023 Mingjie Li, Quanshi Zhang

Recently, a series of studies have tried to extract interactions between input variables modeled by a DNN and define such interactions as concepts encoded by the DNN.

Mitigating Data Redundancy to Revitalize Transformer-based Long-Term Time Series Forecasting System

2 code implementations16 Jul 2022 Mingjie Li, Rui Liu, Guangsi Shi, Mingfei Han, Changling Li, Lina Yao, Xiaojun Chang, Ling Chen

This curriculum-driven noise introduction aids the memory-driven decoder by supplying more diverse and representative training data, enhancing the decoder's ability to model seasonal tendencies and dependencies in the time-series data.

Data Augmentation Decoder +2

Cross-modal Clinical Graph Transformer for Ophthalmic Report Generation

no code implementations CVPR 2022 Mingjie Li, Wenjia Cai, Karin Verspoor, Shirui Pan, Xiaodan Liang, Xiaojun Chang

To endow models with the capability of incorporating expert knowledge, we propose a Cross-modal clinical Graph Transformer (CGT) for ophthalmic report generation (ORG), in which clinical relation triples are injected into the visual features as prior knowledge to drive the decoding procedure.

Clinical Knowledge Decoder +1

SymNMF-Net for The Symmetric NMF Problem

no code implementations26 May 2022 Mingjie Li, Hao Kong, Zhouchen Lin

Furthermore, we analyze the constraints of the inversion layer to ensure the output stability of the network to a certain extent.

Clustering

Towards Theoretical Analysis of Transformation Complexity of ReLU DNNs

1 code implementation4 May 2022 Jie Ren, Mingjie Li, Meng Zhou, Shih-Han Chan, Quanshi Zhang

Based on the proposed metrics, we analyze two typical phenomena of the change of the transformation complexity during the training process, and explore the ceiling of a DNN's complexity.

Adversarial Robustness Disentanglement

Visualizing the Emergence of Intermediate Visual Patterns in DNNs

no code implementations NeurIPS 2021 Mingjie Li, Shaobo Wang, Quanshi Zhang

This paper proposes a method to visualize the discrimination power of intermediate-layer visual patterns encoded by a DNN.

Knowledge Distillation

Generating Watermarked Adversarial Texts

no code implementations25 Oct 2021 Mingjie Li, Hanzhou Wu, Xinpeng Zhang

Adversarial example generation has been a hot spot in recent years because it can cause deep neural networks (DNNs) to misclassify the generated adversarial examples, which reveals the vulnerability of DNNs, motivating us to find good solutions to improve the robustness of DNN models.

Adversarial Text Text Generation

Optimization inspired Multi-Branch Equilibrium Models

no code implementations ICLR 2022 Mingjie Li, Yisen Wang, Xingyu Xie, Zhouchen Lin

Works have shown the strong connections between some implicit models and optimization problems.

Towards Understanding and Improving Dropout in Game Theory

no code implementations ICLR 2021 Hao Zhang, Sen Li, Yinchao Ma, Mingjie Li, Yichen Xie, Quanshi Zhang

Experimental results on various DNNs and datasets have shown that the interaction loss can effectively improve the utility of dropout and boost the performance of DNNs.

Understanding, Analyzing, and Optimizing the Complexity of Deep Models

no code implementations1 Jan 2021 Jie Ren, Mingjie Li, Meng Zhou, Shih-Han Chan, Zexu Liu, Quanshi Zhang

Based on the proposed metrics, we analyze two typical phenomena of the change of the transformation complexity during the training process, and explore the ceiling of a DNN’s complexity.

Disentanglement

Interpreting and Boosting Dropout from a Game-Theoretic View

no code implementations24 Sep 2020 Hao Zhang, Sen Li, Yinchao Ma, Mingjie Li, Yichen Xie, Quanshi Zhang

This paper aims to understand and improve the utility of the dropout operation from the perspective of game-theoretic interactions.

Interpreting and Disentangling Feature Components of Various Complexity from DNNs

1 code implementation29 Jun 2020 Jie Ren, Mingjie Li, Zexu Liu, Quanshi Zhang

This paper aims to define, quantify, and analyze the feature complexity that is learned by a DNN.

Knowledge Distillation

Auxiliary Signal-Guided Knowledge Encoder-Decoder for Medical Report Generation

2 code implementations6 Jun 2020 Mingjie Li, Fuyu Wang, Xiaojun Chang, Xiaodan Liang

Firstly, the regions of primary interest to radiologists are usually located in a small area of the global image, meaning that the remainder parts of the image could be considered as irrelevant noise in the training procedure.

Decoder Image Captioning +2

An Action Recognition network for specific target based on rMC and RPN

no code implementations19 Jun 2019 Mingjie Li, Youqian Feng, Zhonghai Yin, Cheng Zhou, Fanghao Dong, Yu-an Lin, Yuhao Dong

Meanwhile, the action recognition network is tested in our gesture and body posture data sets for specific target.

Action Recognition regression

Impoved RPN for Single Targets Detection based on the Anchor Mask Net

no code implementations18 Jun 2019 Mingjie Li, Youqian Feng, Zhonghai Yin, Cheng Zhou, Fanghao Dong

Common target detection is usually based on single frame images, which is vulnerable to affected by the similar targets in the image and not applicable to video images.

Cannot find the paper you are looking for? You can Submit a new open access paper.