Search Results for author: Jindong Gu

Found 62 papers, 21 papers with code

Responsible Generative AI: What to Generate and What Not

no code implementations • 8 Apr 2024 • Jindong Gu

To answer this question, this paper investigates the practical responsible requirements of both textual and visual generative models, outlining five key considerations: generating truthful content, avoiding toxic content, refusing harmful instructions, leaking no training data-related content, and ensuring that generated content is identifiable.

Red Teaming GPT-4V: Are GPT-4V Safe Against Uni/Multi-Modal Jailbreak Attacks?

no code implementations • 4 Apr 2024 • Shuo Chen, Zhen Han, Bailan He, Zifeng Ding, Wenqian Yu, Philip Torr, Volker Tresp, Jindong Gu

Various jailbreak attacks have been proposed to red-team Large Language Models (LLMs) and have revealed vulnerabilities in the safeguards of LLMs.

Model-agnostic Origin Attribution of Generated Images with Few-shot Examples

no code implementations • 3 Apr 2024 • Fengyuan Liu, Haochen Luo, Yiming Li, Philip Torr, Jindong Gu

In this work, we study the origin attribution of generated images in a practical setting where only a few images generated by a source model are available and the source model cannot be accessed.

One-Class Classification

An Image Is Worth 1000 Lies: Adversarial Transferability across Prompts on Vision-Language Models

1 code implementation • 14 Mar 2024 • Haochen Luo, Jindong Gu, Fengyuan Liu, Philip Torr

Given that VLMs rely on prompts to adapt to different tasks, an intriguing question emerges: Can a single adversarial image mislead all predictions of VLMs when a thousand different prompts are given?
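
As a rough illustration of what such a cross-prompt attack optimizes, the sketch below runs PGD on a single perturbation whose loss is averaged over many prompts. The toy VLM, the random prompt embeddings, and all hyperparameters are illustrative stand-ins, not the paper's actual setup.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class ToyVLM(nn.Module):
    """Stand-in VLM: each prompt yields per-class text embeddings that act as
    classifier weights over the image features."""
    def __init__(self, dim=32):
        super().__init__()
        self.vision = nn.Sequential(nn.Flatten(), nn.Linear(3 * 8 * 8, dim))
    def forward(self, image, class_embs):
        return self.vision(image) @ class_embs.T          # (1, num_classes) logits

torch.manual_seed(0)
model = ToyVLM()
image = torch.rand(1, 3, 8, 8)
prompts = [torch.randn(4, 32) for _ in range(16)]         # 16 different prompt variants
target = torch.tensor([0])                                # attacker-chosen target class

delta = torch.zeros_like(image, requires_grad=True)
eps, alpha = 8 / 255, 1 / 255
for _ in range(40):                                       # PGD with loss averaged over prompts
    loss = torch.stack([F.cross_entropy(model(image + delta, p), target)
                        for p in prompts]).mean()
    loss.backward()
    with torch.no_grad():
        delta -= alpha * delta.grad.sign()                # descend: targeted attack
        delta.clamp_(-eps, eps)                           # keep the perturbation small
        delta.grad.zero_()
```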

Hide in Thicket: Generating Imperceptible and Rational Adversarial Perturbations on 3D Point Clouds

1 code implementation • 8 Mar 2024 • Tianrui Lou, Xiaojun Jia, Jindong Gu, Li Liu, Siyuan Liang, Bangyan He, Xiaochun Cao

We find that concealing deformation perturbations in areas insensitive to human eyes can achieve a better trade-off between imperceptibility and adversarial strength, specifically in parts of the object surface that are complex and exhibit drastic curvature changes.

3D Point Cloud Classification · Adversarial Attack +1

Unveiling Typographic Deceptions: Insights of the Typographic Vulnerability in Large Vision-Language Model

no code implementations • 29 Feb 2024 • Hao Cheng, Erjia Xiao, Jindong Gu, Le Yang, Jinhao Duan, Jize Zhang, Jiahang Cao, Kaidi Xu, Renjing Xu

Large Vision-Language Models (LVLMs) rely on vision encoders and Large Language Models (LLMs) to exhibit remarkable capabilities on various multi-modal tasks in the joint space of vision and language.

Language Modelling · Object Recognition +1

Stop Reasoning! When Multimodal LLMs with Chain-of-Thought Reasoning Meets Adversarial Images

no code implementations • 22 Feb 2024 • Zefeng Wang, Zhen Han, Shuo Chen, Fan Xue, Zifeng Ding, Xun Xiao, Volker Tresp, Philip Torr, Jindong Gu

Our research evaluates the adversarial robustness of MLLMs when employing CoT reasoning, finding that CoT marginally improves adversarial robustness against existing attack methods.

Adversarial Robustness

Inducing High Energy-Latency of Large Vision-Language Models with Verbose Images

1 code implementation • 20 Jan 2024 • Kuofeng Gao, Yang Bai, Jindong Gu, Shu-Tao Xia, Philip Torr, Zhifeng Li, Wei Liu

Once attackers maliciously induce high energy consumption and latency time (energy-latency cost) during inference of VLMs, it will exhaust computational resources.
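
A minimal sketch of the energy-latency objective: optimize a bounded image perturbation that suppresses the end-of-sequence token, so generation runs longer. The one-step toy decoder below is a stand-in; the paper's verbose-images attack targets a real VLM's full autoregressive decoding.

```python
import torch

# Toy one-step "decoder": vocabulary of 10 tokens, EOS id 0.
torch.manual_seed(0)
W = torch.randn(10, 48)
def decoder_logits(image):
    return W @ image.flatten()                 # (10,) next-token logits

image = torch.rand(3, 4, 4)
delta = torch.zeros_like(image, requires_grad=True)
for _ in range(100):
    eos_prob = torch.softmax(decoder_logits(image + delta), dim=0)[0]
    eos_prob.backward()                        # lower EOS probability => longer outputs
    with torch.no_grad():
        delta -= 0.01 * delta.grad.sign()
        delta.clamp_(-8 / 255, 8 / 255)        # keep the perturbation bounded
        delta.grad.zero_()
```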

Does Few-shot Learning Suffer from Backdoor Attacks?

no code implementations • 31 Dec 2023 • Xinwei Liu, Xiaojun Jia, Jindong Gu, Yuan Xun, Siyuan Liang, Xiaochun Cao

However, in this paper, we propose the Few-shot Learning Backdoor Attack (FLBA) to show that FSL can still be vulnerable to backdoor attacks.

Backdoor Attack · Few-Shot Learning

XAI for In-hospital Mortality Prediction via Multimodal ICU Data

1 code implementation • 29 Dec 2023 • Xingqiao Li, Jindong Gu, Zhiyong Wang, Yancheng Yuan, Bo Du, Fengxiang He

To address this issue, this paper proposes an eXplainable Multimodal Mortality Predictor (X-MMP), an efficient, explainable AI solution for predicting in-hospital mortality from multimodal ICU data.

Decision Making · Mortality Prediction

Initialization Matters for Adversarial Transfer Learning

1 code implementation • 10 Dec 2023 • Andong Hua, Jindong Gu, Zhiyu Xue, Nicholas Carlini, Eric Wong, Yao Qin

Based on this, we propose Robust Linear Initialization (RoLI) for adversarial finetuning, which initializes the linear head with the weights obtained by adversarial linear probing to maximally inherit the robustness from pretraining.

Adversarial Robustness · Image Classification +1
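
The two-stage recipe described above can be sketched as follows; the backbone, data, and single-step attack are toy stand-ins, and only the staging (adversarial linear probing, then full adversarial finetuning from the robust head) follows the paper.

```python
import torch
import torch.nn as nn

torch.manual_seed(0)
backbone = nn.Linear(16, 8)        # stand-in for an adversarially pretrained encoder
head = nn.Linear(8, 2)
x, y = torch.rand(32, 16), torch.randint(0, 2, (32,))

def fgsm(x, y, model, eps=0.03):   # single-step attack used during training
    x = x.clone().requires_grad_(True)
    nn.functional.cross_entropy(model(x), y).backward()
    return (x + eps * x.grad.sign()).detach()

# Stage 1: adversarial linear probing (backbone frozen, only the head learns).
probe = nn.Sequential(backbone.requires_grad_(False), head)
opt = torch.optim.SGD(head.parameters(), lr=0.1)
for _ in range(50):
    x_adv = fgsm(x, y, probe)
    opt.zero_grad()
    nn.functional.cross_entropy(probe(x_adv), y).backward()
    opt.step()

# Stage 2: full adversarial finetuning, initialized from the robust head (RoLI).
backbone.requires_grad_(True)
opt = torch.optim.SGD(probe.parameters(), lr=0.01)
for _ in range(50):
    x_adv = fgsm(x, y, probe)
    opt.zero_grad()
    nn.functional.cross_entropy(probe(x_adv), y).backward()
    opt.step()
```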

OT-Attack: Enhancing Adversarial Transferability of Vision-Language Models via Optimal Transport Optimization

no code implementations • 7 Dec 2023 • Dongchen Han, Xiaojun Jia, Yang Bai, Jindong Gu, Yang Liu, Xiaochun Cao

Investigating the generation of high-transferability adversarial examples is crucial for uncovering VLP models' vulnerabilities in practical scenarios.

Adversarial Attack · Data Augmentation +2

TranSegPGD: Improving Transferability of Adversarial Examples on Semantic Segmentation

no code implementations • 3 Dec 2023 • Xiaojun Jia, Jindong Gu, Yihao Huang, Simeng Qin, Qing Guo, Yang Liu, Xiaochun Cao

In the second stage, pixels are divided into different branches according to their transferability, which is measured by the Kullback-Leibler divergence.

Adversarial Attack · Image Classification +2
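
As a hedged sketch of the stated criterion, the snippet below computes a per-pixel Kullback-Leibler divergence between two models' predictive distributions and thresholds it to split pixels into two branches; the paper's full two-stage scheme is more involved.

```python
import torch
import torch.nn.functional as F

torch.manual_seed(0)
logits_a = torch.randn(1, 5, 8, 8)        # source model's per-pixel logits (toy)
logits_b = torch.randn(1, 5, 8, 8)        # second model's per-pixel logits (toy)
log_p = F.log_softmax(logits_a, dim=1)
q = F.softmax(logits_b, dim=1)
kl = (q * (q.log() - log_p)).sum(dim=1)   # per-pixel KL(q || p), shape (1, 8, 8)
high_branch = kl > kl.median()            # split pixels into two branches by KL
```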

Improving Adversarial Transferability via Model Alignment

no code implementations • 30 Nov 2023 • Avery Ma, Amir-Massoud Farahmand, Yangchen Pan, Philip Torr, Jindong Gu

During the alignment process, the parameters of the source model are fine-tuned to minimize an alignment loss.
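
A minimal sketch of such an alignment step, assuming a KL divergence between the source model's and a witness model's predictive distributions as the alignment loss (the paper's exact objective may differ):

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

torch.manual_seed(0)
source = nn.Linear(16, 4)    # toy source model whose parameters get fine-tuned
witness = nn.Linear(16, 4)   # a second, independently trained witness model
x = torch.rand(64, 16)

opt = torch.optim.Adam(source.parameters(), lr=1e-2)
for _ in range(100):
    log_p = F.log_softmax(source(x), dim=1)
    q = F.softmax(witness(x), dim=1).detach()
    loss = F.kl_div(log_p, q, reduction="batchmean")  # alignment loss
    opt.zero_grad(); loss.backward(); opt.step()
```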

MM-SafetyBench: A Benchmark for Safety Evaluation of Multimodal Large Language Models

1 code implementation • 29 Nov 2023 • Xin Liu, Yichen Zhu, Jindong Gu, Yunshi Lan, Chao Yang, Yu Qiao

The security concerns surrounding Large Language Models (LLMs) have been extensively explored, yet the safety of Multimodal Large Language Models (MLLMs) remains understudied.

Understanding and Improving In-Context Learning on Vision-language Models

no code implementations • 29 Nov 2023 • Shuo Chen, Zhen Han, Bailan He, Mark Buckley, Philip Torr, Volker Tresp, Jindong Gu

Our findings indicate that ICL in VLMs is predominantly driven by the textual information in the demonstrations, whereas the visual information in the demonstrations barely affects ICL performance.

In-Context Learning

Benchmarking Robustness of Text-Image Composed Retrieval

no code implementations • 24 Nov 2023 • Shitong Sun, Jindong Gu, Shaogang Gong

In this paper, we perform the first robustness study and establish three new diversified benchmarks for systematic analysis of text-image composed retrieval against natural corruptions in both vision and text, and further probe textual understanding.

Attribute · Benchmarking +1

SPOT! Revisiting Video-Language Models for Event Understanding

no code implementations • 21 Nov 2023 • Gengyuan Zhang, Jinhe Bi, Jindong Gu, Yanyu Chen, Volker Tresp

This raises a question: with such weak supervision, can video representations in video-language models gain the ability to distinguish even factual discrepancies in textual descriptions and understand fine-grained events?

Attribute · Video Understanding

Fast Propagation is Better: Accelerating Single-Step Adversarial Training via Sampling Subnetworks

1 code implementation • 24 Oct 2023 • Xiaojun Jia, Jianshu Li, Jindong Gu, Yang Bai, Xiaochun Cao

In addition, we provide a theoretical analysis showing that model robustness can be improved by single-step adversarial training with sampled subnetworks.
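
The sampling idea can be sketched as follows: the single-step attack is generated with a randomly sampled subnetwork (here, randomly skipped blocks), while the full network trains on the resulting adversarial examples. The sampling strategy shown is illustrative, not the paper's.

```python
import random
import torch
import torch.nn as nn
import torch.nn.functional as F

torch.manual_seed(0)
blocks = nn.ModuleList([nn.Sequential(nn.Linear(16, 16), nn.ReLU()) for _ in range(4)])
head = nn.Linear(16, 2)

def forward(x, keep=None):
    for i, b in enumerate(blocks):
        if keep is None or i in keep:
            x = b(x)                       # skipped blocks act as identity
    return head(x)

x, y = torch.rand(32, 16), torch.randint(0, 2, (32,))
opt = torch.optim.SGD(list(blocks.parameters()) + list(head.parameters()), lr=0.05)
for _ in range(30):
    keep = set(random.sample(range(4), 2))                # sampled subnetwork (cheaper)
    x_req = x.clone().requires_grad_(True)
    F.cross_entropy(forward(x_req, keep), y).backward()
    x_adv = (x_req + 0.03 * x_req.grad.sign()).detach()   # single-step attack
    opt.zero_grad()
    F.cross_entropy(forward(x_adv), y).backward()         # full network trains on adv data
    opt.step()
```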

Boosting Fair Classifier Generalization through Adaptive Priority Reweighing

1 code implementation • 15 Sep 2023 • Zhihao Hu, Yiran Xu, Mengnan Du, Jindong Gu, Xinmei Tian, Fengxiang He

Our adaptive reweighing method prioritizes samples closer to the decision boundary and assigns a higher weight to improve the generalizability of fair classifiers.

Decision Making · Fairness
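
A toy rendering of the reweighing loop: after each round, samples with a small predictive margin (a crude proxy for closeness to the decision boundary) receive larger weights. The exact priority function in the paper differs.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

torch.manual_seed(0)
model = nn.Linear(8, 2)
x, y = torch.rand(128, 8), torch.randint(0, 2, (128,))
opt = torch.optim.SGD(model.parameters(), lr=0.1)

weights = torch.ones(128)
for _ in range(20):
    logits = model(x)
    loss = (weights * F.cross_entropy(logits, y, reduction="none")).mean()
    opt.zero_grad(); loss.backward(); opt.step()
    with torch.no_grad():  # re-prioritize: low-margin samples get higher weight
        margin = F.softmax(logits, dim=1).max(dim=1).values - 0.5
        weights = 1.0 / (margin + 0.1)
        weights = weights / weights.mean()   # keep the average weight at 1
```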

Exploring Non-additive Randomness on ViT against Query-Based Black-Box Attacks

no code implementations • 12 Sep 2023 • Jindong Gu, Fangyun Wei, Philip Torr, Han Hu

In this work, we first taxonomize the stochastic defense strategies against query-based black-box attacks (QBBA).

Multi-event Video-Text Retrieval

1 code implementation • ICCV 2023 • Gengyuan Zhang, Jisen Ren, Jindong Gu, Volker Tresp

In this study, we introduce the Multi-event Video-Text Retrieval (MeVTR) task, addressing scenarios in which each video contains multiple different events, as a niche scenario of the conventional Video-Text Retrieval Task.

Language Modelling · Retrieval +2

FedDAT: An Approach for Foundation Model Finetuning in Multi-Modal Heterogeneous Federated Learning

no code implementations • 21 Aug 2023 • Haokun Chen, Yao Zhang, Denis Krompass, Jindong Gu, Volker Tresp

FedDAT is the first approach that enables an efficient distributed finetuning of foundation models for a variety of heterogeneous Vision-Language tasks.

Federated Learning · Knowledge Distillation +1

FedPop: Federated Population-based Hyperparameter Tuning

no code implementations • 16 Aug 2023 • Haokun Chen, Denis Krompass, Jindong Gu, Volker Tresp

Similar to conventional ML pipelines, the client local optimization and server aggregation procedures in FL are sensitive to hyperparameter (HP) selection.

Computational Efficiency · Evolutionary Algorithms +1

A Systematic Survey of Prompt Engineering on Vision-Language Foundation Models

1 code implementation • 24 Jul 2023 • Jindong Gu, Zhen Han, Shuo Chen, Ahmad Beirami, Bailan He, Gengyuan Zhang, Ruotong Liao, Yao Qin, Volker Tresp, Philip Torr

This paper aims to provide a comprehensive survey of cutting-edge research in prompt engineering on three types of vision-language models: multimodal-to-text generation models (e.g., Flamingo), image-text matching models (e.g., CLIP), and text-to-image generation models (e.g., Stable Diffusion).

Image-text matching · Language Modelling +4

Reliable Evaluation of Adversarial Transferability

no code implementations • 14 Jun 2023 • Wenqian Yu, Jindong Gu, Zhijiang Li, Philip Torr

Adversarial examples (AEs) with small adversarial perturbations can mislead deep neural networks (DNNs) into wrong predictions.
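
In code, the textbook construction of such an AE is a one-step FGSM perturbation that follows the sign of the input gradient:

```python
import torch
import torch.nn as nn

torch.manual_seed(0)
model = nn.Linear(10, 3)                 # stand-in for a trained DNN
x = torch.rand(1, 10, requires_grad=True)
y = torch.tensor([2])
loss = nn.functional.cross_entropy(model(x), y)
loss.backward()
x_adv = x + 0.03 * x.grad.sign()         # small eps keeps the change imperceptible
```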

Towards Robust Prompts on Vision-Language Models

no code implementations • 17 Apr 2023 • Jindong Gu, Ahmad Beirami, Xuezhi Wang, Alex Beutel, Philip Torr, Yao Qin

With the advent of vision-language models (VLMs) that can perform in-context and prompt-based learning, how can we design prompting approaches that robustly generalize to distribution shift and can be used on novel classes outside the support set of the prompts?

In-Context Learning

Backdoor Defense via Adaptively Splitting Poisoned Dataset

1 code implementation • CVPR 2023 • Kuofeng Gao, Yang Bai, Jindong Gu, Yong Yang, Shu-Tao Xia

With the split clean data pool and polluted data pool, ASD successfully defends against backdoor attacks during training.

Backdoor Defense
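
A much-simplified sketch of the splitting step: rank samples by loss under the current model and treat the low-loss fraction as the clean, supervised pool. ASD's actual adaptive criteria are more elaborate.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

torch.manual_seed(0)
model = nn.Linear(20, 5)
x, y = torch.rand(200, 20), torch.randint(0, 5, (200,))

with torch.no_grad():
    losses = F.cross_entropy(model(x), y, reduction="none")
k = int(0.5 * len(x))                          # keep the 50% lowest-loss samples
order = losses.argsort()
clean_x, clean_y = x[order[:k]], y[order[:k]]  # supervised clean pool
polluted_x = x[order[k:]]                      # polluted pool: used without labels in ASD
```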

Influencer Backdoor Attack on Semantic Segmentation

1 code implementation • 21 Mar 2023 • Haoheng Lan, Jindong Gu, Philip Torr, Hengshuang Zhao

In this work, we explore backdoor attacks on segmentation models that misclassify all pixels of a victim class by injecting a specific trigger on non-victim pixels during inference, which we dub the Influencer Backdoor Attack (IBA).

Backdoor Attack · Position +2

Explainability and Robustness of Deep Visual Classification Models

no code implementations • 3 Jan 2023 • Jindong Gu

The vulnerability of deep neural networks poses challenges to current visual classification models.

Classification · Image Classification +1

Do DALL-E and Flamingo Understand Each Other?

no code implementations • ICCV 2023 • Hang Li, Jindong Gu, Rajat Koner, Sahand Sharifzadeh, Volker Tresp

To study this question, we propose a reconstruction task where Flamingo generates a description for a given image and DALL-E uses this description as input to synthesize a new image.

Image Captioning · Image Reconstruction +3

SegPGD: An Effective and Efficient Adversarial Attack for Evaluating and Boosting Segmentation Robustness

1 code implementation • 25 Jul 2022 • Jindong Gu, Hengshuang Zhao, Volker Tresp, Philip Torr

Since SegPGD can create more effective adversarial examples, the adversarial training with our SegPGD can boost the robustness of segmentation models.

Adversarial Attack · Segmentation +1
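
The per-step SegPGD loss, as described in the paper, reweights correctly and wrongly classified pixels with a schedule λ_t = (t − 1)/(2T); a sketch on toy tensors (details may differ from the reference implementation):

```python
import torch
import torch.nn.functional as F

def segpgd_loss(logits, labels, t, T):
    """Per-step SegPGD loss. logits: (B, C, H, W), labels: (B, H, W)."""
    lam = (t - 1) / (2 * T)                                # linear schedule
    ce = F.cross_entropy(logits, labels, reduction="none") # per-pixel CE, (B, H, W)
    correct = logits.argmax(dim=1) == labels
    # Early steps emphasize still-correct pixels; later steps also push wrong ones.
    return ((1 - lam) * ce[correct].sum() + lam * ce[~correct].sum()) / labels.numel()

logits = torch.randn(2, 5, 8, 8, requires_grad=True)       # toy segmentation output
labels = torch.randint(0, 5, (2, 8, 8))
segpgd_loss(logits, labels, t=1, T=10).backward()          # gradient drives the PGD step
```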

Towards Efficient Adversarial Training on Vision Transformers

no code implementations • 21 Jul 2022 • Boxi Wu, Jindong Gu, Zhifeng Li, Deng Cai, Xiaofei He, Wei Liu

Vision Transformer (ViT), as a powerful alternative to Convolutional Neural Network (CNN), has received much attention.

Watermark Vaccine: Adversarial Attacks to Prevent Watermark Removal

1 code implementation • 17 Jul 2022 • Xinwei Liu, Jian Liu, Yang Bai, Jindong Gu, Tao Chen, Xiaojun Jia, Xiaochun Cao

Inspired by the vulnerability of DNNs to adversarial perturbations, we propose a novel defence mechanism by adversarial machine learning for good.

FRAug: Tackling Federated Learning with Non-IID Features via Representation Augmentation

no code implementations • ICCV 2023 • Haokun Chen, Ahmed Frikha, Denis Krompass, Jindong Gu, Volker Tresp

Real-world applications usually involve a distribution shift across the datasets of the different clients, which hurts the generalization ability of the clients to unseen samples from their respective data distributions.

Federated Learning

ECOLA: Enhanced Temporal Knowledge Embeddings with Contextualized Language Representations

no code implementations • 17 Mar 2022 • Zhen Han, Ruotong Liao, Jindong Gu, Yao Zhang, Zifeng Ding, Yujia Gu, Heinz Köppl, Hinrich Schütze, Volker Tresp

Since conventional knowledge embedding models cannot take full advantage of the abundant textual information, there have been extensive research efforts in enhancing knowledge embedding using texts.

Knowledge Graph Embedding · Link Prediction +1

Adversarial Examples on Segmentation Models Can be Easy to Transfer

no code implementations • 22 Nov 2021 • Jindong Gu, Hengshuang Zhao, Volker Tresp, Philip Torr

The high transferability achieved by our method shows that, in contrast to the observations in previous work, adversarial examples on a segmentation model can be easy to transfer to other segmentation models.

Adversarial Robustness · Attribute +5

Are Vision Transformers Robust to Patch Perturbations?

no code implementations • 20 Nov 2021 • Jindong Gu, Volker Tresp, Yao Qin

However, when ViTs are attacked by an adversary, the attention mechanism can be easily fooled to focus more on the adversarially perturbed patches and cause a mistake.

Image Classification

Are Vision Transformers Robust to Patch-wise Perturbations?

no code implementations • 29 Sep 2021 • Jindong Gu, Volker Tresp, Yao Qin

Based on extensive qualitative and quantitative experiments, we discover that ViT's stronger robustness to natural corrupted patches and higher vulnerability against adversarial patches are both caused by the attention mechanism.

Image Classification

Simple Distillation Baselines for Improving Small Self-supervised Models

1 code implementation • 21 Jun 2021 • Jindong Gu, Wei Liu, Yonglong Tian

While large self-supervised models have rivalled the performance of their supervised counterparts, small models still struggle.

Attacking Adversarial Attacks as A Defense

no code implementations • 9 Jun 2021 • Boxi Wu, Heng Pan, Li Shen, Jindong Gu, Shuai Zhao, Zhifeng Li, Deng Cai, Xiaofei He, Wei Liu

In this work, we find that the adversarial attacks can also be vulnerable to small perturbations.

Quantifying Predictive Uncertainty in Medical Image Analysis with Deep Kernel Learning

1 code implementation • 1 Jun 2021 • Zhiliang Wu, Yinchong Yang, Jindong Gu, Volker Tresp

We propose an uncertainty-aware deep kernel learning model which permits the estimation of the uncertainty in the prediction by a pipeline of a Convolutional Neural Network and a sparse Gaussian Process.
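
A minimal sketch of the pipeline's shape: a small CNN produces features that feed a Gaussian process whose predictive distribution carries the uncertainty. For brevity this uses scikit-learn's exact GP classifier rather than the sparse GP from the paper.

```python
import numpy as np
import torch
import torch.nn as nn
from sklearn.gaussian_process import GaussianProcessClassifier
from sklearn.gaussian_process.kernels import RBF

torch.manual_seed(0)
cnn = nn.Sequential(nn.Conv2d(1, 4, 3), nn.ReLU(), nn.Flatten(),
                    nn.Linear(4 * 6 * 6, 8))          # toy feature extractor
x = torch.rand(40, 1, 8, 8)
y = np.random.RandomState(0).randint(0, 2, 40)

with torch.no_grad():
    feats = cnn(x).numpy()                            # CNN features as GP inputs
gp = GaussianProcessClassifier(kernel=RBF()).fit(feats, y)
proba = gp.predict_proba(feats)                       # predictive probabilities;
# their entropy serves as a simple per-sample uncertainty estimate
```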

Capsule Network is Not More Robust than Convolutional Network

no code implementations • CVPR 2021 • Jindong Gu, Volker Tresp, Han Hu

The examination reveals five major new/different components in CapsNet: a transformation process, a dynamic routing layer, a squashing function, a marginal loss in place of the cross-entropy loss, and an additional class-conditional reconstruction loss for regularization.

Image Classification

Effective and Efficient Vote Attack on Capsule Networks

1 code implementation • ICLR 2021 • Jindong Gu, Baoyuan Wu, Volker Tresp

As alternatives to CNNs, the recently proposed Capsule Networks (CapsNets) are shown to be more robust to white-box attacks than CNNs under popular attack protocols.

Adversarial Robustness

Interpretable Graph Capsule Networks for Object Recognition

no code implementations • 3 Dec 2020 • Jindong Gu, Volker Tresp

In the proposed model, individual classification explanations can be created effectively and efficiently.

Adversarial Robustness · Object +1

Introspective Learning by Distilling Knowledge from Online Self-explanation

no code implementations • 19 Sep 2020 • Jindong Gu, Zhiliang Wu, Volker Tresp

Motivated by this conclusion, we propose an implementation of introspective learning by distilling knowledge from online self-explanations.

Knowledge Distillation

Search for Better Students to Learn Distilled Knowledge

no code implementations • 30 Jan 2020 • Jindong Gu, Volker Tresp

The knowledge of a well-performing teacher is distilled into a student with a smaller architecture.

Knowledge Distillation · Model Compression

Neural Network Memorization Dissection

no code implementations • 21 Nov 2019 • Jindong Gu, Volker Tresp

What is the difference between DNNs trained with random labels and the ones trained with true labels?

Memorization

Improving the Robustness of Capsule Networks to Image Affine Transformations

no code implementations • CVPR 2020 • Jindong Gu, Volker Tresp

Our investigation reveals that the routing procedure contributes neither to the generalization ability nor to the affine robustness of the CapsNets.

Semantics for Global and Local Interpretation of Deep Neural Networks

no code implementations • 21 Oct 2019 • Jindong Gu, Volker Tresp

Deep neural networks (DNNs) with high expressiveness have achieved state-of-the-art performance in many tasks.

Contextual Prediction Difference Analysis for Explaining Individual Image Classifications

no code implementations • 21 Oct 2019 • Jindong Gu, Volker Tresp

In this work, we first show that PDA can suffer from saturated classifiers.

Understanding Bias in Machine Learning

no code implementations • 2 Sep 2019 • Jindong Gu, Daniela Oelke

Bias is known to be an impediment to fair decisions in many domains, such as human resources, the public sector, and health care.

BIG-bench Machine Learning

Saliency Methods for Explaining Adversarial Attacks

no code implementations • 22 Aug 2019 • Jindong Gu, Volker Tresp

The idea behind saliency methods is to explain the classification decisions of neural networks by creating so-called saliency maps.

General Classification
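
The basic object these methods produce can be computed in a few lines; below is a vanilla gradient saliency map on a toy classifier (one of the simplest saliency methods, not necessarily the ones studied in the paper):

```python
import torch
import torch.nn as nn

torch.manual_seed(0)
model = nn.Sequential(nn.Flatten(), nn.Linear(3 * 8 * 8, 10))
image = torch.rand(1, 3, 8, 8, requires_grad=True)
score = model(image)[0, 5]                       # score of the class to explain
score.backward()
saliency = image.grad.abs().max(dim=1).values    # (1, 8, 8) per-pixel importance
```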

Understanding Individual Decisions of CNNs via Contrastive Backpropagation

2 code implementations • 5 Dec 2018 • Jindong Gu, Yinchong Yang, Volker Tresp

The experiments and analysis conclude that the explanations generated by LRP are not class-discriminative.

General Classification
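
The contrastive intuition, illustrated with plain gradients rather than LRP: back-propagating the target class score minus the mean of the other class scores cancels evidence shared by all classes, yielding a class-discriminative map.

```python
import torch
import torch.nn as nn

torch.manual_seed(0)
model = nn.Sequential(nn.Flatten(), nn.Linear(3 * 8 * 8, 10))
image = torch.rand(1, 3, 8, 8, requires_grad=True)
logits = model(image)[0]
contrastive = logits[5] - logits[torch.arange(10) != 5].mean()  # target vs. the rest
contrastive.backward()
explanation = image.grad.clone()   # class-discriminative relevance proxy
```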

Semi-supervised Outlier Detection using Generative And Adversary Framework

no code implementations • ICLR 2018 • Jindong Gu, Matthias Schubert, Volker Tresp

In the adversarial process of training CorGAN, the Generator is supposed to generate outlier samples for the negative class, and the Discriminator, as a one-class classifier, is trained to distinguish data from the training dataset (i.e., the positive class) from data generated by the Generator (i.e., the negative class).

General Classification · Multi-class Classification +2
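
A toy sketch of this adversarial setup: the Discriminator is trained to score real training data as positive and generated samples as negative, and its score then ranks how inlier-like a new sample is. Architectures and objectives below are illustrative stand-ins for CorGAN.

```python
import torch
import torch.nn as nn

torch.manual_seed(0)
G = nn.Sequential(nn.Linear(4, 16), nn.ReLU(), nn.Linear(16, 2))   # outlier generator
D = nn.Sequential(nn.Linear(2, 16), nn.ReLU(), nn.Linear(16, 1))   # one-class classifier
real = torch.randn(64, 2) + 3.0                                    # positive-class data
opt_g = torch.optim.Adam(G.parameters(), lr=1e-3)
opt_d = torch.optim.Adam(D.parameters(), lr=1e-3)
bce = nn.BCEWithLogitsLoss()

for _ in range(200):
    fake = G(torch.randn(64, 4))
    # Discriminator: real -> 1 (inlier), generated -> 0 (outlier).
    d_loss = bce(D(real), torch.ones(64, 1)) + bce(D(fake.detach()), torch.zeros(64, 1))
    opt_d.zero_grad(); d_loss.backward(); opt_d.step()
    # Generator: produce samples the discriminator currently flags as inliers.
    g_loss = bce(D(fake), torch.ones(64, 1))
    opt_g.zero_grad(); g_loss.backward(); opt_g.step()
# After training, D's score ranks how inlier-like a new sample is.
```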
