Search Results for author: Mengxin Zheng

Found 4 papers, 1 papers with code

TrojFSP: Trojan Insertion in Few-shot Prompt Tuning

no code implementations16 Dec 2023 Mengxin Zheng, Jiaqi Xue, Xun Chen, Yanshan Wang, Qian Lou, Lei Jiang

However, the security issues, e. g., Trojan attacks, of prompt tuning on a few data samples are not well-studied.

Data Poisoning Language Modelling

TrojFair: Trojan Fairness Attacks

no code implementations16 Dec 2023 Mengxin Zheng, Jiaqi Xue, Yi Sheng, Lei Yang, Qian Lou, Lei Jiang

TrojFair is a stealthy Fairness attack that is resilient to existing model fairness audition detectors since the model for clean inputs is fair.

Fairness

SSL-Cleanse: Trojan Detection and Mitigation in Self-Supervised Learning

no code implementations16 Mar 2023 Mengxin Zheng, Jiaqi Xue, ZiHao Wang, Xun Chen, Qian Lou, Lei Jiang, XiaoFeng Wang

We evaluated SSL-Cleanse on various datasets using 1200 encoders, achieving an average detection success rate of 82. 2% on ImageNet-100.

Self-Supervised Learning

Cannot find the paper you are looking for? You can Submit a new open access paper.