Search Results for author: Yuanpu Cao

Found 4 papers, 2 papers with code

Federated Learning with Projected Trajectory Regularization

no code implementations | 22 Dec 2023 | Tiejin Chen, Yuanpu Cao, Yujia Wang, Cho-Jui Hsieh, Jinghui Chen

Specifically, FedPTR allows local clients or the server to optimize an auxiliary (synthetic) dataset that mimics the learning dynamics of the recent model update and utilizes it to project the next-step model trajectory for local training regularization.

Federated Learning

Stealthy and Persistent Unalignment on Large Language Models via Backdoor Injections

no code implementations | 15 Nov 2023 | Yuanpu Cao, Bochuan Cao, Jinghui Chen

In this work, we show that it is possible to conduct stealthy and persistent unalignment on large language models via backdoor injections.

Defending Against Alignment-Breaking Attacks via Robustly Aligned LLM

1 code implementation | 18 Sep 2023 | Bochuan Cao, Yuanpu Cao, Lu Lin, Jinghui Chen

In this work, we introduce a Robustly Aligned LLM (RA-LLM) to defend against potential alignment-breaking attacks.

RLCard: A Toolkit for Reinforcement Learning in Card Games

8 code implementations | 10 Oct 2019 | Daochen Zha, Kwei-Herng Lai, Yuanpu Cao, Songyi Huang, Ruzhe Wei, Junyu Guo, Xia Hu

The goal of RLCard is to bridge reinforcement learning and imperfect information games, and push forward the research of reinforcement learning in domains with multiple agents, large state and action space, and sparse reward.

Board Games, Game of Poker (+3 more)
