Search Results for author: Zipeng Dai

Found 5 papers, 1 paper with code

CuDA2: An approach for Incorporating Traitor Agents into Cooperative Multi-Agent Systems

no code implementations • 25 Jun 2024 • Zhen Chen, Yong Liao, Youpeng Zhao, Zipeng Dai, Jian Zhao

Previous works on adversarial attacks have primarily focused on white-box attacks that directly perturb the states or actions of victim agents, often in scenarios with a limited number of attacks.

Adversarial Attack, SMAC+

HiBid: A Cross-Channel Constrained Bidding System with Budget Allocation by Hierarchical Offline Deep Reinforcement Learning

no code implementations • 29 Dec 2023 • Hao Wang, Bo Tang, Chi Harold Liu, Shangqin Mao, Jiahong Zhou, Zipeng Dai, Yaqi Sun, Qianlong Xie, Xingxing Wang, Dong Wang

Online display advertising platforms service numerous advertisers by providing real-time bidding (RTB) for the scale of billions of ad requests every day.

Data Augmentation

Timing is Everything: Learning to Act Selectively with Costly Actions and Budgetary Constraints

no code implementations • 31 May 2022 • David Mguni, Aivar Sootla, Juliusz Ziomek, Oliver Slumbers, Zipeng Dai, Kun Shao, Jun Wang

In this paper, we introduce a reinforcement learning (RL) framework named Learnable Impulse Control Reinforcement Algorithm (LICRA), for learning to optimally select both when to act and which actions to take when actions incur costs.

Reinforcement Learning (RL)
