no code implementations • 25 Jun 2024 • Zhen Chen, Yong Liao, Youpeng Zhao, Zipeng Dai, Jian Zhao
Previous works on adversarial attacks have primarily focused on white-box attacks that directly perturb the states or actions of victim agents, often in scenarios with a limited number of attacks.
1 code implementation • 6 Jun 2024 • Lin Liu, Jian Zhao, Cheng Hu, Zhengtao Cao, Youpeng Zhao, Zhenbin Ye, Meng Meng, Wenjun Wang, Zhaofeng He, Houqiang Li, Xia Lin, Lanxiao Huang
To address these issues, we introduce the first publicly available map editor for the popular mobile game Honor of Kings and design a lightweight environment, Mini Honor of Kings (Mini HoK), for researchers to conduct experiments.
no code implementations • 26 Mar 2024 • Youpeng Zhao, Di wu, Jun Wang
In a single GPU-CPU system, we demonstrate that under varying workloads, ALISA improves the throughput of baseline systems such as FlexGen and vLLM by up to 3X and 1. 9X, respectively.
no code implementations • 28 Feb 2024 • Youpeng Zhao, Ming Lin, Huadong Tang, Qiang Wu, Jun Wang
Generative Large Language Models (LLMs) stand as a revolutionary advancement in the modern era of artificial intelligence (AI).
1 code implementation • 5 Dec 2023 • Youpeng Zhao, Yudong Lu, Jian Zhao, Wengang Zhou, Houqiang Li
The utilization of artificial intelligence (AI) in card games has been a well-explored subject within AI research for an extensive period.
no code implementations • 31 Oct 2022 • Yudong Lu, Jian Zhao, Youpeng Zhao, Wengang Zhou, Houqiang Li
We compare it with 8 baseline AI programs which are based on heuristic rules and the results reveal the outstanding performance of DanZero.
no code implementations • 15 Jul 2022 • Youpeng Zhao, Huadong Tang, Yingying Jiang, Yong A, Qiang Wu
Recent advances in vision transformers (ViTs) have achieved great performance in visual recognition tasks.
1 code implementation • 6 Jun 2022 • Yunpeng Xiao, Youpeng Zhao, Ge Yang
Fully supervised deep learning models developed for this task achieve excellent performance but require substantial amounts of annotated data for training.
1 code implementation • 6 Apr 2022 • Youpeng Zhao, Jian Zhao, Xunhan Hu, Wengang Zhou, Houqiang Li
Recent years have witnessed the great breakthrough of deep reinforcement learning (DRL) in various perfect and imperfect information games.
1 code implementation • 16 Mar 2022 • Jian Zhao, Youpeng Zhao, Weixun Wang, Mingyu Yang, Xunhan Hu, Wengang Zhou, Jianye Hao, Houqiang Li
To the best of our knowledge, this work is the first to study the unexpected crashes in the multi-agent system.
Multi-agent Reinforcement Learning reinforcement-learning +3
1 code implementation • 21 Feb 2022 • Jian Zhao, Mingyu Yang, Youpeng Zhao, Xunhan Hu, Wengang Zhou, Jiangcheng Zhu, Houqiang Li
Specifically, we model both individual Q-values and global Q-value with categorical distribution.