no code implementations • 26 Mar 2024 • Youpeng Zhao, Di Wu, Jun Wang
In a single GPU-CPU system, we demonstrate that under varying workloads, ALISA improves the throughput of baseline systems such as FlexGen and vLLM by up to 3X and 1.9X, respectively.
no code implementations • 28 Feb 2024 • Youpeng Zhao, Ming Lin, Huadong Tang, Qiang Wu, Jun Wang
Generative Large Language Models (LLMs) stand as a revolutionary advancement in the modern era of artificial intelligence (AI).
1 code implementation • 5 Dec 2023 • Youpeng Zhao, Yudong Lu, Jian Zhao, Wengang Zhou, Houqiang Li
The utilization of artificial intelligence (AI) in card games has been a well-explored subject within AI research for an extensive period.
no code implementations • 31 Oct 2022 • Yudong Lu, Jian Zhao, Youpeng Zhao, Wengang Zhou, Houqiang Li
We compare it with 8 baseline AI programs which are based on heuristic rules and the results reveal the outstanding performance of DanZero.
no code implementations • 15 Jul 2022 • Youpeng Zhao, Huadong Tang, Yingying Jiang, Yong A, Qiang Wu
Recent advances in vision transformers (ViTs) have achieved great performance in visual recognition tasks.
1 code implementation • 6 Jun 2022 • Yunpeng Xiao, Youpeng Zhao, Ge Yang
Fully supervised deep learning models developed for this task achieve excellent performance but require substantial amounts of annotated data for training.
1 code implementation • 6 Apr 2022 • Youpeng Zhao, Jian Zhao, Xunhan Hu, Wengang Zhou, Houqiang Li
Recent years have witnessed the great breakthrough of deep reinforcement learning (DRL) in various perfect and imperfect information games.
1 code implementation • 16 Mar 2022 • Jian Zhao, Youpeng Zhao, Weixun Wang, Mingyu Yang, Xunhan Hu, Wengang Zhou, Jianye Hao, Houqiang Li
To the best of our knowledge, this work is the first to study the unexpected crashes in the multi-agent system.
1 code implementation • 21 Feb 2022 • Jian Zhao, Mingyu Yang, Youpeng Zhao, Xunhan Hu, Wengang Zhou, Jiangcheng Zhu, Houqiang Li
Specifically, we model both the individual Q-values and the global Q-value with categorical distributions.