1 code implementation • 4 Feb 2024 • Liang Qiao, Jun Shi, Xiaoyu Hao, Xi Fang, Minfan Zhao, Ziqi Zhu, Junshi Chen, Hong An, Bing Li, Honghui Yuan, Xinyang Wang
Tensor program optimization on Deep Learning Accelerators (DLAs) is critical for efficient model deployment.
no code implementations • 24 Oct 2023 • Enrique Urbano Arellano, Xinyang Wang
Why do agents adopt a particular general behavioral rule among a collection of possible alternatives?
1 code implementation • 12 Oct 2023 • Jinbo Song, Ruoran Huang, Xinyang Wang, Wei Huang, Qian Yu, Mingming Chen, Yafei Yao, Chaosheng Fan, Changping Peng, Zhangang Lin, Jinghe Hu, Jingping Shao
Industrial systems such as recommender systems and online advertising, have been widely equipped with multi-stage architectures, which are divided into several cascaded modules, including matching, pre-ranking, ranking and re-ranking.