no code implementations • 21 Feb 2024 • Xiao-Yang Liu, Jie Zhang, Guoxuan Wang, Weiqing Tong, Anwar Walid
However, the resulting model still consumes a large amount of GPU memory.
no code implementations • 4 Feb 2023 • Xiao-Yang Liu, Ming Zhu, Sem Borst, Anwar Walid
In this paper, we investigate deep reinforcement learning to control traffic lights, and both theoretical analysis and numerical experiments show that the intelligent behavior ``greenwave" (i. e., a vehicle will see a progressive cascade of green lights, and not have to brake at any intersection) emerges naturally a grid road network, which is proved to be the optimal policy in an avenue with multiple cross streets.
no code implementations • 3 Nov 2022 • Shiva Raj Pokhrel, Jinho Choi, Anwar Walid
The bottleneck of distributed edge learning (DEL) over wireless has shifted from computing to communication, primarily the aggregation-averaging (Agg-Avg) process of DEL.
no code implementations • 15 Mar 2022 • Zeyu Zhou, Bruce Hajek, Nakjung Choi, Anwar Walid
Particle Thompson sampling (PTS) is an approximation of Thompson sampling obtained by simply replacing the continuous distribution by a discrete distribution supported at a set of weighted static particles.
1 code implementation • 11 Dec 2021 • Xiao-Yang Liu, Zechu Li, Zhuoran Yang, Jiahao Zheng, Zhaoran Wang, Anwar Walid, Jian Guo, Michael I. Jordan
In this paper, we present a scalable and elastic library ElegantRL-podracer for cloud-native deep reinforcement learning, which efficiently supports millions of GPU cores to carry out massively parallel training at multiple levels.
no code implementations • 7 Nov 2021 • Zechu Li, Xiao-Yang Liu, Jiahao Zheng, Zhaoran Wang, Anwar Walid, Jian Guo
Unfortunately, the steep learning curve and the difficulty in quick modeling and agile development are impeding finance researchers from using deep reinforcement learning in quantitative trading.
no code implementations • 25 Feb 2021 • Shaoxiong Ji, Yue Tan, Teemu Saravirta, Zhiqin Yang, Yixin Liu, Lauri Vasankari, Shirui Pan, Guodong Long, Anwar Walid
Federated learning is a new learning paradigm that decouples data collection and model training via multi-party computation and model aggregation.
no code implementations • 13 Dec 2020 • Beomyeol Jeon, S. M. Ferdous, Muntasir Raihan Rahman, Anwar Walid
In this paper, we develop a privacy-preserving decentralized aggregation protocol for federated learning.
no code implementations • 21 Mar 2020 • Shaoxiong Ji, Wenqi Jiang, Anwar Walid, Xue Li
Federated learning (FL) is a novel machine learning setting that enables on-device intelligence via decentralized training and federated optimization.
no code implementations • 3 Dec 2018 • Xiao-Yang Liu, Zihan Ding, Sem Borst, Anwar Walid
Intelligent Transportation Systems (ITSs) are envisioned to play a critical role in improving traffic flow and reducing congestion, which is a pervasive issue impacting urban areas around the globe.
9 code implementations • 19 Nov 2018 • Xiao-Yang Liu, Zhuoran Xiong, Shan Zhong, Hongyang Yang, Anwar Walid
We explore the potential of deep reinforcement learning to optimize stock trading strategy and thus maximize investment return.
no code implementations • 18 Nov 2018 • Weijun Lu, Xiao-Yang Liu, Qingwei Wu, Yue Sun, Anwar Walid
We propose a novel multilinear dynamical system (MLDS) in a transform domain, named $\mathcal{L}$-MLDS, to model tensor time series.
no code implementations • 13 Dec 2017 • Tao Deng, Xiao-Yang Liu, Feng Qian, Anwar Walid
The recently proposed transform-based tensor model is more appropriate for sensory data processing, as it helps exploit the geometric structures of the three-dimensional target and improve the recovery precision.