no code implementations • 9 Apr 2024 • Xudong Yu, Chenjia Bai, Hongyi Guo, Changhong Wang, Zhen Wang
Offline Reinforcement Learning (RL) faces distributional shift and unreliable value estimation, especially for out-of-distribution (OOD) actions.
no code implementations • 7 Apr 2024 • Xudong Yu, Chenjia Bai, Haoran He, Changhong Wang, Xuelong Li
Sequential decision-making is desired to align with human intents and exhibit versatility across various tasks.
no code implementations • 29 Sep 2023 • Xiaoyu Wen, Xudong Yu, Rui Yang, Chenjia Bai, Zhen Wang
Experimental results illustrate the superiority of RO2O in facilitating stable offline-to-online learning and achieving significant improvement with limited online interactions.
no code implementations • 17 Dec 2020 • Yuanbin Jin, Jiangwei Yan, Shah Jee Rahman, Jie Li, Xudong Yu, Jing Zhang
We measure a highest rotation frequency about 4. 3 GHz of the trapped nanoparticle without feedback cooling and a 6 GHz rotation with feedback cooling, which is the fastest mechanical rotation ever reported to date.
Optics Mesoscale and Nanoscale Physics Quantum Physics