no code implementations • 11 Apr 2024 • Soichiro Nishimori, Xin-Qiang Cai, Johannes Ackermann, Masashi Sugiyama
In this paper, we investigate an offline reinforcement learning (RL) problem where datasets are collected from two domains.
no code implementations • 10 Apr 2024 • Xingyu Song, Zhan Li, Shi Chen, Xin-Qiang Cai, Kazuyuki Demachi
(3) We achieve the same performance with only 10% of the original data for training as with all of the original data from the real-world dataset, and a better performance on In-the-wild videos, by employing our data augmentation techniques.
no code implementations • 6 Feb 2024 • Yuting Tang, Xin-Qiang Cai, Yao-Xiang Ding, Qiyu Wu, Guoqing Liu, Masashi Sugiyama
Instead, the learner only obtains rewards at the ends of bags, where a bag is defined as a partial sequence of a complete trajectory.
no code implementations • 16 Sep 2023 • Kaiyan Zhao, Qiyu Wu, Xin-Qiang Cai, Yoshimasa Tsuruoka
Learning multi-lingual sentence embeddings is a fundamental task in natural language processing.
no code implementations • 17 Jun 2021 • Xin-Qiang Cai, Yao-Xiang Ding, Zi-Xuan Chen, Yuan Jiang, Masashi Sugiyama, Zhi-Hua Zhou
In many real-world imitation learning tasks, the demonstrator and the learner have to act under different observation spaces.
no code implementations • 9 Sep 2019 • Xin-Qiang Cai, Yao-Xiang Ding, Yuan Jiang, Zhi-Hua Zhou
One of the key issues for imitation learning lies in making policy learned from limited samples to generalize well in the whole state-action space.