no code implementations • 7 Jun 2024 • Yuxing Long, Wenzhe Cai, Hongcheng Wang, Guanqi Zhan, Hao Dong
To reach this goal, we introduce Dynamic Chain-of-Navigation (DCoN) to unify the planning process for different types of navigation instructions.
no code implementations • 17 Apr 2024 • Guangran Cheng, Chuheng Zhang, Wenzhe Cai, Li Zhao, Changyin Sun, Jiang Bian
While large language models (LLMs) succeed at a variety of language processing tasks, they often fail to interact with the physical world because they cannot properly generate control sequences.
1 code implementation • 25 Dec 2023 • Wenzhang Liu, Wenzhe Cai, Kun Jiang, Guangran Cheng, Yuanda Wang, Jiawei Wang, Jingyu Cao, Lele Xu, Chaoxu Mu, Changyin Sun
In this paper, we present XuanCe, a comprehensive and unified deep reinforcement learning (DRL) library designed to be compatible with PyTorch, TensorFlow, and MindSpore.
1 code implementation • 23 Sep 2023 • Wenzhe Cai, Guangran Cheng, Lingyue Kong, Lu Dong, Changyin Sun
We consider the problem of improving the generalization of mobile robots and achieving sim-to-real transfer for navigation skills.
no code implementations • 20 Sep 2023 • Yuxing Long, Xiaoqi Li, Wenzhe Cai, Hao Dong
Performance on the representative VLN task R2R shows that our method surpasses the leading zero-shot VLN model by a large margin on all metrics.
1 code implementation • 17 Mar 2022 • Xiaoguang Chang, Teng Wang, Changyin Sun, Wenzhe Cai
Scene graph generation is a sophisticated task because there is no specific recognition pattern (e.g., "looking at" and "near" have no conspicuous difference concerning vision, whereas "near" could occur between entities with different morphology).
Ranked #1 on Predicate Classification on Visual Genome (mR@20 metric)