Search Results for author: Wenzhe Cai

Found 6 papers, 3 papers with code

InstructNav: Zero-shot System for Generic Instruction Navigation in Unexplored Environment

no code implementations • 7 Jun 2024 • Yuxing Long, Wenzhe Cai, Hongcheng Wang, Guanqi Zhan, Hao Dong

To reach this goal, we introduce Dynamic Chain-of-Navigation (DCoN) to unify the planning process for different types of navigation instructions.

Navigate

Empowering Large Language Models on Robotic Manipulation with Affordance Prompting

no code implementations • 17 Apr 2024 • Guangran Cheng, Chuheng Zhang, Wenzhe Cai, Li Zhao, Changyin Sun, Jiang Bian

While large language models (LLMs) are successful in completing various language processing tasks, they easily fail to interact with the physical world by generating control sequences properly.

XuanCe: A Comprehensive and Unified Deep Reinforcement Learning Library

1 code implementation • 25 Dec 2023 • Wenzhang Liu, Wenzhe Cai, Kun Jiang, Guangran Cheng, Yuanda Wang, Jiawei Wang, Jingyu Cao, Lele Xu, Chaoxu Mu, Changyin Sun

In this paper, we present XuanCe, a comprehensive and unified deep reinforcement learning (DRL) library designed to be compatible with PyTorch, TensorFlow, and MindSpore.

reinforcement-learning

Robust Navigation with Cross-Modal Fusion and Knowledge Transfer

1 code implementation • 23 Sep 2023 • Wenzhe Cai, Guangran Cheng, Lingyue Kong, Lu Dong, Changyin Sun

We consider the problem of improving the generalization of mobile robots and achieving sim-to-real transfer for navigation skills.

Transfer Learning

Discuss Before Moving: Visual Language Navigation via Multi-expert Discussions

no code implementations • 20 Sep 2023 • Yuxing Long, Xiaoqi Li, Wenzhe Cai, Hao Dong

The performances on the representative VLN task R2R show that our method surpasses the leading zero-shot VLN model by a large margin on all metrics.

Language Modelling, Large Language Model

Biasing Like Human: A Cognitive Bias Framework for Scene Graph Generation

1 code implementation • 17 Mar 2022 • Xiaoguang Chang, Teng Wang, Changyin Sun, Wenzhe Cai

Scene graph generation is a sophisticated task because there is no specific recognition pattern (e.g., "looking at" and "near" have no conspicuous difference concerning vision, whereas "near" could occur between entities with different morphology).

Graph Generation, Predicate Classification, +2
