no code implementations • 13 Mar 2025 • Siyin Wang, Zhaoye Fei, Qinyuan Cheng, Shiduo Zhang, Panpan Cai, Jinlan Fu, Xipeng Qiu
Recent advances in large vision-language models (LVLMs) have shown promise for embodied task planning, yet they struggle with fundamental challenges like dependency constraints and efficiency.
no code implementations • 24 Dec 2024 • Shiduo Zhang, Zhe Xu, Peiju Liu, Xiaopeng Yu, Yuan Li, Qinghui Gao, Zhaoye Fei, Zhangyue Yin, Zuxuan Wu, Yu-Gang Jiang, Xipeng Qiu
General-purposed embodied agents are designed to understand the users' natural instructions or intentions and act precisely to complete universal tasks.
1 code implementation • 30 Oct 2023 • Qiao Sun, Shiduo Zhang, Danjiao Ma, Jingzhe Shi, Derun Li, Simian Luo, Yu Wang, Ningyi Xu, Guangzhi Cao, Hang Zhao
STR reformulates the motion prediction and motion planning problems by arranging observations, states, and actions into one unified sequence modeling task.