1 code implementation • 28 Mar 2024 • Junkai Zhou, Liang Pang, Ya Jing, Jia Gu, HuaWei Shen, Xueqi Cheng
For dynamic persona information, we use current action information to internally retrieve the persona information of the agent, thereby reducing the interference of diverse persona information on the current action.
1 code implementation • 20 Dec 2023 • Hongtao Wu, Ya Jing, Chilam Cheang, Guangzeng Chen, Jiafeng Xu, Xinghang Li, Minghuan Liu, Hang Li, Tao Kong
In this paper, we extend the scope of this effectiveness by showing that visual robot manipulation can significantly benefit from large-scale video generative pre-training.
Ranked #2 on Zero-shot Generalization on CALVIN (using extra training data)
no code implementations • 2 Nov 2023 • Xinghang Li, Minghuan Liu, Hanbo Zhang, Cunjun Yu, Jie Xu, Hongtao Wu, Chilam Cheang, Ya Jing, Weinan Zhang, Huaping Liu, Hang Li, Tao Kong
We believe RoboFlamingo has the potential to be a cost-effective and easy-to-use solution for robotics manipulation, empowering everyone with the ability to fine-tune their own robotics policy.
no code implementations • 7 Aug 2023 • Taozheng Yang, Ya Jing, Hongtao Wu, Jiafeng Xu, Kuankuan Sima, Guangzeng Chen, Qie Sima, Tao Kong
In this paper, we present a novel method for mobile manipulators to perform multiple contact-rich manipulation tasks.
no code implementations • 7 Aug 2023 • Ya Jing, Xuelin Zhu, Xingbin Liu, Qie Sima, Taozheng Yang, Yunhai Feng, Tao Kong
However, the recipes of visual pre-training for robot manipulation tasks are yet to be built.
no code implementations • 20 Mar 2023 • Ya Jing, Tao Kong
Unlike static perception methods trained on pre-collected images, the embodied agent can move around in the environment and obtain images of objects from any viewpoints.
1 code implementation • 24 Oct 2022 • Duo Zheng, Tao Kong, Ya Jing, Jiaan Wang, Xiaojie Wang
Additionally, IRTF could generate pseudo input regions for the REC task to enable a uniform way for sharing the identical representation space across the REC and REG.
no code implementations • CVPR 2021 • Ya Jing, Tao Kong, Wei Wang, Liang Wang, Lei LI, Tieniu Tan
Referring image segmentation aims to segment the objects referred by a natural language expression.
Generalized Referring Expression Segmentation Image Segmentation +2
no code implementations • CVPR 2020 • Ya Jing, Wei Wang, Liang Wang, Tieniu Tan
Specially, we propose a moment alignment network (MAN) to solve the cross-modal cross-domain person search task in this paper.
no code implementations • 22 Sep 2018 • Ya Jing, Chenyang Si, Jun-Bo Wang, Wei Wang, Liang Wang, Tieniu Tan
To exploit the multilevel corresponding visual contents, we propose a pose-guided multi-granularity attention network (PMA).
no code implementations • ECCV 2018 • Chenyang Si, Ya Jing, Wei Wang, Liang Wang, Tieniu Tan
Skeleton-based action recognition has made great progress recently, but many problems still remain unsolved.
Ranked #81 on Skeleton Based Action Recognition on NTU RGB+D