Search Results for author: Ya Jing

Found 11 papers, 3 papers with code

Knowledge Boundary and Persona Dynamic Shape A Better Social Media Agent

1 code implementation28 Mar 2024 Junkai Zhou, Liang Pang, Ya Jing, Jia Gu, HuaWei Shen, Xueqi Cheng

For dynamic persona information, we use current action information to internally retrieve the persona information of the agent, thereby reducing the interference of diverse persona information on the current action.

World Knowledge

Unleashing Large-Scale Video Generative Pre-training for Visual Robot Manipulation

1 code implementation20 Dec 2023 Hongtao Wu, Ya Jing, Chilam Cheang, Guangzeng Chen, Jiafeng Xu, Xinghang Li, Minghuan Liu, Hang Li, Tao Kong

In this paper, we extend the scope of this effectiveness by showing that visual robot manipulation can significantly benefit from large-scale video generative pre-training.

Ranked #2 on Zero-shot Generalization on CALVIN (using extra training data)

Robot Manipulation Zero-shot Generalization

Vision-Language Foundation Models as Effective Robot Imitators

no code implementations2 Nov 2023 Xinghang Li, Minghuan Liu, Hanbo Zhang, Cunjun Yu, Jie Xu, Hongtao Wu, Chilam Cheang, Ya Jing, Weinan Zhang, Huaping Liu, Hang Li, Tao Kong

We believe RoboFlamingo has the potential to be a cost-effective and easy-to-use solution for robotics manipulation, empowering everyone with the ability to fine-tune their own robotics policy.

Imitation Learning

Learning to Explore Informative Trajectories and Samples for Embodied Perception

no code implementations20 Mar 2023 Ya Jing, Tao Kong

Unlike static perception methods trained on pre-collected images, the embodied agent can move around in the environment and obtain images of objects from any viewpoints.

Towards Unifying Reference Expression Generation and Comprehension

1 code implementation24 Oct 2022 Duo Zheng, Tao Kong, Ya Jing, Jiaan Wang, Xiaojie Wang

Additionally, IRTF could generate pseudo input regions for the REC task to enable a uniform way for sharing the identical representation space across the REC and REG.

Language Modelling Masked Language Modeling +1

Cross-Modal Cross-Domain Moment Alignment Network for Person Search

no code implementations CVPR 2020 Ya Jing, Wei Wang, Liang Wang, Tieniu Tan

Specially, we propose a moment alignment network (MAN) to solve the cross-modal cross-domain person search task in this paper.

Person Search Text based Person Search

Pose-Guided Multi-Granularity Attention Network for Text-Based Person Search

no code implementations22 Sep 2018 Ya Jing, Chenyang Si, Jun-Bo Wang, Wei Wang, Liang Wang, Tieniu Tan

To exploit the multilevel corresponding visual contents, we propose a pose-guided multi-granularity attention network (PMA).

Person Search Sentence +1

Cannot find the paper you are looking for? You can Submit a new open access paper.