no code implementations • 11 Jul 2024 • Zeyang Zhao, Qilong Xue, Yuhang He, Yifan Bai, Xing Wei, Yihong Gong
This paper introduces the point-axis representation for oriented object detection, emphasizing its flexibility and geometrically intuitive nature with two key components: points and axes.
no code implementations • 28 May 2024 • Yifan Bai, Dongming Wu, Yingfei Liu, Fan Jia, Weixin Mao, Ziheng Zhang, Yucheng Zhao, Jianbing Shen, Xing Wei, Tiancai Wang, Xiangyu Zhang
Despite its simplicity, Atlas demonstrates superior performance in both 3D detection and ego planning tasks on nuScenes dataset, proving that 3D-tokenized LLM is the key to reliable autonomous driving.
1 code implementation • CVPR 2024 • Yifan Bai, Zeyang Zhao, Yihong Gong, Xing Wei
We present ARTrackV2, which integrates two pivotal aspects of tracking: determining where to look (localization) and how to describe (appearance analysis) the target object across video frames.
Ranked #1 on Visual Object Tracking on NeedForSpeed
1 code implementation • CVPR 2023 2023 • Xing Wei, Yifan Bai, Yongchao Zheng, Dahu Shi, Yihong Gong
We present ARTrack, an autoregressive framework for visual object tracking.
Ranked #1 on Visual Tracking on TNL2K
no code implementations • 20 Mar 2022 • Wanshui Li, Yifan Bai, Jiaxuan Lu, Kexin Yi
We designed and built a game called \textit{Immersive Text Game}, which allows the player to choose a story and a character, and interact with other characters in the story in an immersive manner of dialogues.
10 code implementations • 21 Sep 2020 • Yuning Du, Chenxia Li, Ruoyu Guo, Xiaoting Yin, Weiwei Liu, Jun Zhou, Yifan Bai, Zilin Yu, Yehua Yang, Qingqing Dang, Haoshuang Wang
Meanwhile, several pre-trained models for the Chinese and English recognition are released, including a text detector (97K images are used), a direction classifier (600K images are used) as well as a text recognizer (17. 9M images are used).
1 code implementation • 23 Jul 2019 • Shang Wang, Yifan Bai, Gennady Pekhimenko
In an era when the performance of a single compute device plateaus, software must be designed to scale on massively parallel systems for better runtime performance.