Search Results for author: Yifan Bai

Found 7 papers, 4 papers with code

Projecting Points to Axes: Oriented Object Detection via Point-Axis Representation

no code implementations11 Jul 2024 Zeyang Zhao, Qilong Xue, Yuhang He, Yifan Bai, Xing Wei, Yihong Gong

This paper introduces the point-axis representation for oriented object detection, emphasizing its flexibility and geometrically intuitive nature with two key components: points and axes.

object-detection Object Detection +2

Is a 3D-Tokenized LLM the Key to Reliable Autonomous Driving?

no code implementations28 May 2024 Yifan Bai, Dongming Wu, Yingfei Liu, Fan Jia, Weixin Mao, Ziheng Zhang, Yucheng Zhao, Jianbing Shen, Xing Wei, Tiancai Wang, Xiangyu Zhang

Despite its simplicity, Atlas demonstrates superior performance in both 3D detection and ego planning tasks on nuScenes dataset, proving that 3D-tokenized LLM is the key to reliable autonomous driving.

3D Object Detection Autonomous Driving +4

ARTrackV2: Prompting Autoregressive Tracker Where to Look and How to Describe

1 code implementation CVPR 2024 Yifan Bai, Zeyang Zhao, Yihong Gong, Xing Wei

We present ARTrackV2, which integrates two pivotal aspects of tracking: determining where to look (localization) and how to describe (appearance analysis) the target object across video frames.

Object Template Matching +2

Immersive Text Game and Personality Classification

no code implementations20 Mar 2022 Wanshui Li, Yifan Bai, Jiaxuan Lu, Kexin Yi

We designed and built a game called \textit{Immersive Text Game}, which allows the player to choose a story and a character, and interact with other characters in the story in an immersive manner of dialogues.

Classification Language Modelling +1

PP-OCR: A Practical Ultra Lightweight OCR System

10 code implementations21 Sep 2020 Yuning Du, Chenxia Li, Ruoyu Guo, Xiaoting Yin, Weiwei Liu, Jun Zhou, Yifan Bai, Zilin Yu, Yehua Yang, Qingqing Dang, Haoshuang Wang

Meanwhile, several pre-trained models for the Chinese and English recognition are released, including a text detector (97K images are used), a direction classifier (600K images are used) as well as a text recognizer (17. 9M images are used).

Computational Efficiency Optical Character Recognition +1

BPPSA: Scaling Back-propagation by Parallel Scan Algorithm

1 code implementation23 Jul 2019 Shang Wang, Yifan Bai, Gennady Pekhimenko

In an era when the performance of a single compute device plateaus, software must be designed to scale on massively parallel systems for better runtime performance.

Cannot find the paper you are looking for? You can Submit a new open access paper.