no code implementations • 31 May 2024 • Pengyuan Lyu, Yulin Li, Hao Zhou, Weihong Ma, Xingyu Wan, Qunyi Xie, Liang Wu, Chengquan Zhang, Kun Yao, Errui Ding, Jingdong Wang
Text-rich images have significant and extensive value and are deeply integrated into various aspects of human life.
no code implementations • 30 May 2024 • Xingyu Wan, Chengquan Zhang, Pengyuan Lyu, Sen Fan, Zihan Ni, Kun Yao, Errui Ding, Jingdong Wang
Existing OCR engines or document image analysis systems typically rely on training separate models for text detection in varying scenarios and granularities, leading to significant computational complexity and resource demands.
Document Layout Analysis, Optical Character Recognition (OCR)
1 code implementation • 12 Dec 2020 • Rongye Meng, Sanping Zhou, Xingyu Wan, Mengliu Li, Jinjun Wang
The radical student uses multi-source supervision signals from the same task to update parameters, while the calm teacher uses a single-source supervision signal to update parameters.
no code implementations • 13 Jul 2020 • Xingyu Wan, Jiakai Cao, Sanping Zhou, Jinjun Wang
Most existing Multi-Object Tracking (MOT) approaches follow the Tracking-by-Detection paradigm and the data association framework, in which objects are first detected and then associated.