Search Results for author: Rongfu Zheng

Found 1 papers, 0 papers with code

Wukong-Reader: Multi-modal Pre-training for Fine-grained Visual Document Understanding

no code implementations • 19 Dec 2022 • Haoli Bai, Zhiguang Liu, Xiaojun Meng, Wentao Li, Shuang Liu, Nian Xie, Rongfu Zheng, Liangwei Wang, Lu Hou, Jiansheng Wei, Xin Jiang, Qun Liu

While various vision-language pre-training objectives are studied in existing solutions, the document textline, as an intrinsic granularity in VDU, has seldom been explored so far.

Contrastive Learning document understanding +2

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.