Search Results for author: Chaohu Liu

Found 2 papers, 0 papers with code

HRVDA: High-Resolution Visual Document Assistant

no code implementations • 10 Apr 2024 • Chaohu Liu, Kun Yin, Haoyu Cao, Xinghua Jiang, Xin Li, Yinsong Liu, Deqiang Jiang, Xing Sun, Linli Xu

In addition, we construct a document-oriented visual instruction tuning dataset and apply a multi-stage training strategy to enhance the model's document modeling capabilities.

document understanding

Attention Where It Matters: Rethinking Visual Document Understanding with Selective Region Concentration

no code implementations • ICCV 2023 • Haoyu Cao, Changcun Bao, Chaohu Liu, Huang Chen, Kun Yin, Hao Liu, Yinsong Liu, Deqiang Jiang, Xing Sun

We propose a novel end-to-end document understanding model called SeRum (SElective Region Understanding Model) for extracting meaningful information from document images, including document analysis, retrieval, and office automation.

document understanding, Retrieval
