Search Results for author: Wenqing Cheng

Found 7 papers, 4 papers with code

OmniParser: A Unified Framework for Text Spotting, Key Information Extraction and Table Recognition

1 code implementation • 28 Mar 2024 • Jianqiang Wan, Sibo Song, Wenwen Yu, Yuliang Liu, Wenqing Cheng, Fei Huang, Xiang Bai, Cong Yao, Zhibo Yang

Recently, visually-situated text parsing (VsTP) has experienced notable advancements, driven by the increasing demand for automated document understanding and the emergence of Generative Large Language Models (LLMs) capable of processing document-based questions.

document understanding Key Information Extraction +3

930

Paper
Code

DD-RobustBench: An Adversarial Robustness Benchmark for Dataset Distillation

no code implementations • 20 Mar 2024 • Yifan Wu, Jiawei Du, Ping Liu, Yuewei Lin, Wenqing Cheng, Wei Xu

Dataset distillation is an advanced technique aimed at compressing datasets into significantly smaller counterparts, while preserving formidable training performance.

Adversarial Attack Adversarial Robustness

Paper
Add Code

Modeling Entities as Semantic Points for Visual Information Extraction in the Wild

no code implementations • CVPR 2023 • Zhibo Yang, Rujiao Long, Pengfei Wang, Sibo Song, Humen Zhong, Wenqing Cheng, Xiang Bai, Cong Yao

As the first contribution of this work, we curate and release a new dataset for VIE, in which the document images are much more challenging in that they are taken from real applications, and difficulties such as blur, partial occlusion, and printing shift are quite common.

Text Spotting

Paper
Add Code

Vision-Language Pre-Training for Boosting Scene Text Detectors

2 code implementations • CVPR 2022 • Sibo Song, Jianqiang Wan, Zhibo Yang, Jun Tang, Wenqing Cheng, Xiang Bai, Cong Yao

In this paper, we specifically adapt vision-language joint learning for scene text detection, a task that intrinsically involves cross-modal interaction between the two modalities: vision and language, since text is the written form of language.

Contrastive Learning Language Modelling +4

930

Paper
Code

YOLOP: You Only Look Once for Panoptic Driving Perception

5 code implementations • 25 Aug 2021 • Dong Wu, Manwen Liao, Weitian Zhang, Xinggang Wang, Xiang Bai, Wenqing Cheng, Wenyu Liu

A panoptic driving perception system is an essential part of autonomous driving.

Ranked #3 on Drivable Area Detection on BDD100K val

Autonomous Driving Drivable Area Detection +5

1,821

Paper
Code

MOST: A Multi-Oriented Scene Text Detector with Localization Refinement

no code implementations • CVPR 2021 • Minghang He, Minghui Liao, Zhibo Yang, Humen Zhong, Jun Tang, Wenqing Cheng, Cong Yao, Yongpan Wang, Xiang Bai

Over the past few years, the field of scene text detection has progressed rapidly that modern text detectors are able to hunt text in various challenging scenarios.

Scene Text Detection Text Detection

Paper
Add Code

Progressive and Aligned Pose Attention Transfer for Person Image Generation

1 code implementation • 22 Mar 2021 • Zhen Zhu, Tengteng Huang, Mengde Xu, Baoguang Shi, Wenqing Cheng, Xiang Bai

This paper proposes a new generative adversarial network for pose transfer, i. e., transferring the pose of a given person to a target pose.

Data Augmentation Generative Adversarial Network +2

731

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.