Search Results for author: Chongyu Liu

Found 12 papers, 10 papers with code

DocRes: A Generalist Model Toward Unifying Document Image Restoration Tasks

1 code implementation • 7 May 2024 • Jiaxin Zhang, Dezhi Peng, Chongyu Liu, Peirong Zhang, Lianwen Jin

This underscores the potential of DocRes across a broader spectrum of document image restoration tasks.

Binarization Deblurring +3

152

Paper
Code

Datasets for Large Language Models: A Comprehensive Survey

1 code implementation • 28 Feb 2024 • Yang Liu, Jiahuan Cao, Chongyu Liu, Kai Ding, Lianwen Jin

Additionally, a comprehensive review of the existing available dataset resources is also provided, including statistics from 444 datasets, covering 8 language categories and spanning 32 domains.

Language Modelling Large Language Model

597

Paper
Code

SwinTextSpotter v2: Towards Better Synergy for Scene Text Spotting

no code implementations • 15 Jan 2024 • Mingxin Huang, Dezhi Peng, Hongliang Li, Zhenghao Peng, Chongyu Liu, Dahua Lin, Yuliang Liu, Xiang Bai, Lianwen Jin

In this paper, we propose a new end-to-end scene text spotting framework termed SwinTextSpotter v2, which seeks to find a better synergy between text detection and recognition.

Text Detection Text Spotting

Paper
Add Code

UPOCR: Towards Unified Pixel-Level OCR Interface

no code implementations • 5 Dec 2023 • Dezhi Peng, Zhenhua Yang, Jiaxin Zhang, Chongyu Liu, Yongxin Shi, Kai Ding, Fengjun Guo, Lianwen Jin

Without bells and whistles, the experimental results showcase that the proposed method can simultaneously achieve state-of-the-art performance on three tasks with a unified single model, which provides valuable strategies and insights for future research on generalist OCR models.

Decoder Optical Character Recognition +3

Paper
Add Code

Exploring OCR Capabilities of GPT-4V(ision) : A Quantitative and In-depth Evaluation

1 code implementation • 25 Oct 2023 • Yongxin Shi, Dezhi Peng, Wenhui Liao, Zening Lin, Xinhong Chen, Chongyu Liu, Yuyi Zhang, Lianwen Jin

We assess the model's performance across a range of OCR tasks, including scene text recognition, handwritten text recognition, handwritten mathematical expression recognition, table structure recognition, and information extraction from visually-rich document.

Handwritten Text Recognition Optical Character Recognition +2

105

Paper
Code

Revisiting Scene Text Recognition: A Data Perspective

1 code implementation • ICCV 2023 • Qing Jiang, Jiapeng Wang, Dezhi Peng, Chongyu Liu, Lianwen Jin

To this end, we consolidate a large-scale real STR dataset, namely Union14M, which comprises 4 million labeled images and 10 million unlabeled images, to assess the performance of STR models in more complex real-world scenarios.

Scene Text Recognition

1,924

Paper
Code

ViTEraser: Harnessing the Power of Vision Transformers for Scene Text Removal with SegMIM Pretraining

1 code implementation • 21 Jun 2023 • Dezhi Peng, Chongyu Liu, Yuliang Liu, Lianwen Jin

As ViTEraser implicitly integrates text localization and inpainting, we propose a novel end-to-end pretraining method, termed SegMIM, which focuses the encoder and decoder on the text box segmentation and masked image modeling tasks, respectively.

Decoder Long-range modeling +2

Paper
Code

Towards Robust Tampered Text Detection in Document Image: New Dataset and New Solution

1 code implementation • CVPR 2023 • Chenfan Qu, Chongyu Liu, Yuliang Liu, Xinhong Chen, Dezhi Peng, Fengjun Guo, Lianwen Jin

In this paper, we propose a novel framework to capture more fine-grained clues in complex scenarios for tampered text detection, termed as Document Tampering Detector (DTD), which consists of a Frequency Perception Head (FPH) to compensate the deficiencies caused by the inconspicuous visual features, and a Multi-view Iterative Decoder (MID) for fully utilizing the information of features in different scales.

Decoder Image and Video Forgery Detection +2

Paper
Code

Don't Forget Me: Accurate Background Recovery for Text Removal via Modeling Local-Global Context

1 code implementation • 21 Jul 2022 • Chongyu Liu, Lianwen Jin, Yuliang Liu, Canjie Luo, Bangdong Chen, Fengjun Guo, Kai Ding

To address this issue, we propose a Contextual-guided Text Removal Network, termed as CTRNet.

Paper
Code

SwinTextSpotter: Scene Text Spotting via Better Synergy between Text Detection and Text Recognition

2 code implementations • CVPR 2022 • Mingxin Huang, Yuliang Liu, Zhenghao Peng, Chongyu Liu, Dahua Lin, Shenggao Zhu, Nicholas Yuan, Kai Ding, Lianwen Jin

End-to-end scene text spotting has attracted great attention in recent years due to the success of excavating the intrinsic synergy of the scene text detection and recognition.

Ranked #3 on Text Spotting on Inverse-Text

Scene Text Detection Text Detection +1

259

Paper
Code

ABCNet v2: Adaptive Bezier-Curve Network for Real-time End-to-end Text Spotting

1 code implementation • 8 May 2021 • Yuliang Liu, Chunhua Shen, Lianwen Jin, Tong He, Peng Chen, Chongyu Liu, Hao Chen

Previous methods can be roughly categorized into two groups: character-based and segmentation-based, which often require character-level annotations and/or complex post-processing due to the unstructured output.

Ranked #7 on Text Spotting on Inverse-Text

Text Spotting

Paper
Code

Towards Robust Visual Information Extraction in Real World: New Dataset and Novel Solution

1 code implementation • 24 Jan 2021 • Jiapeng Wang, Chongyu Liu, Lianwen Jin, Guozhi Tang, Jiaxin Zhang, Shuaitao Zhang, Qianying Wang, Yaqiang Wu, Mingxiang Cai

Visual information extraction (VIE) has attracted considerable attention recently owing to its various advanced applications such as document understanding, automatic marking and intelligent education.

3D Feature Matching document understanding +2

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.