Search Results for author: Pengyuan Lyu

Found 15 papers, 7 papers with code

Robust Scene Text Recognition with Automatic Rectification

5 code implementations • CVPR 2016 • Baoguang Shi, Xinggang Wang, Pengyuan Lyu, Cong Yao, Xiang Bai

We show that the model is able to recognize several types of irregular text, including perspective text and curved text.

Ranked #10 on Scene Text Recognition on ICDAR 2003

Optical Character Recognition (OCR) Scene Text Detection +1

38,291

Paper
Code

Integrating Scene Text and Visual Appearance for Fine-Grained Image Classification

no code implementations • 15 Apr 2017 • Xiang Bai, Mingkun Yang, Pengyuan Lyu, Yongchao Xu, Jiebo Luo

Then, we combine the word embedding of the recognized words and the deep visual features into a single representation, which is optimized by a convolutional neural network for fine-grained image classification.

Classification Fine-Grained Image Classification +2

Paper
Add Code

Auto-Encoder Guided GAN for Chinese Calligraphy Synthesis

no code implementations • 27 Jun 2017 • Pengyuan Lyu, Xiang Bai, Cong Yao, Zhen Zhu, Tengteng Huang, Wenyu Liu

In this paper, we investigate the Chinese calligraphy synthesis problem: synthesizing Chinese calligraphy images with specified style from standard font(eg.

Image-to-Image Translation Translation

Paper
Add Code

Multi-Oriented Scene Text Detection via Corner Localization and Region Segmentation

1 code implementation • CVPR 2018 • Pengyuan Lyu, Cong Yao, Wenhao Wu, Shuicheng Yan, Xiang Bai

We propose to detect scene text by localizing corner points of text bounding boxes and segmenting text regions in relative positions.

Ranked #2 on Scene Text Detection on ICDAR 2017 MLT

Multi-Oriented Scene Text Detection object-detection +2

315

Paper
Code

ASTER: An Attentional Scene Text Recognizer with Flexible Rectification

3 code implementations • good 2018 • Baoguang Shi, Mingkun Yang, Xinggang Wang, Pengyuan Lyu, Cong Yao, and Xiang Bai

SCENE text recognition has attracted great interest from the academia and the industry in recent years owing to its importance in a wide range of applications.

Ranked #21 on Scene Text Recognition on ICDAR2015

Optical Character Recognition Optical Character Recognition (OCR) +1

714

Paper
Code

Mask TextSpotter: An End-to-End Trainable Neural Network for Spotting Text with Arbitrary Shapes

1 code implementation • ECCV 2018 • Pengyuan Lyu, Minghui Liao, Cong Yao, Wenhao Wu, Xiang Bai

Recently, models based on deep neural networks have dominated the fields of scene text detection and recognition.

Ranked #3 on Scene Text Detection on ICDAR 2013

Scene Text Detection Semantic Segmentation +2

261

Paper
Code

Scene Text Recognition from Two-Dimensional Perspective

no code implementations • 18 Sep 2018 • Minghui Liao, Jian Zhang, Zhaoyi Wan, Fengming Xie, Jiajun Liang, Pengyuan Lyu, Cong Yao, Xiang Bai

Inspired by speech recognition, recent state-of-the-art algorithms mostly consider scene text recognition as a sequence prediction problem.

Ranked #30 on Scene Text Recognition on SVT

Scene Text Recognition Semantic Segmentation +4

Paper
Add Code

2D Attentional Irregular Scene Text Recognizer

no code implementations • 13 Jun 2019 • Pengyuan Lyu, Zhicheng Yang, Xinhang Leng, Xiao-Jun Wu, Ruiyu Li, Xiaoyong Shen

Irregular scene text, which has complex layout in 2D space, is challenging to most previous scene text recognizers.

Paper
Add Code

Mask TextSpotter: An End-to-End Trainable Neural Network for Spotting Text with Arbitrary Shapes

1 code implementation • ECCV 2018 • Minghui Liao, Pengyuan Lyu, Minghang He, Cong Yao, Wenhao Wu, Xiang Bai

Moreover, we further investigate the recognition module of our method separately, which significantly outperforms state-of-the-art methods on both regular and irregular text datasets for scene text recognition.

Scene Text Recognition Semantic Segmentation +2

413

Paper
Code

PGNet: Real-time Arbitrarily-Shaped Text Spotting with Point Gathering Network

2 code implementations • 12 Apr 2021 • Pengfei Wang, Chengquan Zhang, Fei Qi, Shanshan Liu, Xiaoqiang Zhang, Pengyuan Lyu, Junyu Han, Jingtuo Liu, Errui Ding, Guangming Shi

With a PG-CTC decoder, we gather high-level character classification vectors from two-dimensional space and decode them into text symbols without NMS and RoI operations involved, which guarantees high efficiency.

Ranked #1 on Scene Text Detection on ICDAR 2015 (Accuracy metric)

Optical Character Recognition (OCR) Scene Text Detection +1

38,291

Paper
Code

MaskOCR: Text Recognition with Masked Encoder-Decoder Pretraining

no code implementations • 1 Jun 2022 • Pengyuan Lyu, Chengquan Zhang, Shanshan Liu, Meina Qiao, Yangliu Xu, Liang Wu, Kun Yao, Junyu Han, Errui Ding, Jingdong Wang

Specifically, we transform text data into synthesized text images to unify the data modalities of vision and language, and enhance the language modeling capability of the sequence decoder using a proposed masked image-language modeling scheme.

Ranked #2 on Optical Character Recognition (OCR) on Benchmarking Chinese Text Recognition: Datasets, Baselines, and an Empirical Study

Language Modelling Optical Character Recognition (OCR) +1

Paper
Add Code

Single Shot Self-Reliant Scene Text Spotter by Decoupled yet Collaborative Detection and Recognition

1 code implementation • 15 Jul 2022 • Jingjing Wu, Pengyuan Lyu, Guangming Lu, Chengquan Zhang, Wenjie Pei

Typical text spotters follow the two-stage spotting paradigm which detects the boundary for a text instance first and then performs text recognition within the detected regions.

Ranked #5 on Text Spotting on ICDAR 2015

Text Detection Text Spotting

Paper
Code

ICDAR 2023 Competition on Structured Text Extraction from Visually-Rich Document Images

no code implementations • 5 Jun 2023 • Wenwen Yu, Chengquan Zhang, Haoyu Cao, Wei Hua, Bohan Li, Huang Chen, MingYu Liu, Mingrui Chen, Jianfeng Kuang, Mengjun Cheng, Yuning Du, Shikun Feng, Xiaoguang Hu, Pengyuan Lyu, Kun Yao, Yuechen Yu, Yuliang Liu, Wanxiang Che, Errui Ding, Cheng-Lin Liu, Jiebo Luo, Shuicheng Yan, Min Zhang, Dimosthenis Karatzas, Xing Sun, Jingdong Wang, Xiang Bai

It is hoped that this competition will attract many researchers in the field of CV and NLP, and bring some new thoughts to the field of Document AI.

Document AI Entity Linking +1

Paper
Add Code

Towards Robust Real-Time Scene Text Detection: From Semantic to Instance Representation Learning

no code implementations • 14 Aug 2023 • Xugong Qin, Pengyuan Lyu, Chengquan Zhang, Yu Zhou, Kun Yao, Peng Zhang, Hailun Lin, Weiping Wang

Different from existing methods which integrate multiple-granularity features or multiple outputs, we resort to the perspective of representation learning in which auxiliary tasks are utilized to enable the encoder to jointly learn robust features with the main task of per-pixel classification during optimization.

Representation Learning Scene Text Detection +1

Paper
Add Code

GridFormer: Towards Accurate Table Structure Recognition via Grid Prediction

no code implementations • 26 Sep 2023 • Pengyuan Lyu, Weihong Ma, Hongyi Wang, Yuechen Yu, Chengquan Zhang, Kun Yao, Yang Xue, Jingdong Wang

In this representation, the vertexes and edges of the grid store the localization and adjacency information of the table.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.