UTRNet: High-Resolution Urdu Text Recognition In Printed Documents

27 Jun 2023  Â·  Abdur Rahman, Arjun Ghosh, Chetan Arora ·

In this paper, we propose a novel approach to address the challenges of printed Urdu text recognition using high-resolution, multi-scale semantic feature extraction. Our proposed UTRNet architecture, a hybrid CNN-RNN model, demonstrates state-of-the-art performance on benchmark datasets. To address the limitations of previous works, which struggle to generalize to the intricacies of the Urdu script and the lack of sufficient annotated real-world data, we have introduced the UTRSet-Real, a large-scale annotated real-world dataset comprising over 11,000 lines and UTRSet-Synth, a synthetic dataset with 20,000 lines closely resembling real-world and made corrections to the ground truth of the existing IIITH dataset, making it a more reliable resource for future research. We also provide UrduDoc, a benchmark dataset for Urdu text line detection in scanned documents. Additionally, we have developed an online tool for end-to-end Urdu OCR from printed documents by integrating UTRNet with a text detection model. Our work not only addresses the current limitations of Urdu OCR but also paves the way for future research in this area and facilitates the continued advancement of Urdu OCR technology. The project page with source code, datasets, annotations, trained models, and online tool is available at abdur75648.github.io/UTRNet.

PDF Abstract

Datasets


Results from the Paper


Task Dataset Model Metric Name Metric Value Global Rank Result Benchmark
Printed Text Recognition IIITH-Urdu VisionLAN [68] Character Accuracy Rate (CAR) 27.82 # 21
Printed Text Recognition IIITH-Urdu ResNet-BiLSTM [31] Character Accuracy Rate (CAR) 84.18 # 6
Printed Text Recognition IIITH-Urdu CDistNet [74] Character Accuracy Rate (CAR) 34.96 # 19
Printed Text Recognition IIITH-Urdu R2AM [39] Character Accuracy Rate (CAR) 81.39 # 14
Printed Text Recognition IIITH-Urdu ABINet [19] Character Accuracy Rate (CAR) 40.2 # 17
Printed Text Recognition IIITH-Urdu VGG-DBiLSTM-Attn [12] Character Accuracy Rate (CAR) 82.72 # 8
Printed Text Recognition IIITH-Urdu TrOCR [40] Character Accuracy Rate (CAR) 36.1 # 18
Printed Text Recognition IIITH-Urdu UTRNet-Small Character Accuracy Rate (CAR) 86.35 # 2
Printed Text Recognition IIITH-Urdu ViTSTR [7] Character Accuracy Rate (CAR) 32.63 # 20
Printed Text Recognition IIITH-Urdu VGG-LSTM-Attn [62] Character Accuracy Rate (CAR) 82.21 # 9
Printed Text Recognition IIITH-Urdu Parseq [10] Character Accuracy Rate (CAR) 25.6 # 22
Printed Text Recognition IIITH-Urdu GRCNN [65] Character Accuracy Rate (CAR) 81.09 # 16
Printed Text Recognition IIITH-Urdu TRBA [8] Character Accuracy Rate (CAR) 85.61 # 3
Printed Text Recognition IIITH-Urdu VGG-MDLSTM [45, 44] Character Accuracy Rate (CAR) 81.72 # 11
Printed Text Recognition IIITH-Urdu STAR-Net [42] Character Accuracy Rate (CAR) 84.27 # 5
Printed Text Recognition IIITH-Urdu UTRNet Character Accuracy Rate (CAR) 88.01 # 1
Printed Text Recognition IIITH-Urdu RARE [59] Character Accuracy Rate (CAR) 83.59 # 7
Printed Text Recognition IIITH-Urdu VGG-BiLSTM [30] Character Accuracy Rate (CAR) 81.45 # 12
Printed Text Recognition IIITH-Urdu Rosetta [11] Character Accuracy Rate (CAR) 81.94 # 10
Printed Text Recognition IIITH-Urdu DenseNet-GRU-Attn [6] Character Accuracy Rate (CAR) 85.32 # 4
Printed Text Recognition IIITH-Urdu 5LayerCNN-DBiLSTM [20] Character Accuracy Rate (CAR) 81.15 # 15
Printed Text Recognition IIITH-Urdu CRNN [58] Character Accuracy Rate (CAR) 81.45 # 12
Printed Text Recognition UPTI GRCNN [65] Character Accuracy Rate (CAR) 92.28 # 8
Printed Text Recognition UPTI UTRNet-Small Character Accuracy Rate (CAR) 95.08 # 2
Printed Text Recognition UPTI DenseNet-GRU-Attn [6] Character Accuracy Rate (CAR) 94.63 # 3
Printed Text Recognition UPTI ResNet-BiLSTM [31] Character Accuracy Rate (CAR) 93.61 # 5
Printed Text Recognition UPTI VGG-DBiLSTM-Attn [12] Character Accuracy Rate (CAR) 92.01 # 11
Printed Text Recognition UPTI VGG-LSTM-Attn [62] Character Accuracy Rate (CAR) 91.88 # 12
Printed Text Recognition UPTI VGG-MDLSTM [45, 44] Character Accuracy Rate (CAR) 91.17 # 14
Printed Text Recognition UPTI VGG-BiLSTM [30] Character Accuracy Rate (CAR) 91.49 # 13
Printed Text Recognition UPTI 5LayerCNN-DBiLSTM [20] Character Accuracy Rate (CAR) 90.67 # 15
Printed Text Recognition UPTI VisionLAN [68] Character Accuracy Rate (CAR) 29.07 # 20
Printed Text Recognition UPTI CDistNet [74] Character Accuracy Rate (CAR) 32.48 # 19
Printed Text Recognition UPTI ABINet [19] Character Accuracy Rate (CAR) 38.96 # 16
Printed Text Recognition UPTI TrOCR [40] Character Accuracy Rate (CAR) 37.61 # 17
Printed Text Recognition UPTI ViTSTR [7] Character Accuracy Rate (CAR) 35.78 # 18
Printed Text Recognition UPTI Parseq [10] Character Accuracy Rate (CAR) 26.41 # 21
Printed Text Recognition UPTI TRBA [8] Character Accuracy Rate (CAR) 94.16 # 4
Printed Text Recognition UPTI STAR-Net [42] Character Accuracy Rate (CAR) 93.59 # 6
Printed Text Recognition UPTI RARE [59] Character Accuracy Rate (CAR) 92.74 # 7
Printed Text Recognition UPTI Rosetta [11] Character Accuracy Rate (CAR) 92.15 # 9
Printed Text Recognition UPTI UTRNet Character Accuracy Rate (CAR) 95.97 # 1
Printed Text Recognition UPTI R2AM [39] Character Accuracy Rate (CAR) 92.07 # 10
Text Detection UrduDoc EAST Precision 70.43 # 5
Text Detection UrduDoc ContourNet [69] Precision 86.99 # 1
Recall 88.68 # 1
Text Detection UrduDoc DRRG [72] Precision 83.87 # 2
Recall 84.72 # 2
Text Detection UrduDoc PSENet [67] Precision 78.11 # 3
Recall 77.91 # 3
Text Detection UrduDoc EAST [75] Precision 71.48 # 4
Recall 72.56 # 4
Printed Text Recognition UTRSet-Real 5LayerCNN-DBiLSTM [20] Character Accuracy Rate (CAR) 82.92 # 13
Printed Text Recognition UTRSet-Real VisionLAN [68] Character Accuracy Rate (CAR) 28.4 # 18
Printed Text Recognition UTRSet-Real CDistNet [74] Character Accuracy Rate (CAR) 33.72 # 17
Printed Text Recognition UTRSet-Real ABINet [19] Character Accuracy Rate (CAR) 41.17 # 14
Printed Text Recognition UTRSet-Real TrOCR [40] Character Accuracy Rate (CAR) 38.43 # 15
Printed Text Recognition UTRSet-Real ViTSTR [7] Character Accuracy Rate (CAR) 34.86 # 16
Printed Text Recognition UTRSet-Real Parseq [10] Character Accuracy Rate (CAR) 26.13 # 19
Printed Text Recognition UTRSet-Real TRBA [8] Character Accuracy Rate (CAR) 88.92 # 4
Printed Text Recognition UTRSet-Real STAR-Net [42] Character Accuracy Rate (CAR) 87.05 # 5
Printed Text Recognition UTRSet-Real RARE [59] Character Accuracy Rate (CAR) 85.63 # 7
Printed Text Recognition UTRSet-Real GRCNN [65] Character Accuracy Rate (CAR) 84.21 # 9
Printed Text Recognition UTRSet-Real CRNN [58] Character Accuracy Rate (CAR) 83.11 # 12
Printed Text Recognition UTRSet-Real R2AM Character Accuracy Rate (CAR) 84.12 # 10
Printed Text Recognition UTRSet-Real UTRNet Character Accuracy Rate (CAR) 92.97 # 1
Printed Text Recognition UTRSet-Real UTRNet-Small Character Accuracy Rate (CAR) 90.87 # 3
Printed Text Recognition UTRSet-Real DenseNet-GRU-Attn [6] Character Accuracy Rate (CAR) 91.1 # 2
Printed Text Recognition UTRSet-Real ResNet-BiLSTM [31] Character Accuracy Rate (CAR) 86.96 # 6
Printed Text Recognition UTRSet-Real VGG-DBiLSTM-Attn [12] Character Accuracy Rate (CAR) 84.58 # 8
Printed Text Recognition UTRSet-Real VGG-MDLSTM [45, 44] Character Accuracy Rate (CAR) 83.3 # 11

Methods


No methods listed for this paper. Add relevant methods here