Character Region Awareness for Text Detection

CVPR 2019  ยท  Youngmin Baek, Bado Lee, Dongyoon Han, Sangdoo Yun, Hwalsuk Lee ยท

Scene text detection methods based on neural networks have emerged recently and have shown promising results. Previous methods trained with rigid word-level bounding boxes exhibit limitations in representing the text region in an arbitrary shape. In this paper, we propose a new scene text detection method to effectively detect text area by exploring each character and affinity between characters. To overcome the lack of individual character level annotations, our proposed framework exploits both the given character-level annotations for synthetic images and the estimated character-level ground-truths for real images acquired by the learned interim model. In order to estimate affinity between characters, the network is trained with the newly proposed representation for affinity. Extensive experiments on six benchmarks, including the TotalText and CTW-1500 datasets which contain highly curved texts in natural images, demonstrate that our character-level text detection significantly outperforms the state-of-the-art detectors. According to the results, our proposed method guarantees high flexibility in detecting complicated scene text images, such as arbitrarily-oriented, curved, or deformed texts.

PDF Abstract CVPR 2019 PDF CVPR 2019 Abstract

Results from the Paper


 Ranked #1 on Scene Text Detection on ICDAR 2013 (Precision metric)

     Get a GitHub badge
Task Dataset Model Metric Name Metric Value Global Rank Result Benchmark
Scene Text Detection ICDAR 2013 CRAFT Precision 97.4 # 1
Recall 93.1 # 1
H-Mean 95.2 # 1
Scene Text Detection ICDAR 2015 CRAFT F-Measure 86.9 # 19
Precision 89.8 # 17
Recall 84.3 # 18
Scene Text Detection ICDAR 2017 MLT CRAFT Precision 80.6 # 7
Recall 68.2 # 11
H-Mean 73.9 # 1
Scene Text Detection MSRA-TD500 CRAFT Recall 78.2 # 10
Precision 88.2 # 8
F-Measure 82.9 # 10
Scene Text Detection SCUT-CTW1500 CRAFT F-Measure 83.5 # 9
Precision 86 # 9
Recall 81.1 # 9
Scene Text Detection Total-Text CRAFT F-Measure 83.6% # 17
Precision 87.6 # 14
Recall 79.9 # 15

Methods


No methods listed for this paper. Add relevant methods here