These factors make traditional data-driven detection models unsuitable for diagrams.
Through key-value matching based on relevancy evaluation, the proposed MatchVIE bypasses the recognition of various semantics and focuses simply on the strong relevancy between entities.
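As a rough illustration of key-value matching driven by relevancy scores, the sketch below pairs each key entity with at most one value entity by greedily taking the highest-scoring pairs. It is not MatchVIE's implementation: in the paper the relevancy is produced by a learned model, whereas here the scores are hand-written, and the function name `match_keys_to_values`, the `threshold` parameter, and the greedy one-to-one strategy are all assumptions made for the example.

```python
from typing import Dict, List


def match_keys_to_values(
    relevancy: List[List[float]], threshold: float = 0.5
) -> Dict[int, int]:
    """Greedily pair keys (rows) with values (columns) by relevancy score.

    Hypothetical helper: `relevancy[k][v]` is an assumed, precomputed score
    in [0, 1] for key entity k and value entity v.
    """
    # Collect every (score, key, value) candidate at or above the threshold.
    candidates = [
        (score, k, v)
        for k, row in enumerate(relevancy)
        for v, score in enumerate(row)
        if score >= threshold
    ]
    # Highest score first; ties broken by key index, then value index.
    candidates.sort(key=lambda t: (-t[0], t[1], t[2]))

    matches: Dict[int, int] = {}
    used_values = set()
    for _score, k, v in candidates:
        # Skip pairs whose key or value is already matched.
        if k in matches or v in used_values:
            continue
        matches[k] = v
        used_values.add(v)
    return matches


# Toy scores: 3 key entities (rows) against 3 value entities (columns).
relevancy = [
    [0.9, 0.2, 0.1],
    [0.3, 0.8, 0.4],
    [0.1, 0.3, 0.2],
]
pairs = match_keys_to_values(relevancy)
print(pairs)
```

In this toy input the third key has no score at or above the threshold, so it stays unmatched. No entity is assigned a semantic class at any point; only pairwise relevancy is used.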
To build a robust point detector, a fully convolutional network with a feature fusion module is adopted, which, unlike traditional methods, can distinguish closely spaced points.
Visual information extraction (VIE) has attracted considerable attention recently owing to its various advanced applications such as document understanding, automatic marking and intelligent education.
To remedy this issue, we propose a decoupled attention network (DAN), which decouples the alignment operation from the use of historical decoding results.
Ranked #4 on Scene Text Recognition on ICDAR 2003
Scene text in the wild commonly exhibits highly variable characteristics.
Ranked #1 on Scene Text Detection on IC19-ReCTs (using extra training data)