StackMix and Blot Augmentations for Handwritten Text Recognition

26 Aug 2021  ยท  Alex Shonenkov, Denis Karachev, Maxim Novopoltsev, Mark Potanin, Denis Dimitrov ยท

This paper proposes a handwritten text recognition(HTR) system that outperforms current state-of-the-artmethods. The comparison was carried out on three of themost frequently used in HTR task datasets, namely Ben-tham, IAM, and Saint Gall. In addition, the results on tworecently presented datasets, Peter the Greats manuscriptsand HKR Dataset, are provided.The paper describes the architecture of the neural net-work and two ways of increasing the volume of train-ing data: augmentation that simulates strikethrough text(HandWritten Blots) and a new text generation method(StackMix), which proved to be very effective in HTR tasks.StackMix can also be applied to the standalone task of gen-erating handwritten text based on printed text.

PDF Abstract

Results from the Paper


Task Dataset Model Metric Name Metric Value Global Rank Result Benchmark
Handwritten Text Recognition Bentham StackMix+Blots CER 1.73 # 1
Handwritten Text Recognition Digital Peter StackMix+Blots CER 2.5 # 1
Handwritten Text Recognition HKR StackMix+Blots CER 3.49 # 1
Handwritten Text Recognition IAM-B StackMix+Blots CER 3.77 # 1
Handwritten Text Recognition IAM-D StackMix+Blots CER 3.01 # 1
Handwritten Text Recognition Saint Gall StackMix+Blots CER 3.65 # 1

Methods


No methods listed for this paper. Add relevant methods here