Search Results for author: Charangan Vasantharajan

Found 5 papers, 2 papers with code

Tamizhi-Net OCR: Creating A Quality Large Scale Tamil-Sinhala-English Parallel Corpus Using Deep Learning Based Printed Character Recognition (PCR)

1 code implementation13 Sep 2021 Charangan Vasantharajan, Uthayasanker Thayasivam

It is shown that this approach can boost the character-level accuracy of Tesseract 4. 1. 1 from 85. 5 to 98. 2 for Tamil (+12. 9% relative change) and 91. 8 to 94. 8 for Sinhala (+3. 26% relative change) on a dataset that is considered as challenging by its authors.

Optical Character Recognition

Cannot find the paper you are looking for? You can Submit a new open access paper.