Search Results for author: Michal Hradiš

Found 10 papers, 3 papers with code

OCR, Classification & Machine Translation (OCCAM)

no code implementations EAMT 2020 Joachim Van den Bogaert, Arne Defauw, Frederic Everaert, Koen Van Winckel, Alina Kramchaninova, Anna Bardadym, Tom Vanallemeersch, Pavel Smrž, Michal Hradiš

The OCCAM project (Optical Character recognition, ClassificAtion & Machine Translation) aims at integrating the CEF (Connecting Europe Facility) Automated Translation service with image classification, Translation Memories (TMs), Optical Character Recognition (OCR), and Machine Translation (MT).

Classification Image Classification +4

Towards Writing Style Adaptation in Handwriting Recognition

no code implementations13 Feb 2023 Jan Kohút, Michal Hradiš, Martin Kišš

We experimented with various placements and settings of WSB and contrastively pre-trained embeddings.

Domain Adaptation Handwriting Recognition

Importance of Textlines in Historical Document Classification

no code implementations24 Jan 2022 Martin Kišš, Jan Kohút, Karel Beneš, Michal Hradiš

The line-level system significantly improves results in script and font classification and in the dating task.

Classification Document Classification +1

AT-ST: Self-Training Adaptation Strategy for OCR in Domains with Limited Transcriptions

1 code implementation27 Apr 2021 Martin Kišš, Karel Beneš, Michal Hradiš

This paper addresses text recognition for domains with limited manual annotations by a simple self-training strategy.

Optical Character Recognition (OCR)

TS-Net: OCR Trained to Switch Between Text Transcription Styles

no code implementations9 Mar 2021 Jan Kohút, Michal Hradiš

Users of OCR systems, from different institutions and scientific disciplines, prefer and produce different transcription styles.

Optical Character Recognition (OCR)

Page Layout Analysis System for Unconstrained Historic Documents

no code implementations23 Feb 2021 Oldřich Kodym, Michal Hradiš

Extraction of text regions and individual text lines from historic documents is necessary for automatic transcription.

Brno Mobile OCR Dataset

1 code implementation2 Jul 2019 Martin Kišš, Michal Hradiš, Oldřich Kodym

We introduce the Brno Mobile OCR Dataset (B-MOD) for document Optical Character Recognition from low-quality images captured by handheld mobile devices.

Binarization Denoising +3

Cannot find the paper you are looking for? You can Submit a new open access paper.