no code implementations • 12 Mar 2024 • Harsh Lunia, Ajoy Mondal, C. V. Jawahar
Several benchmark datasets and substantial work on deep learning models are available for Latin languages to meet this need.
no code implementations • 17 Dec 2022 • Ajoy Mondal, Rohit Saluja, C. V. Jawahar
Service providers reward users who contribute data on which the OCR model fails, with the reward based on data complexity, readability, and available budget.
Handwritten Text Recognition • Optical Character Recognition (OCR)
1 code implementation • 15 Dec 2022 • Ajoy Mondal, C. V. Jawahar
We use a semantic module in an encoder-decoder framework to extract global semantic information for recognizing Indic handwritten text.
no code implementations • 21 Jan 2022 • Jobin K. V., Ajoy Mondal, C. V. Jawahar
With this information, we build a Classroom Slide Narration System (CSNS) to help visually impaired (VI) students understand the slide content.
no code implementations • 13 Nov 2021 • Ajoy Mondal
In this article, we propose three new auxiliary performance measures, based on ground truth information, to evaluate the quality of a tracking algorithm in such complex environments.
no code implementations • 13 Nov 2021 • Sachin Raja, Ajoy Mondal, C. V. Jawahar
Tables in unstructured business documents are tough to parse due to the high diversity of layouts, varying alignments of contents, and the presence of empty cells.
no code implementations • 13 Nov 2021 • Rajdeep Das, Ajoy Mondal, Tapan Chakraborty, Kuntal Ghosh
The microscopic images of sandstone contain many mineral grains and their surrounding matrix/cement.
no code implementations • 25 Dec 2020 • Ajoy Mondal
In this article, we review existing computer vision techniques for camouflaged object detection and tracking from a theoretical point of view.
1 code implementation • ECCV 2020 • Sachin Raja, Ajoy Mondal, C. V. Jawahar
We present an approach for table structure recognition that combines cell detection and interaction modules to localize the cells and predict their row and column associations with other detected cells.
Ranked #9 on Table Recognition on PubTabNet
1 code implementation • 25 Aug 2020 • Ranajit Saha, Ajoy Mondal, C. V. Jawahar
Graphical elements, particularly tables and figures, contain a visual summary of the most valuable information in a document.
3 code implementations • 25 Aug 2020 • Madhav Agarwal, Ajoy Mondal, C. V. Jawahar
Localizing page elements/objects such as tables, figures, equations, etc.
Ranked #1 on Table Detection on ICDAR2013
1 code implementation • 7 Aug 2020 • Ajoy Mondal, C. V. Jawahar
Reading mathematical expressions or equations in document images is very challenging due to the large variability of mathematical symbols and expressions.
no code implementations • 6 Aug 2020 • Ajoy Mondal, Peter Lipps, C. V. Jawahar
This dataset, IIIT-AR-13k, is created by manually annotating the bounding boxes of graphical or page objects in publicly available annual reports.
no code implementations • 1 Aug 2020 • Ajoy Mondal, Kuntal Ghosh
In this article, we review the existing fuzzy active contour models from a theoretical point of view and evaluate them experimentally on a large set of images under various conditions.