Search Results for author: Umapada Pal

Found 73 papers, 28 papers with code

SVGCraft: Beyond Single Object Text-to-SVG Synthesis with Comprehensive Canvas Layout

no code implementations • 30 Mar 2024 • Ayan Banerjee, Nityanand Mathur, Josep Lladós, Umapada Pal, Anjan Dutta

In response, this work introduces SVGCraft, a novel end-to-end framework for the creation of vector graphics depicting entire scenes from textual descriptions.

Vector Graphics

Paper
Add Code

GraphKD: Exploring Knowledge Distillation Towards Document Object Detection with Structured Graph Creation

1 code implementation • 17 Feb 2024 • Ayan Banerjee, Sanket Biswas, Josep Lladós, Umapada Pal

Object detection in documents is a key step to automate the structural elements identification process in a digital or scanned document through understanding the hierarchical structure and relationships between different elements.

Knowledge Distillation object-detection +1

Paper
Code

Static and Dynamic Synthesis of Bengali and Devanagari Signatures

no code implementations • 30 Jan 2024 • Miguel A. Ferrer, Sukalpa Chanda, Moises Diaz, Chayan Kr. Banerjee, Anirban Majumdar, Cristina Carmona-Duarte, Parikshit Acharya, Umapada Pal

This paper aims to adapt this scheme for the generation of synthetic signatures in two Indic scripts, Bengali (Bangla), and Devanagari (Hindi).

Handwriting generation

Paper
Add Code

A Layer-Wise Tokens-to-Token Transformer Network for Improved Historical Document Image Enhancement

1 code implementation • 6 Dec 2023 • Risab Biswas, Swalpa Kumar Roy, Umapada Pal

Instead of using a simple ViT and hard splitting of images for the document image enhancement task, we employed a progressive tokenization technique to capture this local information from an image to achieve more effective results.

Binarization Image Enhancement

Paper
Code

DocBinFormer: A Two-Level Transformer Network for Effective Document Image Binarization

no code implementations • 6 Dec 2023 • Risab Biswas, Swalpa Kumar Roy, Ning Wang, Umapada Pal, Guang-Bin Huang

Instead of using a simple vision transformer block to extract information from the image patches, the proposed architecture uses two transformer blocks for greater coverage of the extracted feature space on a global and local scale.

Binarization

Paper
Add Code

Harnessing the Power of Multi-Lingual Datasets for Pre-training: Towards Enhancing Text Spotting Performance

1 code implementation • 2 Oct 2023 • Alloy Das, Sanket Biswas, Ayan Banerjee, Josep Lladós, Umapada Pal, Saumik Bhattacharya

The adaptation capability to a wide range of domains is crucial for scene text spotting models when deployed to real-world conditions.

Scene Text Detection Text Detection +1

Paper
Code

Diving into the Depths of Spotting Text in Multi-Domain Noisy Scenes

no code implementations • 1 Oct 2023 • Alloy Das, Sanket Biswas, Umapada Pal, Josep Lladós

When used in a real-world noisy environment, the capacity to generalize to multiple domains is essential for any autonomous scene text spotting system.

Super-Resolution Text Spotting

Paper
Add Code

FAST: Font-Agnostic Scene Text Editing

no code implementations • 5 Aug 2023 • Alloy Das, Prasun Roy, Saumik Bhattacharya, Subhankar Ghosh, Umapada Pal, Michael Blumenstein

However, most of the existing STE methods show inferior editing performance because of (1) complex image backgrounds, (2) various font styles, and (3) varying word lengths within the text.

Scene Text Editing Style Transfer +1

Paper
Add Code

DySTreSS: Dynamically Scaled Temperature in Self-Supervised Contrastive Learning

no code implementations • 2 Aug 2023 • Siladittya Manna, Soumitri Chattopadhyay, Rakesh Dey, Saumik Bhattacharya, Umapada Pal

We propose a cosine similarity-dependent temperature scaling function to effectively optimize the distribution of the samples in the feature space.

Contrastive Learning

Paper
Add Code

Scene Text Recognition with Image-Text Matching-guided Dictionary

no code implementations • 8 May 2023 • Jiajun Wei, Hongjian Zhan, Xiao Tu, Yue Lu, Umapada Pal

Inspired by ITC, the SITM network combines the visual features and the text features of all candidates to identify the candidate with the minimum distance in the feature space.

Image-text matching Language Modelling +2

Paper
Add Code

SwinDocSegmenter: An End-to-End Unified Domain Adaptive Transformer for Document Instance Segmentation

1 code implementation • 8 May 2023 • Ayan Banerjee, Sanket Biswas, Josep Lladós, Umapada Pal

Instance-level segmentation of documents consists in assigning a class-aware and instance-aware label to each pixel of the image.

Instance Segmentation Segmentation +1

Paper
Code

SelfDocSeg: A Self-Supervised vision-based Approach towards Document Segmentation

1 code implementation • 1 May 2023 • Subhajit Maity, Sanket Biswas, Siladittya Manna, Ayan Banerjee, Josep Lladós, Saumik Bhattacharya, Umapada Pal

Document layout analysis is a known problem to the documents research community and has been vastly explored yielding a multitude of solutions ranging from text mining, and recognition to graph-based representation, visual feature extraction, etc.

Document Layout Analysis object-detection +1

Paper
Code

MMC: Multi-Modal Colorization of Images using Textual Descriptions

no code implementations • 24 Apr 2023 • Subhankar Ghosh, Saumik Bhattacharya, Prasun Roy, Umapada Pal, Michael Blumenstein

Handling various objects with different colors is a significant challenge for image colorization techniques.

Colorization Image Colorization +1

Paper
Add Code

ICDAR 2023 Video Text Reading Competition for Dense and Small Text

no code implementations • 10 Apr 2023 • Weijia Wu, Yuzhong Zhao, Zhuang Li, Jiahong Li, Mike Zheng Shou, Umapada Pal, Dimosthenis Karatzas, Xiang Bai

In this competition report, we establish a video text reading benchmark, DSText, which focuses on dense and small text reading challenges in the video with various scenarios.

Task 2 Text Detection +2

Paper
Add Code

A CNN Based Framework for Unistroke Numeral Recognition in Air-Writing

1 code implementation • 14 Mar 2023 • Prasun Roy, Subhankar Ghosh, Umapada Pal

Air-writing refers to virtually writing linguistic characters through hand gestures in three-dimensional space with six degrees of freedom.

Segmentation Transfer Learning

Paper
Code

Global Context-Aware Person Image Generation

no code implementations • 28 Feb 2023 • Prasun Roy, Saumik Bhattacharya, Subhankar Ghosh, Umapada Pal, Michael Blumenstein

The proposed strategy enables us to synthesize semantically coherent realistic persons that can blend into an existing scene without altering the global context.

Image Generation

Paper
Add Code

Effective Document Image Enhancement Using tokens-to-token Transformer Network

1 code implementation • Preprint 2023 • Risab Biswas, Swalpa Kumar Roy, Umapada Pal

Instead of using a simple ViT and hard splitting of images for the document image enhancement task, we employed a progressive tokeniza-tion technique to capture this local information from an image for achieving more effective results.

Ranked #1 on Binarization on H-DIBCO 2012

Binarization Image Enhancement

Paper
Code

TIC: Text-Guided Image Colorization

no code implementations • 4 Aug 2022 • Subhankar Ghosh, Prasun Roy, Saumik Bhattacharya, Umapada Pal, Michael Blumenstein

Image colorization is a well-known problem in computer vision.

Colorization Image Colorization

Paper
Add Code

TIPS: Text-Induced Pose Synthesis

no code implementations • 24 Jul 2022 • Prasun Roy, Subhankar Ghosh, Saumik Bhattacharya, Umapada Pal, Michael Blumenstein

In computer vision, human pose synthesis and transfer deal with probabilistic image generation of a person in a previously unseen pose from an already available observation of that person.

Descriptive Pose Transfer

Paper
Add Code

SGBANet: Semantic GAN and Balanced Attention Network for Arbitrarily Oriented Scene Text Recognition

no code implementations • 21 Jul 2022 • Dajian Zhong, Shujing Lyu, Palaiahnakote Shivakumara, Bing Yin, Jiajia Wu, Umapada Pal, Yue Lu

For target images (scene text images), the Semantic Generator Module generates simple semantic features that share the same feature distribution with support images (clear text images).

Image-to-Image Translation Scene Text Recognition

Paper
Add Code

Scene Aware Person Image Generation through Global Contextual Conditioning

no code implementations • 6 Jun 2022 • Prasun Roy, Subhankar Ghosh, Saumik Bhattacharya, Umapada Pal, Michael Blumenstein

Finally, the target image is generated from the refined skeleton using another generative network conditioned on a given image of the target person.

Generative Adversarial Network Image Generation

Paper
Add Code

SWIS: Self-Supervised Representation Learning For Writer Independent Offline Signature Verification

no code implementations • 26 Feb 2022 • Siladittya Manna, Soumitri Chattopadhyay, Saumik Bhattacharya, Umapada Pal

Writer independent offline signature verification is one of the most challenging tasks in pattern recognition as there is often a scarcity of training data.

Representation Learning Self-Supervised Learning

Paper
Add Code

Multi-scale Attention Guided Pose Transfer

1 code implementation • 14 Feb 2022 • Prasun Roy, Saumik Bhattacharya, Subhankar Ghosh, Umapada Pal

Pose transfer refers to the probabilistic image generation of a person with a previously unseen novel pose from another image of that person having a different pose.

Pose Transfer

Paper
Code

DocSegTr: An Instance-Level End-to-End Document Image Segmentation Transformer

1 code implementation • 27 Jan 2022 • Sanket Biswas, Ayan Banerjee, Josep Lladós, Umapada Pal

has emerged as an interesting problem for the document analysis and understanding community.

Decision Making Document Layout Analysis +4

Paper
Code

SURDS: Self-Supervised Attention-guided Reconstruction and Dual Triplet Loss for Writer Independent Offline Signature Verification

1 code implementation • 25 Jan 2022 • Soumitri Chattopadhyay, Siladittya Manna, Saumik Bhattacharya, Umapada Pal

This results in robust discriminative learning of the embedding space.

Image Reconstruction Metric Learning +2

Paper
Code

DocEnTr: An End-to-End Document Image Enhancement Transformer

1 code implementation • 25 Jan 2022 • Mohamed Ali Souibgui, Sanket Biswas, Sana Khamekhem Jemni, Yousri Kessentini, Alicia Fornés, Josep Lladós, Umapada Pal

Document images can be affected by many degradation scenarios, which cause recognition and processing difficulties.

Ranked #1 on Binarization on H-DIBCO 2011

Binarization Image Enhancement

130

Paper
Code

MIO : Mutual Information Optimization using Self-Supervised Binary Contrastive Learning

no code implementations • 24 Nov 2021 • Siladittya Manna, Umapada Pal, Saumik Bhattacharya

After 200 epochs of pre-training with ResNet-18 as the backbone, the proposed model achieves an accuracy of 86. 2\%, 58. 18\%, 77. 49\%, and 30. 87\% on CIFAR-10, CIFAR-100, STL-10, and Tiny-ImageNet datasets, respectively, and surpasses the SOTA contrastive baseline by 1. 23\%, 3. 57\%, 2. 00\%, and 0. 33\%, respectively.

Binary Classification Contrastive Learning

Paper
Add Code

GMSRF-Net: An improved generalizability with global multi-scale residual fusion network for polyp segmentation

1 code implementation • 20 Nov 2021 • Abhishek Srivastava, Sukalpa Chanda, Debesh Jha, Umapada Pal, Sharib Ali

The repeated fusion operations gated by CMSA and MSFS demonstrate improved generalizability of the network.

Ranked #13 on Medical Image Segmentation on Kvasir-SEG

feature selection Medical Image Segmentation +1

Paper
Code

AGA-GAN: Attribute Guided Attention Generative Adversarial Network with U-Net for Face Hallucination

no code implementations • 20 Nov 2021 • Abhishek Srivastava, Sukalpa Chanda, Umapada Pal

The performance of facial super-resolution methods relies on their ability to recover facial structures and salient features effectively.

Attribute Face Hallucination +3

Paper
Add Code

Exploiting Multi-Scale Fusion, Spatial Attention and Patch Interaction Techniques for Text-Independent Writer Identification

1 code implementation • 20 Nov 2021 • Abhishek Srivastava, Sukalpa Chanda, Umapada Pal

Our methods are based on the hypothesis that handwritten text images have specific spatial regions which are more unique to a writer's style, multi-scale features propagate characteristic features with respect to individual writers and patch-based features give more general and robust representations that helps to discriminate handwriting from different writers.

Paper
Code

PAANet: Progressive Alternating Attention for Automatic Medical Image Segmentation

no code implementations • 20 Nov 2021 • Abhishek Srivastava, Sukalpa Chanda, Debesh Jha, Michael A. Riegler, Pål Halvorsen, Dag Johansen, Umapada Pal

We develop progressive alternating attention dense (PAAD) blocks, which construct a guiding attention map (GAM) after every convolutional layer in the dense blocks using features from all scales.

Decision Making Image Segmentation +3

Paper
Add Code

GradML: A Gradient-based Loss for Deep Metric Learning

no code implementations • NeurIPS Workshop ICBINB 2021 • Bhavya Vasudeva, Puneesh Deora, Saumik Bhattacharya, Umapada Pal, Sukalpa Chanda

Deep metric learning (ML) uses a carefully designed loss function to learn distance metrics for improving the discriminatory ability for tasks like clustering and retrieval.

Metric Learning Retrieval

Paper
Add Code

LoOp: Looking for Optimal Hard Negative Embeddings for Deep Metric Learning

1 code implementation • ICCV 2021 • Bhavya Vasudeva, Puneesh Deora, Saumik Bhattacharya, Umapada Pal, Sukalpa Chanda

Deep metric learning has been effectively used to learn distance metrics for different visual tasks like image retrieval, clustering, etc.

Image Retrieval Metric Learning +1

Paper
Code

Graph-based Deep Generative Modelling for Document Layout Generation

no code implementations • 9 Jul 2021 • Sanket Biswas, Pau Riba, Josep Lladós, Umapada Pal

One of the major prerequisites for any deep learning approach is the availability of large-scale training data.

Paper
Add Code

DocSynth: A Layout Guided Approach for Controllable Document Image Synthesis

1 code implementation • 6 Jul 2021 • Sanket Biswas, Pau Riba, Josep Lladós, Umapada Pal

The results highlight that our model can successfully generate realistic and diverse document images with multiple objects.

Document Layout Analysis Image Generation

Paper
Code

MSRF-Net: A Multi-Scale Residual Fusion Network for Biomedical Image Segmentation

1 code implementation • 16 May 2021 • Abhishek Srivastava, Debesh Jha, Sukalpa Chanda, Umapada Pal, Håvard D. Johansen, Dag Johansen, Michael A. Riegler, Sharib Ali, Pål Halvorsen

The proposed MSRF-Net allows to capture object variabilities and provides improved results on different biomedical datasets.

Ranked #3 on Medical Image Segmentation on 2018 Data Science Bowl

Image Segmentation Lesion Segmentation +3

Paper
Code

PLSM: A Parallelized Liquid State Machine for Unintentional Action Detection

1 code implementation • 6 May 2021 • Dipayan Das, Saumik Bhattacharya, Umapada Pal, Sukalpa Chanda

Reservoir Computing (RC) offers a viable option to deploy AI algorithms on low-end embedded system platforms.

Action Detection

Paper
Code

SKID: Self-Supervised Learning for Knee Injury Diagnosis from MRI Data

2 code implementations • 21 Apr 2021 • Siladittya Manna, Saumik Bhattacharya, Umapada Pal

The downstream task in our paper is a class imbalanced multi-label classification.

Ranked #2 on Multi-Label Classification on MRNet

Medical Diagnosis Medical Image Classification +2

Paper
Code

AIM 2020 Challenge on Learned Image Signal Processing Pipeline

1 code implementation • 10 Nov 2020 • Andrey Ignatov, Radu Timofte, Zhilu Zhang, Ming Liu, Haolin Wang, WangMeng Zuo, Jiawei Zhang, Ruimao Zhang, Zhanglin Peng, Sijie Ren, Linhui Dai, Xiaohong Liu, Chengqi Li, Jun Chen, Yuichi Ito, Bhavya Vasudeva, Puneesh Deora, Umapada Pal, Zhenyu Guo, Yu Zhu, Tian Liang, Chenghua Li, Cong Leng, Zhihong Pan, Baopu Li, Byung-Hoon Kim, Joonyoung Song, Jong Chul Ye, JaeHyun Baek, Magauiya Zhussip, Yeskendir Koishekenov, Hwechul Cho Ye, Xin Liu, Xueying Hu, Jun Jiang, Jinwei Gu, Kai Li, Pengliang Tan, Bingxin Hou

This paper reviews the second AIM learned ISP challenge and provides the description of the proposed solutions and results.

Demosaicking Denoising +1

Paper
Code

Position and Rotation Invariant Sign Language Recognition from 3D Kinect Data with Recurrent Neural Networks

1 code implementation • 23 Oct 2020 • Prasun Roy, Saumik Bhattacharya, Partha Pratim Roy, Umapada Pal

Sign language is a gesture-based symbolic communication medium among speech and hearing impaired people.

Position Sign Language Recognition +2

Paper
Code

Self-Supervised Representation Learning for Detection of ACL Tear Injury in Knee MR Videos

1 code implementation • 15 Jul 2020 • Siladittya Manna, Saumik Bhattacharya, Umapada Pal

In this paper, we propose a self-supervised learning approach to learn transferable features from MR video clips by enforcing the model to learn anatomical features.

Representation Learning Self-Supervised Learning

Paper
Code

UDBNET: Unsupervised Document Binarization Network via Adversarial Game

1 code implementation • 14 Jul 2020 • Amandeep Kumar, Shuvozit Ghose, Pinaki Nath Chowdhury, Partha Pratim Roy, Umapada Pal

In this paper, we present a novel approach towards document image binarization by introducing three-player min-max adversarial game.

Ranked #2 on Binarization on DIBCO 2011

Binarization

Paper
Code

A New Unified Method for Detecting Text from Marathon Runners and Sports Players in Video

no code implementations • 26 May 2020 • Sauradip Nag, Palaiahnakote Shivakumara, Umapada Pal, Tong Lu, Michael Blumenstein

The proposed method fuses gradient magnitude and direction coherence of text pixels in a new way for detecting candidate regions.

Clustering Text Detection

Paper
Add Code

Modeling Extent-of-Texture Information for Ground Terrain Recognition

1 code implementation • 17 Apr 2020 • Shuvozit Ghose, Pinaki Nath Chowdhury, Partha Pratim Roy, Umapada Pal

Ground Terrain Recognition is a difficult task as the context information varies significantly over the regions of a ground terrain image.

Image Classification

Paper
Code

DELP-DAR System for License Plate Detection and Recognition

no code implementations • 4 Oct 2019 • Zied Selmi, Mohamed Ben Halima, Umapada Pal, M. Adel Alimi

For this, we present in this paper an automatic framework for License Plate (LP) detection and recognition from complex scenes.

License Plate Detection

Paper
Add Code

ICDAR2019 Robust Reading Challenge on Multi-lingual Scene Text Detection and Recognition -- RRC-MLT-2019

no code implementations • 1 Jul 2019 • Nibal Nayef, Yash Patel, Michal Busta, Pinaki Nath Chowdhury, Dimosthenis Karatzas, Wafa Khlif, Jiri Matas, Umapada Pal, Jean-Christophe Burie, Cheng-Lin Liu, Jean-Marc Ogier

With the growing cosmopolitan culture of modern cities, the need of robust Multi-Lingual scene Text (MLT) detection and recognition systems has never been more immense.

Cultural Vocal Bursts Intensity Prediction General Classification +2

Paper
Add Code

Distance Metric Learned Collaborative Representation Classifier

no code implementations • 3 May 2019 • Tapabrata Chakraborti, Brendan McCane, Steven Mills, Umapada Pal

We present a simple effective way of achieving this by learning a generic Mahalanabis distance in a collaborative loss function in an end-to-end fashion with any standard convolutional network as the feature learner.

General Classification

Paper
Add Code

PProCRC: Probabilistic Collaboration of Image Patches

no code implementations • 21 Mar 2019 • Tapabrata Chakraborti, Brendan McCane, Steven Mills, Umapada Pal

We present a conditional probabilistic framework for collaborative representation of image patches.

Face Recognition

Paper
Add Code

STEFANN: Scene Text Editor using Font Adaptive Neural Network

1 code implementation • CVPR 2020 • Prasun Roy, Saumik Bhattacharya, Subhankar Ghosh, Umapada Pal

In this paper, we propose a method to modify text in an image at character-level.

Scene Text Editing

249

Paper
Code

CoCoNet: A Collaborative Convolutional Network

no code implementations • 28 Jan 2019 • Tapabrata Chakraborti, Brendan McCane, Steven Mills, Umapada Pal

We present an end-to-end deep network for fine-grained visual categorization called Collaborative Convolutional Network (CoCoNet).

Fine-Grained Visual Categorization Fine-Grained Visual Recognition +1

Paper
Add Code

A Deep One-Shot Network for Query-based Logo Retrieval

2 code implementations • 4 Nov 2018 • Ayan Kumar Bhunia, Ankan Kumar Bhunia, Shuvozit Ghose, Abhirup Das, Partha Pratim Roy, Umapada Pal

Logo detection in real-world scene images is an important problem with applications in advertisement and marketing.

Marketing object-detection +4

Paper
Code

Effects of Degradations on Deep Neural Network Architectures

2 code implementations • 26 Jul 2018 • Prasun Roy, Subhankar Ghosh, Saumik Bhattacharya, Umapada Pal

Deep convolutional neural networks (CNN) have massively influenced recent advances in large-scale image classification.

General Classification Image Classification

Paper
Code

Bag-of-Visual-Words for Signature-Based Multi-Script Document Retrieval

no code implementations • 18 Jul 2018 • Ranju Mandal, Partha Pratim Roy, Umapada Pal, Michael Blumenstein

Finally, three distance measures were used to match a query signature with the signature present in target documents for retrieval.

Retrieval

Paper
Add Code

A New COLD Feature based Handwriting Analysis for Ethnicity/Nationality Identification

no code implementations • 19 Jun 2018 • Sauradip Nag, Palaiahnakote Shivakumara, Wu Yirui, Umapada Pal, Tong Lu

For each line segment, the proposed method estimates angle and length, which gives a point in polar domain.

Paper
Add Code

Learning Cross-Modal Deep Embeddings for Multi-Object Image Retrieval using Text and Sketch

no code implementations • 28 Apr 2018 • Sounak Dey, Anjan Dutta, Suman K. Ghosh, Ernest Valveny, Josep Lladós, Umapada Pal

In this work we introduce a cross modal image retrieval system that allows both text and sketch as input modalities for the query.

Image Retrieval Retrieval

Paper
Add Code

Indic Handwritten Script Identification using Offline-Online Multimodal Deep Network

no code implementations • 23 Feb 2018 • Ayan Kumar Bhunia, Subham Mukherjee, Aneeshan Sain, Ankan Kumar Bhunia, Partha Pratim Roy, Umapada Pal

In this paper, we propose a novel approach of word-level Indic script identification using only character-level data in training stage.

Paper
Add Code

Staff line Removal using Generative Adversarial Networks

no code implementations • 22 Jan 2018 • Aishik Konwer, Ayan Kumar Bhunia, Abir Bhowmick, Ankan Kumar Bhunia, Prithaj Banerjee, Partha Pratim Roy, Umapada Pal

Staff line removal is a crucial pre-processing step in Optical Music Recognition.

Binary Classification

Paper
Add Code

Handwriting Trajectory Recovery using End-to-End Deep Encoder-Decoder Network

no code implementations • 22 Jan 2018 • Ayan Kumar Bhunia, Abir Bhowmick, Ankan Kumar Bhunia, Aishik Konwer, Prithaj Banerjee, Partha Pratim Roy, Umapada Pal

Our encoder module consists of Convolutional LSTM network, which takes an offline character image as the input and encodes the feature sequence to a hidden representation.

Retrieval

Paper
Add Code

Word Level Font-to-Font Image Translation using Convolutional Recurrent Generative Adversarial Networks

no code implementations • 22 Jan 2018 • Ankan Kumar Bhunia, Ayan Kumar Bhunia, Prithaj Banerjee, Aishik Konwer, Abir Bhowmick, Partha Pratim Roy, Umapada Pal

We employ a novel convolutional recurrent model architecture in the Generator that efficiently deals with the word images of arbitrary width.

Translation

Paper
Add Code

Script Identification in Natural Scene Image and Video Frame using Attention based Convolutional-LSTM Network

1 code implementation • 1 Jan 2018 • Ankan Kumar Bhunia, Aishik Konwer, Ayan Kumar Bhunia, Abir Bhowmick, Partha P. Roy, Umapada Pal

In this paper, we propose a novel method that involves extraction of local and global features using CNN-LSTM framework and weighting them dynamically for script identification.

Paper
Code

Cross-language Framework for Word Recognition and Spotting of Indic Scripts

no code implementations • 19 Dec 2017 • Ayan Kumar Bhunia, Partha Pratim Roy, Akash Mohta, Umapada Pal

This paper presents a novel cross language platform for handwritten word recognition and spotting for such low-resource scripts where training is performed with a sufficiently large dataset of an available script (considered as source script) and testing is done on other scripts (considered as target script).

Paper
Add Code

Zone-based Keyword Spotting in Bangla and Devanagari Documents

no code implementations • 5 Dec 2017 • Ayan Kumar Bhunia, Partha Pratim Roy, Umapada Pal

Also, we propose a novel feature combining foreground and background information of text line images for keyword-spotting by character filler models.

Keyword Spotting Segmentation

Paper
Add Code

LOOP Descriptor: Local Optimal Oriented Pattern

no code implementations • 25 Oct 2017 • Tapabrata Chakraborti, Brendan McCane, Steven Mills, Umapada Pal

This letter introduces the LOOP binary descriptor (local optimal oriented pattern) that encodes rotation invariance into the main formulation itself.

Paper
Add Code

Word Searching in Scene Image and Video Frame in Multi-Script Scenario using Dynamic Shape Coding

no code implementations • 18 Aug 2017 • Partha Pratim Roy, Ayan Kumar Bhunia, Avirup Bhattacharyya, Umapada Pal

To evaluate the proposed system for searching keyword from natural scene image and video frames, we have considered two popular Indic scripts such as Bangla (Bengali) and Devanagari along with English.

Keyword Spotting Optical Character Recognition (OCR) +2

Paper
Add Code

HMM-based Indic Handwritten Word Recognition using Zone Segmentation

no code implementations • 1 Aug 2017 • Partha Pratim Roy, Ayan Kumar Bhunia, Ayan Das, Prasenjit Dey, Umapada Pal

To avoid character segmentation in such scripts, HMM-based sequence modeling has been used earlier in holistic way.

Segmentation

Paper
Add Code

Multi-Oriented Text Detection and Verification in Video Frames and Scene Images

no code implementations • 22 Jul 2017 • Aneeshan Sain, Ayan Kumar Bhunia, Partha Pratim Roy, Umapada Pal

Until now only a few methods have been proposed that look into curved text detection in video frames, wherein lies our novelty.

Clustering Curved Text Detection +3

Paper
Add Code

HMM-based Writer Identification in Music Score Documents without Staff-Line Removal

no code implementations • 21 Jul 2017 • Partha Pratim Roy, Ayan Kumar Bhunia, Umapada Pal

A novel Factor Analysis based feature selection technique is applied in sliding window features to reduce the noise appearing from staff lines which proves efficiency in writer identification performance. In our framework we have also proposed a novel score line detection approach in musical sheet using HMM.

feature selection Line Detection

Paper
Add Code

Date-Field Retrieval in Scene Image and Video Frames using Text Enhancement and Shape Coding

no code implementations • 21 Jul 2017 • Partha Pratim Roy, Ayan Kumar Bhunia, Umapada Pal

We propose a line based date spotting approach using Hidden Markov Model (HMM) which is used to detect the date information in a given text.

Information Retrieval Retrieval

Paper
Add Code

Text Recognition in Scene Image and Video Frame using Color Channel Selection

no code implementations • 21 Jul 2017 • Ayan Kumar Bhunia, Gautam Kumar, Partha Pratim Roy, R. Balasubramanian, Umapada Pal

In this paper, we present a novel approach based on color channel selection for text recognition from scene images and video frames.

Binarization Optical Character Recognition (OCR)

Paper
Add Code

SigNet: Convolutional Siamese Network for Writer Independent Offline Signature Verification

5 code implementations • 7 Jul 2017 • Sounak Dey, Anjan Dutta, J. Ignacio Toledo, Suman K. Ghosh, Josep Llados, Umapada Pal

Offline signature verification is one of the most challenging tasks in biometrics and document forensics.

Ranked #1 on Handwriting Verification on CEDAR Signature

Handwriting Verification

Paper
Code

Product Graph-based Higher Order Contextual Similarities for Inexact Subgraph Matching

no code implementations • 1 Feb 2017 • Anjan Dutta, Josep Lladós, Horst Bunke, Umapada Pal

Many algorithms formulate graph matching as an optimization of an objective function of pairwise quantification of nodes and edges of two graphs to be matched.

Graph Matching

Paper
Add Code

Evaluation of the Effect of Improper Segmentation on Word Spotting

no code implementations • 21 Apr 2016 • Sounak Dey, Anguelos Nicolaou, Josep Llados, Umapada Pal

Word spotting is an important recognition task in historical document analysis.

Segmentation

Paper
Add Code

Local Binary Pattern for Word Spotting in Handwritten Historical Document

no code implementations • 20 Apr 2016 • Sounak Dey, Anguelos Nicolaou, Josep Llados, Umapada Pal

Digital libraries store images which can be highly degraded and to index this kind of images we resort to word spot- ting as our information retrieval system.

Information Retrieval Retrieval

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.