Search Results for author: Sanket Biswas

Found 29 papers, 16 papers with code

Recurrent Few-Shot model for Document Verification

no code implementations · 3 Oct 2024 · Maxime Talarmain, Carlos Boned, Sanket Biswas, Oriol Ramos

This task is particularly challenging when dealing with unseen classes of ID or travel documents.

Towards Generative Class Prompt Learning for Fine-grained Visual Recognition

1 code implementation · 3 Sep 2024 · Soumitri Chattopadhyay, Sanket Biswas, Emanuele Vivoli, Josep Lladós

Specifically, we propose two novel methods: Generative Class Prompt Learning (GCPL) and Contrastive Multi-class Prompt Learning (CoMPLe).

Attribute Contrastive Learning +1

FastTextSpotter: A High-Efficiency Transformer for Multilingual Scene Text Spotting

no code implementations · 27 Aug 2024 · Alloy Das, Sanket Biswas, Umapada Pal, Josep Lladós, Saumik Bhattacharya

The proliferation of scene text in both structured and unstructured environments presents significant challenges in optical character recognition (OCR), necessitating more efficient and robust text spotting solutions.

Benchmarking Decoder +3

DistilDoc: Knowledge Distillation for Visually-Rich Document Applications

no code implementations · 12 Jun 2024 · Jordy Van Landeghem, Subhajit Maity, Ayan Banerjee, Matthew Blaschko, Marie-Francine Moens, Josep Lladós, Sanket Biswas

This work explores knowledge distillation (KD) for visually-rich document (VRD) applications such as document layout analysis (DLA) and document image classification (DIC).

Document Image Classification Document Layout Analysis +5

DocSynthv2: A Practical Autoregressive Modeling for Document Generation

no code implementations · 12 Jun 2024 · Sanket Biswas, Rajiv Jain, Vlad I. Morariu, Jiuxiang Gu, Puneet Mathur, Curtis Wigington, Tong Sun, Josep Lladós

While the generation of document layouts has been extensively explored, comprehensive document generation encompassing both layout and content presents a more complex challenge.

LayeredDoc: Domain Adaptive Document Restoration with a Layer Separation Approach

1 code implementation · 12 Jun 2024 · Maria Pilligua, Nil Biescas, Javier Vazquez-Corral, Josep Lladós, Ernest Valveny, Sanket Biswas

The rapid evolution of intelligent document processing systems demands robust solutions that adapt to diverse domains without extensive retraining.

Domain Adaptation Image Restoration

SketchGPT: Autoregressive Modeling for Sketch Generation and Recognition

no code implementations · 6 May 2024 · Adarsh Tiwari, Sanket Biswas, Josep Lladós

We present SketchGPT, a flexible framework that employs a sequence-to-sequence autoregressive model for sketch generation and completion, along with an interpretation case study for sketch recognition.

Sketch Recognition

GeoContrastNet: Contrastive Key-Value Edge Learning for Language-Agnostic Document Understanding

1 code implementation · 6 May 2024 · Nil Biescas, Carlos Boned, Josep Lladós, Sanket Biswas

This paper presents GeoContrastNet, a language-agnostic framework for structured document understanding (DU) that integrates a contrastive learning objective with graph attention networks (GATs), emphasizing the significant role of geometric features.

Contrastive Learning document understanding +4

GraphKD: Exploring Knowledge Distillation Towards Document Object Detection with Structured Graph Creation

1 code implementation · 17 Feb 2024 · Ayan Banerjee, Sanket Biswas, Josep Lladós, Umapada Pal

Object detection in documents is a key step to automate the structural elements identification process in a digital or scanned document through understanding the hierarchical structure and relationships between different elements.

Knowledge Distillation object-detection +1

Diving into the Depths of Spotting Text in Multi-Domain Noisy Scenes

no code implementations · 1 Oct 2023 · Alloy Das, Sanket Biswas, Umapada Pal, Josep Lladós

When used in a real-world noisy environment, the capacity to generalize to multiple domains is essential for any autonomous scene text spotting system.

Super-Resolution Text Spotting

Beyond Document Page Classification: Design, Datasets, and Challenges

1 code implementation · 24 Aug 2023 · Jordy Van Landeghem, Sanket Biswas, Matthew B. Blaschko, Marie-Francine Moens

This paper highlights the need to bring document classification benchmarking closer to real-world applications, both in the nature of data tested ($X$: multi-channel, multi-paged, multi-industry; $Y$: class distributions and label set variety) and in classification tasks considered ($f$: multi-page document, page stream, and document bundle classification, ...).

Benchmarking Classification +1

FASTER: A Font-Agnostic Scene Text Editing and Rendering Framework

no code implementations · 5 Aug 2023 · Alloy Das, Sanket Biswas, Prasun Roy, Subhankar Ghosh, Umapada Pal, Michael Blumenstein, Josep Lladós, Saumik Bhattacharya

Scene Text Editing (STE) is a challenging research problem that primarily aims to modify existing text in an image while preserving the background and the font style of the original text.

Scene Text Editing Style Transfer +1

SwinDocSegmenter: An End-to-End Unified Domain Adaptive Transformer for Document Instance Segmentation

1 code implementation · 8 May 2023 · Ayan Banerjee, Sanket Biswas, Josep Lladós, Umapada Pal

Instance-level segmentation of documents consists in assigning a class-aware and instance-aware label to each pixel of the image.

Decoder Instance Segmentation +2

SelfDocSeg: A Self-Supervised vision-based Approach towards Document Segmentation

1 code implementation · 1 May 2023 · Subhajit Maity, Sanket Biswas, Siladittya Manna, Ayan Banerjee, Josep Lladós, Saumik Bhattacharya, Umapada Pal

Document layout analysis is a well-known problem in the document research community and has been extensively explored, yielding a multitude of solutions ranging from text mining and recognition to graph-based representation, visual feature extraction, etc.

Document Layout Analysis object-detection +1

Text-DIAE: A Self-Supervised Degradation Invariant Autoencoders for Text Recognition and Document Enhancement

1 code implementation · 9 Mar 2022 · Mohamed Ali Souibgui, Sanket Biswas, Andres Mafla, Ali Furkan Biten, Alicia Fornés, Yousri Kessentini, Josep Lladós, Lluis Gomez, Dimosthenis Karatzas

In this paper, we propose a Text-Degradation Invariant Auto Encoder (Text-DIAE), a self-supervised model designed to tackle two tasks, text recognition (handwritten or scene-text) and document image enhancement.

Document Enhancement Scene Text Recognition

Graph-based Deep Generative Modelling for Document Layout Generation

no code implementations · 9 Jul 2021 · Sanket Biswas, Pau Riba, Josep Lladós, Umapada Pal

One of the major prerequisites for any deep learning approach is the availability of large-scale training data.

DocSynth: A Layout Guided Approach for Controllable Document Image Synthesis

1 code implementation · 6 Jul 2021 · Sanket Biswas, Pau Riba, Josep Lladós, Umapada Pal

The results highlight that our model can successfully generate realistic and diverse document images with multiple objects.

Document Layout Analysis Image Generation

Ehrhart-Equivalence, Equidecomposability, and Unimodular Equivalence of Integral Polytopes

no code implementations · 21 Jan 2021 · Fiona Abney-McPeek, Sanket Biswas, Senjuti Dutta, Yongyuan Huang, Deyuan Li, Nancy Xu

In this paper, we establish a relationship between Ehrhart-equivalence and other forms of equivalence: the $\operatorname{GL}_n(\mathbb{Z})$-equidecomposability and unimodular equivalence of two integral $n$-polytopes in $\mathbb{R}^n$.

Combinatorics

Fault Area Detection in Leaf Diseases using k-means Clustering

no code implementations · 24 Oct 2018 · Subhajit Maity, Sujan Sarkar, Avinaba Tapadar, Ayan Dutta, Sanket Biswas, Sayon Nayek, Pritam Saha

As the population increases, the food crisis grows larger day by day. In this time of crisis, leaf disease in crops is the biggest problem in the food industry. In this paper, we address that problem and propose an efficient method to detect leaf disease. Leaf diseases can be detected from sample images of the leaf with the help of image processing and segmentation. Using k-means clustering and Otsu's method, the faulty region in a leaf is detected, which helps determine the proper course of action. Further, the ratio of normal to faulty region, if calculated, could predict whether the leaf can be cured at all.

Clustering
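
The abstract above names Otsu's method and a normal-to-faulty region ratio without giving details. The following is a minimal sketch of the Otsu thresholding step only, not the authors' implementation: it assumes a flattened list of 8-bit grayscale values and assumes that darker pixels are the faulty ones. The function names `otsu_threshold` and `faulty_ratio` are hypothetical, and the k-means step is omitted.

```python
def otsu_threshold(pixels, levels=256):
    """Return the grey level t that maximises between-class variance.

    Class 0 holds pixels with value <= t, class 1 holds the rest.
    """
    hist = [0] * levels
    for p in pixels:
        hist[p] += 1

    total = len(pixels)
    sum_all = sum(level * count for level, count in enumerate(hist))

    best_t, best_var = 0, -1.0
    w0 = 0    # number of pixels in class 0
    sum0 = 0  # sum of intensities in class 0
    for t in range(levels):
        w0 += hist[t]
        sum0 += t * hist[t]
        if w0 == 0:
            continue  # class 0 still empty
        w1 = total - w0
        if w1 == 0:
            break  # class 1 empty from here on
        mean0 = sum0 / w0
        mean1 = (sum_all - sum0) / w1
        # Between-class variance, up to a constant factor of 1 / total**2.
        var_between = w0 * w1 * (mean0 - mean1) ** 2
        if var_between > best_var:
            best_var, best_t = var_between, t
    return best_t


def faulty_ratio(pixels, dark_is_faulty=True):
    """Fraction of pixels on the 'faulty' side of the Otsu threshold.

    ASSUMPTION: diseased regions are darker than healthy tissue; pass
    dark_is_faulty=False to flip that convention.
    """
    t = otsu_threshold(pixels)
    faulty = sum(1 for p in pixels if (p <= t) == dark_is_faulty)
    return faulty / len(pixels)


# Toy image: 30 dark pixels and 70 bright pixels.
pixels = [10] * 30 + [200] * 70
threshold = otsu_threshold(pixels)
ratio = faulty_ratio(pixels)
```

In practice the threshold would be computed on a chosen channel of the leaf image after background removal; which channel and which preprocessing the paper uses is not stated in the abstract.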

A Statistical Approach to Adult Census Income Level Prediction

1 code implementation23 Oct 2018 Navoneel Chakrabarty, Sanket Biswas

The prominent inequality of wealth and income is a huge concern, especially in the United States.

valid
