Search Results for author: Pinaki Nath Chowdhury

Found 31 papers, 10 papers with code

SketchINR: A First Look into Sketches as Implicit Neural Representations

no code implementations • 14 Mar 2024 • Hmrishav Bandyopadhyay, Ayan Kumar Bhunia, Pinaki Nath Chowdhury, Aneeshan Sain, Tao Xiang, Timothy Hospedales, Yi-Zhe Song

(ii) SketchINR's auto-decoder provides a much higher-fidelity representation than other learned vector sketch representations, and is uniquely able to scale to complex vector sketches such as FS-COCO.

Data Compression

Paper
Add Code

What Sketch Explainability Really Means for Downstream Tasks

no code implementations • 14 Mar 2024 • Hmrishav Bandyopadhyay, Pinaki Nath Chowdhury, Ayan Kumar Bhunia, Aneeshan Sain, Tao Xiang, Yi-Zhe Song

In this paper, we explore the unique modality of sketch for explainability, emphasising the profound impact of human strokes compared to conventional pixel-oriented studies.

Retrieval

Paper
Add Code

You'll Never Walk Alone: A Sketch and Text Duet for Fine-Grained Image Retrieval

no code implementations • 12 Mar 2024 • Subhadeep Koley, Ayan Kumar Bhunia, Aneeshan Sain, Pinaki Nath Chowdhury, Tao Xiang, Yi-Zhe Song

Two primary input modalities prevail in image retrieval: sketch and text.

Attribute Image Retrieval +1

Paper
Add Code

It's All About Your Sketch: Democratising Sketch Control in Diffusion Models

1 code implementation • 12 Mar 2024 • Subhadeep Koley, Ayan Kumar Bhunia, Deeptanshu Sekhri, Aneeshan Sain, Pinaki Nath Chowdhury, Tao Xiang, Yi-Zhe Song

This paper unravels the potential of sketches for diffusion models, addressing the deceptive promise of direct sketch control in generative AI.

Retrieval Sketch-Based Image Retrieval

Paper
Code

Text-to-Image Diffusion Models are Great Sketch-Photo Matchmakers

no code implementations • 12 Mar 2024 • Subhadeep Koley, Ayan Kumar Bhunia, Aneeshan Sain, Pinaki Nath Chowdhury, Tao Xiang, Yi-Zhe Song

This paper, for the first time, explores text-to-image diffusion models for Zero-Shot Sketch-based Image Retrieval (ZS-SBIR).

Retrieval Sketch-Based Image Retrieval

Paper
Add Code

How to Handle Sketch-Abstraction in Sketch-Based Image Retrieval?

no code implementations • 11 Mar 2024 • Subhadeep Koley, Ayan Kumar Bhunia, Aneeshan Sain, Pinaki Nath Chowdhury, Tao Xiang, Yi-Zhe Song

@q loss to inject that understanding into the system.

Retrieval Sketch-Based Image Retrieval

Paper
Add Code

Doodle Your 3D: From Abstract Freehand Sketches to Precise 3D Shapes

no code implementations • 7 Dec 2023 • Hmrishav Bandyopadhyay, Subhadeep Koley, Ayan Das, Aneeshan Sain, Pinaki Nath Chowdhury, Tao Xiang, Ayan Kumar Bhunia, Yi-Zhe Song

In this paper, we democratise 3D content creation, enabling precise generation of 3D shapes from abstract sketches while overcoming limitations tied to drawing skills.

Position

Paper
Add Code

DemoCaricature: Democratising Caricature Generation with a Rough Sketch

no code implementations • 7 Dec 2023 • Dar-Yen Chen, Ayan Kumar Bhunia, Subhadeep Koley, Aneeshan Sain, Pinaki Nath Chowdhury, Yi-Zhe Song

In this paper, we democratise caricature generation, empowering individuals to effortlessly craft personalised caricatures with just a photo and a conceptual sketch.

Caricature Model Editing

Paper
Add Code

3D VR Sketch Guided 3D Shape Prototyping and Exploration

1 code implementation • ICCV 2023 • Ling Luo, Pinaki Nath Chowdhury, Tao Xiang, Yi-Zhe Song, Yulia Gryaditskaya

3D shape modeling is labor-intensive, time-consuming, and requires years of expertise.

3D Shape Generation 3D Shape Modeling +1

Paper
Code

What Can Human Sketches Do for Object Detection?

no code implementations • CVPR 2023 • Pinaki Nath Chowdhury, Ayan Kumar Bhunia, Aneeshan Sain, Subhadeep Koley, Tao Xiang, Yi-Zhe Song

In particular, we first perform independent prompting on both sketch and photo branches of an SBIR model to build highly generalisable sketch and photo encoders on the back of the generalisation ability of CLIP.

Object object-detection +3

Paper
Add Code

Exploiting Unlabelled Photos for Stronger Fine-Grained SBIR

no code implementations • CVPR 2023 • Aneeshan Sain, Ayan Kumar Bhunia, Subhadeep Koley, Pinaki Nath Chowdhury, Soumitri Chattopadhyay, Tao Xiang, Yi-Zhe Song

This paper advances the fine-grained sketch-based image retrieval (FG-SBIR) literature by putting forward a strong baseline that overshoots prior state-of-the-arts by ~11%.

Knowledge Distillation Sketch-Based Image Retrieval

Paper
Add Code

CLIP for All Things Zero-Shot Sketch-Based Image Retrieval, Fine-Grained or Not

no code implementations • CVPR 2023 • Aneeshan Sain, Ayan Kumar Bhunia, Pinaki Nath Chowdhury, Subhadeep Koley, Tao Xiang, Yi-Zhe Song

At the very core of our solution is a prompt learning setup.

Retrieval Sketch-Based Image Retrieval

Paper
Add Code

Sketch2Saliency: Learning to Detect Salient Objects from Human Drawings

no code implementations • CVPR 2023 • Ayan Kumar Bhunia, Subhadeep Koley, Amandeep Kumar, Aneeshan Sain, Pinaki Nath Chowdhury, Tao Xiang, Yi-Zhe Song

Human sketch has already proved its worth in various visual understanding tasks (e. g., retrieval, segmentation, image-captioning, etc).

Image Captioning Retrieval +1

Paper
Add Code

Picture that Sketch: Photorealistic Image Generation from Abstract Sketches

no code implementations • CVPR 2023 • Subhadeep Koley, Ayan Kumar Bhunia, Aneeshan Sain, Pinaki Nath Chowdhury, Tao Xiang, Yi-Zhe Song

We further introduce specific designs to tackle the abstract nature of human sketches, including a fine-grained discriminative loss on the back of a trained sketch-photo retrieval model, and a partial-aware sketch augmentation strategy.

Image Generation Retrieval +1

Paper
Add Code

Democratising 2D Sketch to 3D Shape Retrieval Through Pivoting

no code implementations • ICCV 2023 • Pinaki Nath Chowdhury, Ayan Kumar Bhunia, Aneeshan Sain, Subhadeep Koley, Tao Xiang, Yi-Zhe Song

We perform pivoting on two existing datasets, each from a distant research domain to the other: 2D sketch and photo pairs from the sketch-based image retrieval field (SBIR), and 3D shapes from ShapeNet.

3D Shape Retrieval Retrieval +1

Paper
Add Code

Adaptive Fine-Grained Sketch-Based Image Retrieval

1 code implementation • 4 Jul 2022 • Ayan Kumar Bhunia, Aneeshan Sain, Parth Shah, Animesh Gupta, Pinaki Nath Chowdhury, Tao Xiang, Yi-Zhe Song

To solve this new problem, we introduce a novel model-agnostic meta-learning (MAML) based framework with several key modifications: (1) As a retrieval task with a margin-based contrastive loss, we simplify the MAML training in the inner loop to make it more stable and tractable.

Meta-Learning Retrieval +1

Paper
Code

SceneTrilogy: On Human Scene-Sketch and its Complementarity with Photo and Text

no code implementations • CVPR 2023 • Pinaki Nath Chowdhury, Ayan Kumar Bhunia, Aneeshan Sain, Subhadeep Koley, Tao Xiang, Yi-Zhe Song

In this paper, we extend scene understanding to include that of human sketch.

Image Retrieval Retrieval +1

Paper
Add Code

Sketch3T: Test-Time Training for Zero-Shot SBIR

no code implementations • CVPR 2022 • Aneeshan Sain, Ayan Kumar Bhunia, Vaishnav Potlapalli, Pinaki Nath Chowdhury, Tao Xiang, Yi-Zhe Song

In this paper, we question to argue that this setup by definition is not compatible with the inherent abstract and subjective nature of sketches, i. e., the model might transfer well to new categories, but will not understand sketches existing in different test-time distribution as a result.

Meta-Learning Retrieval +1

Paper
Add Code

Partially Does It: Towards Scene-Level FG-SBIR with Partial Input

no code implementations • CVPR 2022 • Pinaki Nath Chowdhury, Ayan Kumar Bhunia, Viswanatha Reddy Gajjala, Aneeshan Sain, Tao Xiang, Yi-Zhe Song

We scrutinise an important observation plaguing scene-level sketch research -- that a significant portion of scene sketches are "partial".

Retrieval Sketch-Based Image Retrieval

Paper
Add Code

Sketching without Worrying: Noise-Tolerant Sketch-Based Image Retrieval

1 code implementation • CVPR 2022 • Ayan Kumar Bhunia, Subhadeep Koley, Abdullah Faiz Ur Rahman Khilji, Aneeshan Sain, Pinaki Nath Chowdhury, Tao Xiang, Yi-Zhe Song

We first conducted a pilot study that revealed the secret lies in the existence of noisy strokes, but not so much of the "I can't sketch".

Retrieval Sketch-Based Image Retrieval

Paper
Code

FS-COCO: Towards Understanding of Freehand Sketches of Common Objects in Context

1 code implementation • 4 Mar 2022 • Pinaki Nath Chowdhury, Aneeshan Sain, Ayan Kumar Bhunia, Tao Xiang, Yulia Gryaditskaya, Yi-Zhe Song

We advance sketch research to scenes with the first dataset of freehand scene sketches, FS-COCO.

Image Captioning Image Retrieval +2

Paper
Code

SketchLattice: Latticed Representation for Sketch Manipulation

no code implementations • ICCV 2021 • Yonggang Qi, Guoyao Su, Pinaki Nath Chowdhury, Mingkang Li, Yi-Zhe Song

The key challenge in designing a sketch representation lies with handling the abstract and iconic nature of sketches.

Paper
Add Code

Text is Text, No Matter What: Unifying Text Recognition using Knowledge Distillation

no code implementations • ICCV 2021 • Ayan Kumar Bhunia, Aneeshan Sain, Pinaki Nath Chowdhury, Yi-Zhe Song

In this paper, for the first time, we argue for their unification -- we aim for a single model that can compete favourably with two separate state-of-the-art STR and HTR models.

Handwriting Recognition HTR +2

Paper
Add Code

Joint Visual Semantic Reasoning: Multi-Stage Decoder for Text Recognition

no code implementations • ICCV 2021 • Ayan Kumar Bhunia, Aneeshan Sain, Amandeep Kumar, Shuvozit Ghose, Pinaki Nath Chowdhury, Yi-Zhe Song

In this paper, we argue that semantic information offers a complementary role in addition to visual only.

Rolling Shutter Correction

Paper
Add Code

Towards the Unseen: Iterative Text Recognition by Distilling from Errors

no code implementations • ICCV 2021 • Ayan Kumar Bhunia, Pinaki Nath Chowdhury, Aneeshan Sain, Yi-Zhe Song

Our framework is iterative in nature, in that it utilises predicted knowledge of character sequences from a previous iteration, to augment the main network in improving the next prediction.

Paper
Add Code

MetaHTR: Towards Writer-Adaptive Handwritten Text Recognition

1 code implementation • CVPR 2021 • Ayan Kumar Bhunia, Shuvozit Ghose, Amandeep Kumar, Pinaki Nath Chowdhury, Aneeshan Sain, Yi-Zhe Song

In this paper, we take a completely different perspective -- we work on the assumption that there is always a new style that is drastically different, and that we will only have very limited data during testing to perform adaptation.

Handwritten Text Recognition HTR +1

Paper
Code

Vectorization and Rasterization: Self-Supervised Learning for Sketch and Handwriting

1 code implementation • CVPR 2021 • Ayan Kumar Bhunia, Pinaki Nath Chowdhury, Yongxin Yang, Timothy M. Hospedales, Tao Xiang, Yi-Zhe Song

This data is uniquely characterised by its existence in dual modalities of rasterized images and vector coordinate sequences.

Self-Supervised Learning Translation

Paper
Code

More Photos are All You Need: Semi-Supervised Learning for Fine-Grained Sketch Based Image Retrieval

1 code implementation • CVPR 2021 • Ayan Kumar Bhunia, Pinaki Nath Chowdhury, Aneeshan Sain, Yongxin Yang, Tao Xiang, Yi-Zhe Song

A fundamental challenge faced by existing Fine-Grained Sketch-Based Image Retrieval (FG-SBIR) models is the data scarcity -- model performances are largely bottlenecked by the lack of sketch-photo pairs.

Cross-Modal Retrieval Retrieval +2

Paper
Code

UDBNET: Unsupervised Document Binarization Network via Adversarial Game

1 code implementation • 14 Jul 2020 • Amandeep Kumar, Shuvozit Ghose, Pinaki Nath Chowdhury, Partha Pratim Roy, Umapada Pal

In this paper, we present a novel approach towards document image binarization by introducing three-player min-max adversarial game.

Ranked #2 on Binarization on DIBCO 2011

Binarization

Paper
Code

Modeling Extent-of-Texture Information for Ground Terrain Recognition

1 code implementation • 17 Apr 2020 • Shuvozit Ghose, Pinaki Nath Chowdhury, Partha Pratim Roy, Umapada Pal

Ground Terrain Recognition is a difficult task as the context information varies significantly over the regions of a ground terrain image.

Image Classification

Paper
Code

ICDAR2019 Robust Reading Challenge on Multi-lingual Scene Text Detection and Recognition -- RRC-MLT-2019

no code implementations • 1 Jul 2019 • Nibal Nayef, Yash Patel, Michal Busta, Pinaki Nath Chowdhury, Dimosthenis Karatzas, Wafa Khlif, Jiri Matas, Umapada Pal, Jean-Christophe Burie, Cheng-Lin Liu, Jean-Marc Ogier

With the growing cosmopolitan culture of modern cities, the need of robust Multi-Lingual scene Text (MLT) detection and recognition systems has never been more immense.

Cultural Vocal Bursts Intensity Prediction General Classification +2

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.