Search Results for author: Abhra Chaudhuri

Found 6 papers, 4 papers with code

Learning Conditional Invariances through Non-Commutativity

1 code implementation18 Feb 2024 Abhra Chaudhuri, Serban Georgescu, Anjan Dutta

Invariance learning algorithms that conditionally filter out domain-specific random variables as distractors, do so based only on the data semantics, and not the target domain under evaluation.

Domain Adaptation

CLIPDrawX: Primitive-based Explanations for Text Guided Sketch Synthesis

no code implementations4 Dec 2023 Nityanand Mathur, Shyam Marjit, Abhra Chaudhuri, Anjan Dutta

With the goal of understanding the visual concepts that CLIP associates with text prompts, we show that the latent space of CLIP can be visualized solely in terms of linear transformations on simple geometric primitives like circles and straight lines.

Sarcasm in Sight and Sound: Benchmarking and Expansion to Improve Multimodal Sarcasm Detection

no code implementations29 Sep 2023 Swapnil Bhosale, Abhra Chaudhuri, Alex Lee Robert Williams, Divyank Tiwari, Anjan Dutta, Xiatian Zhu, Pushpak Bhattacharyya, Diptesh Kanojia

The introduction of the MUStARD dataset, and its emotion recognition extension MUStARD++, have identified sarcasm to be a multi-modal phenomenon -- expressed not only in natural language text, but also through manners of speech (like tonality and intonation) and visual cues (facial expression).

Benchmarking Emotion Recognition +1

Data-Free Sketch-Based Image Retrieval

1 code implementation CVPR 2023 Abhra Chaudhuri, Ayan Kumar Bhunia, Yi-Zhe Song, Anjan Dutta

For the first time, we identify that for data-scarce tasks like Sketch-Based Image Retrieval (SBIR), where the difficulty in acquiring paired photos and hand-drawn sketches limits data-dependent cross-modal learning algorithms, DFL can prove to be a much more practical paradigm.

Retrieval Sketch-Based Image Retrieval

Cross-Modal Fusion Distillation for Fine-Grained Sketch-Based Image Retrieval

1 code implementation19 Oct 2022 Abhra Chaudhuri, Massimiliano Mancini, Yanbei Chen, Zeynep Akata, Anjan Dutta

Representation learning for sketch-based image retrieval has mostly been tackled by learning embeddings that discard modality-specific information.

Cross-Modal Retrieval Knowledge Distillation +3

Relational Proxies: Emergent Relationships as Fine-Grained Discriminators

1 code implementation5 Oct 2022 Abhra Chaudhuri, Massimiliano Mancini, Zeynep Akata, Anjan Dutta

Fine-grained categories that largely share the same set of parts cannot be discriminated based on part information alone, as they mostly differ in the way the local parts relate to the overall global structure of the object.

Cannot find the paper you are looking for? You can Submit a new open access paper.