1 code implementation • 18 Apr 2024 • Nakul Sharma, Aditay Tripathi, Anirban Chakraborty, Anand Mishra
In this work, we study the task of sketch-guided image inpainting.
no code implementations • 6 Aug 2023 • Onkar Susladkar, Prajwal Gatti, Anand Mishra
In this work, we study the task of ``visually" translating scene text from a source language (e. g., English) to a target language (e. g., Chinese).
no code implementations • 29 Jun 2023 • Abhirama Subramanyam Penamakuri, Manish Gupta, Mithun Das Gupta, Anand Mishra
We study visual question answering in a setting where the answer has to be mined from a pool of relevant and irrelevant images given as a context.
no code implementations • 15 Mar 2023 • Aditay Tripathi, Anand Mishra, Anirban Chakraborty
and Sketchy datasets, respectively, and a $12. 2\%$ improvement in AP@50 for large objects that are `unseen' during training.
1 code implementation • CVPR 2023 • Yogesh Kumar, Anand Mishra
Given a query visual relationship as <subject, predicate, object> and a test video, our objective is to localize the subject and object that are connected via the predicate.
no code implementations • 1 Dec 2022 • Aditay Tripathi, Rajath R Dani, Anand Mishra, Anirban Chakraborty
In such a scenario, a hand-drawn sketch of the object could be a choice for a query.
no code implementations • 23 Nov 2022 • Nakul Sharma, Abhirama S. Penamakuri, Anand Mishra
To fill this gap in the literature, we introduce Wikidata Reference Logo Dataset (WiRLD), containing logos for 100K business brands harvested from Wikidata.
no code implementations • 23 Nov 2022 • Soumya Jahagirdar, Shankar Gangisetty, Anand Mishra
However, it is challenging as it requires an in-depth understanding of the scene and the ability to semantically bridge the visual content with the text present in the image.
no code implementations • 3 Nov 2022 • Aditay Tripathi, Anand Mishra, Anirban Chakraborty
In VL-MPAG Net, we first construct a directed graph with object proposals as nodes and an edge between a pair of nodes representing a plausible relation between them.
no code implementations • 16 Oct 2022 • Prajwal Gatti, Abhirama Subramanyam Penamakuri, Revant Teotia, Anand Mishra, Shubhashis Sengupta, Roshni Ramnani
To enable both commonsense and factual reasoning in the image search, we present a unified framework, namely Knowledge Retrieval-Augmented Multimodal Transformer (KRAMT), that treats the named visual entities in an image as a gateway to encyclopedic knowledge and leverages them along with natural language query to ground relevant knowledge.
1 code implementation • ICCV 2021 • Revant Teotia, Vaibhav Mishra, Mayank Maheshwari, Anand Mishra
In this paper, given a small bag of images, each containing a common but latent predicate, we are interested in localizing visual subject-object pairs connected via the common predicate in each of the images.
1 code implementation • ECCV 2020 • Aditay Tripathi, Rajath R Dani, Anand Mishra, Anirban Chakraborty
We refer to this problem as sketch-guided object localization.
no code implementations • 6 Dec 2018 • Anand Mishra, Ajeet Kumar Singh
In this paper, we address the problem of hand-drawn sketch recognition.
no code implementations • 13 Jan 2016 • Anand Mishra, Karteek Alahari, C. V. Jawahar
We build a conditional random field model on these detections to jointly model the strength of the detections and the interactions between them.
1 code implementation • BMVC 2012 - Electronic Proceedings of the British Machine Vision Conference 2012 2012 • Anand Mishra, Karteek Alahari, C. V. Jawahar
The problem of recognizing text in images taken in the wild has gained significant attention from the computer vision community in recent years.