no code implementations • 15 Sep 2023 • Soumya Banerjee, Vinay K. Verma, Avideep Mukherjee, Deepak Gupta, Vinay P. Namboodiri, Piyush Rai
Most of the existing methods primarily focus on lifelong learning within a static environment and lack the ability to mitigate forgetting in a quickly-changing dynamic environment.
no code implementations • 13 Apr 2023 • Badri N. Patro, Vinay P. Namboodiri, Vijay Srinivas Agneeswaran
Vision transformers have been applied successfully for image recognition tasks.
no code implementations • 27 Jan 2023 • Soumya Banerjee, Vinay Kumar Verma, Vinay P. Namboodiri
Despite rapid advancements in lifelong learning (LLL) research, a large body of research mainly focuses on improving the performance in the existing \textit{static} continual learning (CL) setups.
no code implementations • 7 Oct 2022 • Madhav Agarwal, Anchit Gupta, Rudrabha Mukhopadhyay, Vinay P. Namboodiri, C V Jawahar
We use a state-of-the-art face reenactment network to detect key points in the non-pivot frames and transmit them to the receiver.
no code implementations • 4 Jun 2022 • Munender Varshney, Ravindra Yadav, Vinay P. Namboodiri, Rajesh M Hegde
This work aims to understand the correlation/mapping between speech and the sequence of lip movement of individual speakers in an unconstrained and large vocabulary.
no code implementations • 17 Feb 2022 • Vinod K Kurmi, Rishabh Sharma, Yash Vardhan Sharma, Vinay P. Namboodiri
The main drawback in such a model is that it directly introduces a trade-off with accuracy as the features that the discriminator deems to be sensitive for discrimination of bias could be correlated with classification.
no code implementations • 20 Oct 2021 • Soumya Banerjee, Vinay Kumar Verma, Toufiq Parag, Maneesh Singh, Vinay P. Namboodiri
We propose a novel approach (CIOSL) for the class-incremental learning in an \emph{online streaming setting} to address these challenges.
no code implementations • 16 Oct 2021 • Anchit Gupta, Faizan Farooq Khan, Rudrabha Mukhopadhyay, Vinay P. Namboodiri, C. V. Jawahar
Our evaluations show a clear improvement in the efficiency of using human editors and an improved video generation quality.
no code implementations • 24 Sep 2021 • Avideep Mukherjee, Badri Narayan Patro, Vinay P. Namboodiri
Normalizing flows provide an elegant method for obtaining tractable density estimates from distributions by using invertible transformations.
1 code implementation • Findings (ACL) 2021 • Zeeshan Khan, Kartheek Akella, Vinay P. Namboodiri, C V Jawahar
We propose a novel adaptation strategy, where we iteratively prune and retrain the redundant parameters of an MNMT to improve bilingual representations while retaining the multilinguality.
no code implementations • 12 Jul 2021 • Blessen George, Vinod K. Kurmi, Vinay P. Namboodiri
Generative adversarial networks (GANs) are very popular to generate realistic images, but they often suffer from the training instability issues and the phenomenon of mode loss.
no code implementations • 9 Jul 2021 • Vinod K Kurmi, Venkatesh K Subramanian, Vinay P. Namboodiri
Among the methodologies used, that of adversarial learning is widely applied to solve many deep learning problems along with domain adaptation.
no code implementations • 30 Jun 2021 • Pratik Mazumder, Pravendra Singh, Vinay P. Namboodiri
Our approach significantly improves their performance and further reduces the model biases in the limited data regime.
no code implementations • CVPR 2021 • Pravendra Singh, Pratik Mazumder, Piyush Rai, Vinay P. Namboodiri
Our proposed method uses weight rectifications and affine transformations in order to adapt the model to different tasks that arrive sequentially.
no code implementations • 3 Feb 2021 • Vinod K Kurmi, Badri N. Patro, Venkatesh K. Subramanian, Vinay P. Namboodiri
We define distillation losses in terms of aleatoric uncertainty and self-attention.
no code implementations • ICON 2020 • Kartheek Akella, Sai Himal Allu, Sridhar Suresh Ragupathi, Aman Singhal, Zeeshan Khan, Vinay P. Namboodiri, C V Jawahar
In this paper, we address the task of improving pair-wise machine translation for specific low resource Indian languages.
no code implementations • 22 Nov 2020 • Pratik Mazumder, Pravendra Singh, Vinay P. Namboodiri
Our method relies on generating robust prototypes from a set of few examples.
2 code implementations • 13 Nov 2020 • Raghav B. Venkataramaiyer, Abhishek Joshi, Saisha Narang, Vinay P. Namboodiri
Hatching is a common method used by artists to accentuate the third dimension of a sketch, and to illuminate the scene.
1 code implementation • 26 Aug 2020 • Samik Some, Mithun Das Gupta, Vinay P. Namboodiri
We present a determinantal point process (DPP) inspired alternative to non-maximum suppression (NMS) which has become an integral step in all state-of-the-art object detection frameworks.
no code implementations • 11 Aug 2020 • Jerin Philip, Shashank Siripragada, Vinay P. Namboodiri, C. V. Jawahar
Through this paper, we provide and analyse an automated framework to obtain such a corpus for Indian language neural machine translation (NMT) systems.
2 code implementations • LREC 2020 • Shashank Siripragada, Jerin Philip, Vinay P. Namboodiri, C. V. Jawahar
We present sentence aligned parallel corpora across 10 Indian Languages - Hindi, Telugu, Tamil, Malayalam, Gujarati, Urdu, Bengali, Oriya, Marathi, Punjabi, and English - many of which are categorized as low resource.
no code implementations • 9 Jul 2020 • Prem Raj, Vinay P. Namboodiri, L. Behera
The idea of switching a CNN is due to the fact that the dataset for training a relative camera pose regressor for visual servo control must contain variations in relative pose ranging from a very small scale to eventually a larger scale.
no code implementations • 29 Jun 2020 • Pratik Mazumder, Pravendra Singh, Vinay P. Namboodiri
We then simultaneously train for the composite rotation prediction task along with the original classification task, which forces the network to learn high-quality generic features that help improve the few-shot classification performance.
no code implementations • 8 Jun 2020 • Pravendra Singh, Pratik Mazumder, Vinay P. Namboodiri
Our proposed technique, namely Passive Batch Injection Training Technique (PBITT), even reduces the level of overfitting in networks that already use the standard techniques for reducing overfitting such as $L_2$ regularization and batch normalization, resulting in significant accuracy improvements.
no code implementations • 27 May 2020 • Pratik Mazumder, Pravendra Singh, Kranti Kumar Parida, Vinay P. Namboodiri
We use the semantic relatedness of text embeddings as a means for zero-shot learning by aligning audio and video embeddings with the corresponding class label text feature space.
no code implementations • 23 Jan 2020 • Badri N. Patro, Vinod K. Kurmi, Sandeep Kumar, Vinay P. Namboodiri
This is a Bayesian framework and the results show a remarkable similarity to natural questions as validated by a human study.
no code implementations • 23 Jan 2020 • Badri N. Patro, Mayank Lunayach, Vinay P. Namboodiri
These have two-fold benefits: a) improvement in obtaining the certainty estimates that correlate better with misclassified samples and b) improved attention maps that provide state-of-the-art results in terms of correlation with human attention regions.
1 code implementation • 23 Jan 2020 • Badri N. Patro, Shivansh Pate, Vinay P. Namboodiri
Our model explains the answers obtained through a VQA model by providing visual and textual explanations.
no code implementations • 15 Jan 2020 • Vinay Kumar Verma, Pravendra Singh, Vinay P. Namboodiri, Piyush Rai
The pruner is essentially a multitask deep neural network with binary outputs that help identify the filters from each layer of the original network that do not have any significant contribution to the model and can therefore be pruned.
no code implementations • 5 Jan 2020 • Pravendra Singh, Munender Varshney, Vinay P. Namboodiri
In this paper, we have proposed a cooperative initialization for training the deep network using ReLU activation function to improve the network performance.
1 code implementation • 31 Dec 2019 • Badri N. Patro, Dev Chauhan, Vinod K. Kurmi, Vinay P. Namboodiri
One way to ensure this is by adding constraints for true paraphrase embeddings to be close and unrelated paraphrase candidate sentence embeddings to be far.
no code implementations • 19 Dec 2019 • Badri N. Patro, Vinay P. Namboodiri
Specifically, we incorporate exemplar based approaches and show that an exemplar based module can be incorporated in almost any of the deep learning architectures proposed in the literature and the addition of such a block results in improved performance for solving these tasks.
no code implementations • 17 Dec 2019 • Yatin Dandi, Aniket Das, Soumye Singhal, Vinay P. Namboodiri, Piyush Rai
The proposed model allows minor variations in content across frames while maintaining the temporal dependence through latent vectors encoding the pose or motion features.
no code implementations • 19 Nov 2019 • Badri N. Patro, Anupriy, Vinay P. Namboodiri
It also results in a good improvement in rank correlation metric on the VQA task.
no code implementations • 17 Oct 2019 • Raghav Brahmadesam Venkataramaiyer, Subham Kumar, Vinay P. Namboodiri
Line art is arguably one of the fundamental and versatile modes of expression.
no code implementations • 14 Oct 2019 • Soumik Dasgupta, Badri N. Patro, Vinay P. Namboodiri
In this work, we show that Dynamic Attention helps in achieving grounding and also aids in the policy learning objective.
no code implementations • 13 Oct 2019 • Badri N. Patro, Shivansh Patel, Vinay P. Namboodiri
Particularly, in this work, we propose a new method Granular Multi-modal Attention, where we aim to particularly address the question of the right granularity at which one needs to attend while solving the Visual Dialog task.
no code implementations • 11 Sep 2019 • Badri N. Patro, Anupriy, Vinay P. Namboodiri
In this paper, we propose a probabilistic framework for solving the task of `Visual Dialog'.
Ranked #1 on
Common Sense Reasoning
on Visual Dialog v0.9
no code implementations • ICCV 2019 • Badri N. Patro, Mayank Lunayach, Shivansh Patel, Vinay P. Namboodiri
These have two-fold benefits: a) improvement in obtaining the certainty estimates that correlate better with misclassified samples and b) improved attention maps that provide state-of-the-art results in terms of correlation with human attention regions.
no code implementations • 29 Jul 2019 • Jerin Philip, Vinay P. Namboodiri, C. V. Jawahar
We present a simple, yet effective, Neural Machine Translation system for Indian languages.
1 code implementation • 24 Jul 2019 • Vinod Kumar Kurmi, Vipul Bajaj, Venkatesh K Subramanian, Vinay P. Namboodiri
However, here we suggest that rather than using a point estimate, it would be useful if a distribution based discriminator could be used to bridge this gap.
Ranked #29 on
Domain Adaptation
on Office-31
1 code implementation • CVPR 2019 • Vinod Kumar Kurmi, Shanu Kumar, Vinay P. Namboodiri
In this paper, we aim to solve for unsupervised domain adaptation of classifiers where we have access to label information for the source domain while these are not available for a target domain.
Ranked #10 on
Domain Adaptation
on ImageCLEF-DA
no code implementations • 24 May 2019 • Aadil Hayat, Utsav Singh, Vinay P. Namboodiri
Recent advances in reinforcement learning have proved that given an environment we can learn to perform a task in that environment if we have access to some form of a reward function (dense, sparse or derived from IRL).
1 code implementation • 11 May 2019 • Pravendra Singh, Vinay Kumar Verma, Piyush Rai, Vinay P. Namboodiri
Our framework, called Play and Prune (PP), jointly prunes and fine-tunes CNN model parameters, with an adaptive pruning rate, while maintaining the model's predictive performance.
no code implementations • 14 Apr 2019 • Abhishek Joshi, Vinay P. Namboodiri
Finally through this synthesis approach we obtain a comparable set of abnormal samples that can be used for training the CNN for the classification of normal vs abnormal samples.
1 code implementation • 2 Apr 2019 • Vinod Kumar Kurmi, Vinay P. Namboodiri
Our observation relies on the analysis that shows that if the discriminator has access to all the information available including the class structure present in the source dataset, then it can guide the transformation of features of the target set of classes to a more structure adapted space.
Ranked #13 on
Domain Adaptation
on ImageCLEF-DA
no code implementations • PACLIC 2018 • Jerin Philip, Vinay P. Namboodiri, C. V. Jawahar
This document describes the machine translation system used in the submissions of IIIT-Hyderabad CVIT-MT for the WAT-2018 English-Hindi translation task.
no code implementations • 11 Mar 2019 • Pravendra Singh, Pratik Mazumder, Vinay P. Namboodiri
Recently researchers have tried to boost the performance of CNNs by re-calibrating the feature maps produced by these filters, e. g., Squeeze-and-Excitation Networks (SENets).
1 code implementation • CVPR 2019 • Pravendra Singh, Vinay Kumar Verma, Piyush Rai, Vinay P. Namboodiri
We present a novel deep learning architecture in which the convolution operation leverages heterogeneous kernels.
no code implementations • 5 Feb 2019 • Saurabh Srivastava, Vinay P. Namboodiri, T. V. Prabhakar
AI intensive systems that operate upon user data face the challenge of balancing data utility with privacy concerns.
no code implementations • 26 Nov 2018 • Pravendra Singh, Vinay Kumar Verma, Piyush Rai, Vinay P. Namboodiri
We present a filter correlation based model compression approach for deep convolutional neural networks.
no code implementations • 20 Nov 2018 • Pravendra Singh, Vinay Sameer Raja Kadi, Nikhil Verma, Vinay P. Namboodiri
Convolutional neural networks (CNN) have achieved impressive performance on the wide variety of tasks (classification, detection, etc.)
no code implementations • 20 Nov 2018 • Pravendra Singh, Manikandan. R, Neeraj Matiyali, Vinay P. Namboodiri
Additionally, we also empirically show our method's adaptability for classification based architecture VGG16 on datasets CIFAR and German Traffic Sign Recognition Benchmark (GTSRB) achieving a compression rate of 125X and 200X with the reduction in flops by 90. 50% and 96. 6% respectively with no loss of accuracy.
1 code implementation • EMNLP 2018 • Badri N. Patro, Sandeep Kumar, Vinod K. Kurmi, Vinay P. Namboodiri
Generating natural questions from an image is a semantic task that requires using visual and language modality to learn multimodal representations.
2 code implementations • COLING 2018 • Badri N. Patro, Vinod K. Kurmi, Sandeep Kumar, Vinay P. Namboodiri
One way to ensure this is by adding constraints for true paraphrase embeddings to be close and unrelated paraphrase candidate sentence embeddings to be far.
1 code implementation • CVPR 2018 • Badri Patro, Vinay P. Namboodiri
A number of methods have focused on solving this problem by using image based attention.
1 code implementation • 2 Feb 2018 • Shashank Sharma, Vinay P. Namboodiri
Given a set of data that has an imbalance in the distribution, the networks are susceptible to missing modes and not capturing the data distribution.
no code implementations • ICLR 2018 • Prannay Khosla, Preethi Jyothi, Vinay P. Namboodiri, Mukundhan Srinivasan
In this paper, we propose the generation of accented speech using generative adversarial networks.
no code implementations • 23 Sep 2017 • Unnat Jain, Vinay P. Namboodiri, Gaurav Pandey
The modified system learns (in a supervised setting) compact binary codes from image feature descriptors.
1 code implementation • 13 Apr 2017 • Prabuddha Chakraborty, Vinay P. Namboodiri
In this paper we propose a technique for obtaining coarse pose estimation of humans in an image that does not require any manual supervision.
no code implementations • 22 Nov 2016 • Soumya Roy, Vinay P. Namboodiri, Arijit Biswas
Previous works on object detection model the problem as a structured regression problem which ranks the correct bounding boxes more than the background ones.
no code implementations • 26 Mar 2016 • Ayush Mittal, Anant Raj, Vinay P. Namboodiri, Tinne Tuytelaars
Most methods for unsupervised domain adaptation proposed in the literature to date, assume that the set of classes present in the target domain is identical to the set of classes present in the source domain.
no code implementations • 20 Jul 2015 • Anant Raj, Vinay P. Namboodiri, Tinne Tuytelaars
In this paper, we propose subspace alignment based domain adaptation of the state of the art RCNN based object detector.
no code implementations • 16 Jan 2015 • Anant Raj, Vinay P. Namboodiri, Tinne Tuytelaars
Domain adaptation techniques aim at adapting a classifier learnt on a source domain to work on the target domain.
no code implementations • CVPR 2014 • Hakan Bilen, Marco Pedersoli, Vinay P. Namboodiri, Tinne Tuytelaars, Luc van Gool
In classification of objects substantial work has gone into improving the low level representation of an image by considering various aspects such as different features, a number of feature pooling and coding techniques and considering different kernels.