Search Results for author: Shyamgopal Karthik

Found 13 papers, 9 papers with code

Vision-by-Language for Training-Free Compositional Image Retrieval

1 code implementation • 13 Oct 2023 • Shyamgopal Karthik, Karsten Roth, Massimiliano Mancini, Zeynep Akata

Finally, we show that CIReVL makes CIR human-understandable by composing image and text in a modular fashion in the language domain, thereby making it intervenable and allowing failure cases to be re-aligned post hoc.

Ranked #1 on Zero-Shot Composed Image Retrieval (ZS-CIR) on CIRCO

Image Retrieval • Retrieval • +1

ProbVLM: Probabilistic Adapter for Frozen Vision-Language Models

1 code implementation • ICCV 2023 • Uddeshya Upadhyay, Shyamgopal Karthik, Massimiliano Mancini, Zeynep Akata

We propose ProbVLM, a probabilistic adapter that estimates probability distributions for the embeddings of pre-trained VLMs via inter/intra-modal alignment in a post-hoc manner, without needing large-scale datasets or compute.

Active Learning • Model Selection • +1

If at First You Don't Succeed, Try, Try Again: Faithful Diffusion-based Text-to-Image Generation by Selection

1 code implementation • 22 May 2023 • Shyamgopal Karthik, Karsten Roth, Massimiliano Mancini, Zeynep Akata

Despite their impressive capabilities, diffusion-based text-to-image (T2I) models can lack faithfulness to the text prompt, where generated images may not contain all the mentioned objects, attributes or relations.

Text-to-Image Generation

Test-Time Amendment with a Coarse Classifier for Fine-Grained Classification

1 code implementation • NeurIPS 2023 • Kanishk Jain, Shyamgopal Karthik, Vineet Gandhi

We investigate the problem of reducing mistake severity for fine-grained classification.

Avg Classification

BayesCap: Bayesian Identity Cap for Calibrated Uncertainty in Frozen Neural Networks

1 code implementation • 14 Jul 2022 • Uddeshya Upadhyay, Shyamgopal Karthik, Yanbei Chen, Massimiliano Mancini, Zeynep Akata

Moreover, many of the high-performing deep learning models that are already trained and deployed are non-Bayesian in nature and do not provide uncertainty estimates.

Autonomous Driving • Deblurring • +2

KG-SP: Knowledge Guided Simple Primitives for Open World Compositional Zero-Shot Learning

1 code implementation • CVPR 2022 • Shyamgopal Karthik, Massimiliano Mancini, Zeynep Akata

The goal of open-world compositional zero-shot learning (OW-CZSL) is to recognize compositions of states and objects in images, given only a subset of them during training and no prior on the unseen compositions.

Compositional Zero-Shot Learning • Missing Labels

Bringing Generalization to Deep Multi-View Pedestrian Detection

1 code implementation • 24 Sep 2021 • Jeet Vora, Swetanjal Dutta, Kanishk Jain, Shyamgopal Karthik, Vineet Gandhi

Multi-view Detection (MVD) is highly effective for occlusion reasoning in a crowded environment.

Ranked #2 on Multiview Detection on GMVD

Multiview Detection • Pedestrian Detection

Learning From Long-Tailed Data With Noisy Labels

no code implementations • 25 Aug 2021 • Shyamgopal Karthik, Jérome Revaud, Boris Chidlovskii

In addition, the resulting learned representations are remarkably robust to label noise when fine-tuned with an imbalance- and noise-resistant loss function.

Self-Supervised Learning

No Cost Likelihood Manipulation at Test Time for Making Better Mistakes in Deep Networks

1 code implementation • 1 Apr 2021 • Shyamgopal Karthik, Ameya Prabhu, Puneet K. Dokania, Vineet Gandhi

There has been increasing interest in building deep hierarchy-aware classifiers that aim to quantify and reduce the severity of mistakes, and not just reduce the number of errors.

Amending Mistakes Post-hoc in Deep Networks by Leveraging Class Hierarchies

no code implementations • ICLR 2021 • Shyamgopal Karthik, Ameya Prabhu, Puneet K. Dokania, Vineet Gandhi

There has been increasing interest in building deep hierarchy-aware classifiers, aiming to quantify and reduce the severity of mistakes and not just count the number of errors.

ViNet: Pushing the limits of Visual Modality for Audio-Visual Saliency Prediction

1 code implementation • 11 Dec 2020 • Samyak Jain, Pradeep Yarlagadda, Shreyank Jyoti, Shyamgopal Karthik, Ramanathan Subramanian, Vineet Gandhi

We also explore a variation of the ViNet architecture that augments the decoder with audio features.

Ranked #1 on Video Saliency Detection on MSU Video Saliency Prediction

Action Recognition • Saliency Prediction • +2

Simple Unsupervised Multi-Object Tracking

no code implementations • 4 Jun 2020 • Shyamgopal Karthik, Ameya Prabhu, Vineet Gandhi

Multi-object tracking has seen a lot of progress recently, albeit with substantial annotation costs for developing better and larger labeled datasets.

Multi-Object Tracking • Object

Exploring 3 R's of Long-term Tracking: Re-detection, Recovery and Reliability

no code implementations • 27 Oct 2019 • Shyamgopal Karthik, Abhinav Moudgil, Vineet Gandhi

Recent works have proposed several long-term tracking benchmarks and highlighted the importance of moving towards long-duration tracking to bridge the gap with application requirements.
