Search Results for author: Sai Saketh Rambhatla

Found 11 papers, 2 papers with code

Detecting Human-Object Interactions via Functional Generalization

no code implementations5 Apr 2019 Ankan Bansal, Sai Saketh Rambhatla, Abhinav Shrivastava, Rama Chellappa

We present an approach for detecting human-object interactions (HOIs) in images, based on the idea that humans interact with functionally similar objects in a similar manner.

Human-Object Interaction Detection Object

Spatial Priming for Detecting Human-Object Interactions

no code implementations9 Apr 2020 Ankan Bansal, Sai Saketh Rambhatla, Abhinav Shrivastava, Rama Chellappa

The proposed method consists of a layout module which primes a visual module to predict the type of interaction between a human and an object.

Human-Object Interaction Detection Object

The Pursuit of Knowledge: Discovering and Localizing Novel Categories using Dual Memory

no code implementations ICCV 2021 Sai Saketh Rambhatla, Rama Chellappa, Abhinav Shrivastava

We tackle object category discovery, which is the problem of discovering and localizing novel objects in a large unlabeled dataset.

Object

To Boost or not to Boost: On the Limits of Boosted Neural Networks

no code implementations28 Jul 2021 Sai Saketh Rambhatla, Michael Jones, Rama Chellappa

Boosting is a method for finding a highly accurate hypothesis by linearly combining many ``weak" hypotheses, each of which may be only moderately accurate.

Object Recognition

Self-Denoising Neural Networks for Few Shot Learning

no code implementations26 Oct 2021 Steven Schwarcz, Sai Saketh Rambhatla, Rama Chellappa

This architecture, which we call a Self-Denoising Neural Network (SDNN), can be applied easily to most modern convolutional neural architectures, and can be used as a supplement to many existing few-shot learning techniques.

Action Detection Denoising +1

MOST: Multiple Object localization with Self-supervised Transformers for object discovery

no code implementations ICCV 2023 Sai Saketh Rambhatla, Ishan Misra, Rama Chellappa, Abhinav Shrivastava

In this work, we present Multiple Object localization with Self-supervised Transformers (MOST) that uses features of transformers trained using self-supervised learning to localize multiple objects in real world images.

Object object-detection +6

Emu Video: Factorizing Text-to-Video Generation by Explicit Image Conditioning

no code implementations17 Nov 2023 Rohit Girdhar, Mannat Singh, Andrew Brown, Quentin Duval, Samaneh Azadi, Sai Saketh Rambhatla, Akbar Shah, Xi Yin, Devi Parikh, Ishan Misra

We present Emu Video, a text-to-video generation model that factorizes the generation into two steps: first generating an image conditioned on the text, and then generating a video conditioned on the text and the generated image.

Text-to-Video Generation Video Generation

SelfEval: Leveraging the discriminative nature of generative models for evaluation

no code implementations17 Nov 2023 Sai Saketh Rambhatla, Ishan Misra

In this work, we show that text-to-image generative models can be 'inverted' to assess their own text-image understanding capabilities in a completely automated manner.

Attribute Visual Reasoning

Cannot find the paper you are looking for? You can Submit a new open access paper.