Search Results for author: Raghav Goyal

Found 9 papers, 3 papers with code

TAM-VT: Transformation-Aware Multi-scale Video Transformer for Segmentation and Tracking

no code implementations13 Dec 2023 Raghav Goyal, Wan-Cyuan Fan, Mennatullah Siam, Leonid Sigal

In this work we propose a novel, clip-based DETR-style encoder-decoder architecture, which focuses on systematically analyzing and addressing aforementioned challenges.

Semantic Segmentation Video Object Segmentation +1

MINOTAUR: Multi-task Video Grounding From Multimodal Queries

no code implementations16 Feb 2023 Raghav Goyal, Effrosyni Mavroudi, Xitong Yang, Sainbayar Sukhbaatar, Leonid Sigal, Matt Feiszli, Lorenzo Torresani, Du Tran

Video understanding tasks take many forms, from action detection to visual query localization and spatio-temporal grounding of sentences.

Action Detection Sentence +2

Beyond Simple Meta-Learning: Multi-Purpose Models for Multi-Domain, Active and Continual Few-Shot Learning

2 code implementations13 Jan 2022 Peyman Bateni, Jarred Barber, Raghav Goyal, Vaden Masrani, Jan-Willem van de Meent, Leonid Sigal, Frank Wood

The first method, Simple CNAPS, employs a hierarchically regularized Mahalanobis-distance based classifier combined with a state of the art neural adaptive feature extractor to achieve strong performance on Meta-Dataset, mini-ImageNet and tiered-ImageNet benchmarks.

Active Learning continual few-shot learning +3

UniT: Unified Knowledge Transfer for Any-shot Object Detection and Segmentation

no code implementations CVPR 2021 Siddhesh Khandelwal, Raghav Goyal, Leonid Sigal

Weakly-supervised approaches draw on image-level labels to build detectors/segmentors, while zero/few-shot methods assume abundant instance-level data for a set of base classes, and none to a few examples for novel classes.

object-detection Object Detection +1

Improved Few-Shot Visual Classification

2 code implementations CVPR 2020 Peyman Bateni, Raghav Goyal, Vaden Masrani, Frank Wood, Leonid Sigal

Few-shot learning is a fundamental task in computer vision that carries the promise of alleviating the need for exhaustively labeled data.

Classification Few-Shot Image Classification +3

Natural Language Generation through Character-based RNNs with Finite-state Prior Knowledge

no code implementations COLING 2016 Raghav Goyal, Marc Dymetman, Eric Gaussier

Recently Wen et al. (2015) have proposed a Recurrent Neural Network (RNN) approach to the generation of utterances from dialog acts, and shown that although their model requires less effort to develop than a rule-based system, it is able to improve certain aspects of the utterances, in particular their naturalness.

Attribute Language Modelling +3

Cannot find the paper you are looking for? You can Submit a new open access paper.