Search Results for author: Vivek Rathod

Found 14 papers, 10 papers with code

Speed/accuracy trade-offs for modern convolutional object detectors

14 code implementations • CVPR 2017 • Jonathan Huang, Vivek Rathod, Chen Sun, Menglong Zhu, Anoop Korattikara, Alireza Fathi, Ian Fischer, Zbigniew Wojna, Yang Song, Sergio Guadarrama, Kevin Murphy

On the opposite end in which accuracy is critical, we present a detector that achieves state-of-the-art performance measured on the COCO detection task.

Ranked #209 on Object Detection on COCO test-dev (using extra training data)

Object object-detection +1


Pooling Pyramid Network for Object Detection

2 code implementations • 9 Jul 2018 • Pengchong Jin, Vivek Rathod, Xiangxin Zhu

We share box predictors across all scales, and replace convolution between scales with max pooling.

Object object-detection +1


Context R-CNN: Long Term Temporal Context for Per-Camera Object Detection

3 code implementations • CVPR 2020 • Sara Beery, Guanhang Wu, Vivek Rathod, Ronny Votel, Jonathan Huang

In this paper we propose a method that leverages temporal context from the unlabeled frames of a novel camera to improve performance at that camera.

object-detection Video Object Detection +1


The surprising impact of mask-head architecture on novel class segmentation

3 code implementations • ICCV 2021 • Vighnesh Birodkar, Zhichao Lu, Siyang Li, Vivek Rathod, Jonathan Huang

Under this family, we study Mask R-CNN and discover that instead of its default strategy of training the mask-head with a combination of proposals and groundtruth boxes, training the mask-head with only groundtruth boxes dramatically improves its performance on novel classes.

Instance Segmentation Segmentation +1


RetinaTrack: Online Single Stage Joint Detection and Tracking

1 code implementation • CVPR 2020 • Zhichao Lu, Vivek Rathod, Ronny Votel, Jonathan Huang

Traditionally, multi-object tracking and object detection are performed using separate systems, with most prior works focusing exclusively on one of these aspects over the other.

Ranked #1 on Multiple Object Tracking on Waymo Open Dataset

Autonomous Driving Multi-Object Tracking +3


Semantic Instance Segmentation via Deep Metric Learning

1 code implementation • 30 Mar 2017 • Alireza Fathi, Zbigniew Wojna, Vivek Rathod, Peng Wang, Hyun Oh Song, Sergio Guadarrama, Kevin P. Murphy

We propose a new method for semantic instance segmentation, by first computing how likely two pixels are to belong to the same object, and then by grouping similar pixels together.

Ranked #3 on Object Proposal Generation on PASCAL VOC 2012, 60 proposals per image

Instance Segmentation Metric Learning +3


What's Cookin'? Interpreting Cooking Videos using Text, Speech and Vision

1 code implementation • 5 Mar 2015 • Jonathan Malmaud, Jonathan Huang, Vivek Rathod, Nick Johnston, Andrew Rabinovich, Kevin Murphy

We present a novel method for aligning a sequence of instructions to a video of someone carrying out a task.

Keyword Spotting


What’s Cookin’? Interpreting Cooking Videos using Text, Speech and Vision

1 code implementation • HLT 2015 • Jonathan Malmaud, Jonathan Huang, Vivek Rathod, Nicholas Johnston, Andrew Rabinovich, Kevin Murphy

Keyword Spotting


Deep Metric Learning via Facility Location

1 code implementation • CVPR 2017 • Hyun Oh Song, Stefanie Jegelka, Vivek Rathod, Kevin Murphy

Learning the representation and the similarity metric in an end-to-end fashion with deep networks has demonstrated outstanding results for clustering and retrieval.

Clustering Metric Learning +2


Bayesian Dark Knowledge

1 code implementation • NeurIPS 2015 • Anoop Korattikara, Vivek Rathod, Kevin Murphy, Max Welling

We consider the problem of Bayesian parameter estimation for deep neural networks, which is important in problem settings where we may have little data, and/or where we need accurate posterior predictive densities, e.g., for applications involving bandits or active learning.

Active Learning


Im2Calories: Towards an Automated Mobile Vision Food Diary

no code implementations • ICCV 2015 • Austin Meyers, Nick Johnston, Vivek Rathod, Anoop Korattikara, Alex Gorban, Nathan Silberman, Sergio Guadarrama, George Papandreou, Jonathan Huang, Kevin P. Murphy

We present a system which can recognize the contents of your meal from a single image, and then predict its nutritional contents, such as calories.


DOPS: Learning to Detect 3D Objects and Predict their 3D Shapes

no code implementations • CVPR 2020 • Mahyar Najibi, Guangda Lai, Abhijit Kundu, Zhichao Lu, Vivek Rathod, Thomas Funkhouser, Caroline Pantofaru, David Ross, Larry S. Davis, Alireza Fathi

In contrast, we propose a general-purpose method that works on both indoor and outdoor scenes.

3D Object Detection Autonomous Driving +2


The Auto Arborist Dataset: A Large-Scale Benchmark for Multiview Urban Forest Monitoring Under Domain Shift

no code implementations • CVPR 2022 • Sara Beery, Guanhang Wu, Trevor Edwards, Filip Pavetic, Bo Majewski, Shreyasee Mukherjee, Stanley Chan, John Morgan, Vivek Rathod, Jonathan Huang

We introduce baseline results on our dataset across modalities as well as metrics for the detailed analysis of generalization with respect to geographic distribution shifts, vital for such a system to be deployed at-scale.

Management


Open-Vocabulary Temporal Action Detection with Off-the-Shelf Image-Text Features

no code implementations • 20 Dec 2022 • Vivek Rathod, Bryan Seybold, Sudheendra Vijayanarasimhan, Austin Myers, Xiuye Gu, Vighnesh Birodkar, David A. Ross

Detecting actions in untrimmed videos should not be limited to a small, closed set of classes.

Action Detection Optical Flow Estimation

