Search Results for author: Aravind Srinivas

Found 27 papers, 19 papers with code

Attend, Adapt and Transfer: Attentive Deep Architecture for Adaptive Transfer from multiple sources in the same domain

2 code implementations • 10 Oct 2015 • Janarthanan Rajendran, Aravind Srinivas, Mitesh M. Khapra, P. Prasanna, Balaraman Ravindran

Second, the agent should be able to selectively transfer, which is the ability to select and transfer from different and multiple source tasks for different parts of the state space of the target task.

Paper
Code

Dynamic Frame skip Deep Q Network

no code implementations • 17 May 2016 • Aravind Srinivas, Sahil Sharma, Balaraman Ravindran

Deep Reinforcement Learning methods have achieved state of the art performance in learning control policies for the games in the Atari 2600 domain.

Atari Games

Paper
Add Code

Option Discovery in Hierarchical Reinforcement Learning using Spatio-Temporal Clustering

no code implementations • 17 May 2016 • Aravind Srinivas, Ramnandan Krishnamurthy, Peeyush Kumar, Balaraman Ravindran

This paper introduces an automated skill acquisition framework in reinforcement learning which involves identifying a hierarchical description of the given task in terms of abstract states and extended actions between abstract states.

Clustering Hierarchical Reinforcement Learning +3

Paper
Add Code

Learning to Repeat: Fine Grained Action Repetition for Deep Reinforcement Learning

no code implementations • 20 Feb 2017 • Sahil Sharma, Aravind Srinivas, Balaraman Ravindran

Reinforcement Learning algorithms can learn complex behavioral patterns for sequential decision making tasks wherein an agent interacts with an environment and acquires feedback in the form of rewards sampled from it.

Car Racing Decision Making +2

Paper
Add Code

Universal Planning Networks

1 code implementation • 2 Apr 2018 • Aravind Srinivas, Allan Jabri, Pieter Abbeel, Sergey Levine, Chelsea Finn

We find that the representations learned are not only effective for goal-directed visual imitation via gradient-based trajectory optimization, but can also provide a metric for specifying goals using images.

Imitation Learning Representation Learning +1

Paper
Code

Universal Planning Networks: Learning Generalizable Representations for Visuomotor Control

1 code implementation • ICML 2018 • Aravind Srinivas, Allan Jabri, Pieter Abbeel, Sergey Levine, Chelsea Finn

A key challenge in complex visuomotor control is learning abstract representations that are effective for specifying goals, planning, and generalization.

Imitation Learning

Paper
Code

Flow++: Improving Flow-Based Generative Models with Variational Dequantization and Architecture Design

4 code implementations • ICLR 2019 • Jonathan Ho, Xi Chen, Aravind Srinivas, Yan Duan, Pieter Abbeel

Flow-based generative models are powerful exact likelihood models with efficient sampling and inference.

Ranked #14 on Image Generation on ImageNet 32x32 (bpd metric)

Computational Efficiency Density Estimation +1

182

Paper
Code

Data-Efficient Image Recognition with Contrastive Predictive Coding

4 code implementations • ICML 2020 • Olivier J. Hénaff, Aravind Srinivas, Jeffrey De Fauw, Ali Razavi, Carl Doersch, S. M. Ali Eslami, Aaron van den Oord

Human observers can learn to recognize new categories of images from a handful of examples, yet doing so with artificial ones remains an open challenge.

Ranked #6 on Contrastive Learning on imagenet-1k

Contrastive Learning General Classification +5

394

Paper
Code

PatchFormer: A neural architecture for self-supervised representation learning on images

no code implementations • 25 Sep 2019 • Aravind Srinivas, Pieter Abbeel

In this paper, we propose a neural architecture for self-supervised representation learning on raw images called the PatchFormer which learns to model spatial dependencies across patches in a raw image.

Representation Learning Self-Supervised Learning

Paper
Add Code

CURL: Contrastive Unsupervised Representations for Reinforcement Learning

7 code implementations • 8 Apr 2020 • Aravind Srinivas, Michael Laskin, Pieter Abbeel

On the DeepMind Control Suite, CURL is the first image-based algorithm to nearly match the sample-efficiency of methods that use state-based features.

Ranked #1 on Continuous Control on Finger, spin (DMControl500k)

Atari Games Atari Games 100k +4

2,513

Paper
Code

Reinforcement Learning with Augmented Data

2 code implementations • NeurIPS 2020 • Michael Laskin, Kimin Lee, Adam Stooke, Lerrel Pinto, Pieter Abbeel, Aravind Srinivas

To this end, we present Reinforcement Learning with Augmented Data (RAD), a simple plug-and-play module that can enhance most RL algorithms.

Data Augmentation OpenAI Gym +2

397

Paper
Code

SUNRISE: A Simple Unified Framework for Ensemble Learning in Deep Reinforcement Learning

1 code implementation • 9 Jul 2020 • Kimin Lee, Michael Laskin, Aravind Srinivas, Pieter Abbeel

Off-policy deep reinforcement learning (RL) has been successful in a range of challenging domains.

Efficient Exploration Ensemble Learning +3

118

Paper
Code

SelfAugment: Automatic Augmentation Policies for Self-Supervised Learning

1 code implementation • CVPR 2021 • Colorado J Reed, Sean Metzger, Aravind Srinivas, Trevor Darrell, Kurt Keutzer

A common practice in unsupervised representation learning is to use labeled data to evaluate the quality of the learned representations.

Data Augmentation Representation Learning +1

Paper
Code

D2RL: Deep Dense Architectures in Reinforcement Learning

4 code implementations • 19 Oct 2020 • Samarth Sinha, Homanga Bharadhwaj, Aravind Srinivas, Animesh Garg

While improvements in deep learning architectures have played a crucial role in improving the state of supervised and unsupervised learning in computer vision and natural language processing, neural network architecture choices for reinforcement learning remain relatively under-explored.

reinforcement-learning Reinforcement Learning (RL)

247

Paper
Code

Simple Copy-Paste is a Strong Data Augmentation Method for Instance Segmentation

5 code implementations • CVPR 2021 • Golnaz Ghiasi, Yin Cui, Aravind Srinivas, Rui Qian, Tsung-Yi Lin, Ekin D. Cubuk, Quoc V. Le, Barret Zoph

Our baseline model outperforms the LVIS 2020 Challenge winning entry by +3. 6 mask AP on rare categories.

Ranked #1 on Object Detection on PASCAL VOC 2007

Image Augmentation Instance Segmentation +3

38,330

Paper
Code

Weighted Bellman Backups for Improved Signal-to-Noise in Q-Updates

no code implementations • 1 Jan 2021 • Kimin Lee, Michael Laskin, Aravind Srinivas, Pieter Abbeel

Furthermore, since our weighted Bellman backups rely on maintaining an ensemble, we investigate how weighted Bellman backups interact with other benefits previously derived from ensembles: (a) Bootstrap; (b) UCB Exploration.

Q-Learning Reinforcement Learning (RL)

Paper
Add Code

Compute- and Memory-Efficient Reinforcement Learning with Latent Experience Replay

no code implementations • 1 Jan 2021 • Lili Chen, Kimin Lee, Aravind Srinivas, Pieter Abbeel

In this paper, we present Latent Vector Experience Replay (LeVER), a simple modification of existing off-policy RL methods, to address these computational and memory requirements without sacrificing the performance of RL agents.

Atari Games reinforcement-learning +2

Paper
Add Code

VideoGen: Generative Modeling of Videos using VQ-VAE and Transformers

no code implementations • 1 Jan 2021 • Yunzhi Zhang, Wilson Yan, Pieter Abbeel, Aravind Srinivas

We present VideoGen: a conceptually simple architecture for scaling likelihood based generative modeling to natural videos.

Position Video Generation

Paper
Add Code

R-LAtte: Attention Module for Visual Control via Reinforcement Learning

no code implementations • 1 Jan 2021 • Mandi Zhao, Qiyang Li, Aravind Srinivas, Ignasi Clavera, Kimin Lee, Pieter Abbeel

Attention mechanisms are generic inductive biases that have played a critical role in improving the state-of-the-art in supervised learning, unsupervised pre-training and generative modeling for multiple domains including vision, language and speech.

reinforcement-learning Reinforcement Learning (RL) +1

Paper
Add Code

Reinforcement Learning with Latent Flow

2 code implementations • NeurIPS 2021 • Wenling Shang, Xiaofei Wang, Aravind Srinivas, Aravind Rajeswaran, Yang Gao, Pieter Abbeel, Michael Laskin

Temporal information is essential to learning effective policies with Reinforcement Learning (RL).

Ranked #1 on Montezuma's Revenge on Atari 2600 Montezuma's Revenge

Continuous Control Montezuma's Revenge +4

Paper
Code

Bottleneck Transformers for Visual Recognition

13 code implementations • CVPR 2021 • Aravind Srinivas, Tsung-Yi Lin, Niki Parmar, Jonathon Shlens, Pieter Abbeel, Ashish Vaswani

Finally, we present a simple adaptation of the BoTNet design for image classification, resulting in models that achieve a strong performance of 84. 7% top-1 accuracy on the ImageNet benchmark while being up to 1. 64x faster in compute time than the popular EfficientNet models on TPU-v3 hardware.

Ranked #52 on Instance Segmentation on COCO minival

Image Classification Instance Segmentation +3

29,671

Paper
Code

Improving Computational Efficiency in Visual Reinforcement Learning via Stored Embeddings

1 code implementation • NeurIPS 2021 • Lili Chen, Kimin Lee, Aravind Srinivas, Pieter Abbeel

Recent advances in off-policy deep reinforcement learning (RL) have led to impressive success in complex tasks from visual observations.

Ranked #33 on Atari Games on Atari 2600 Amidar

Atari Games Computational Efficiency +3

Paper
Code

Revisiting ResNets: Improved Training and Scaling Strategies

3 code implementations • NeurIPS 2021 • Irwan Bello, William Fedus, Xianzhi Du, Ekin D. Cubuk, Aravind Srinivas, Tsung-Yi Lin, Jonathon Shlens, Barret Zoph

Using improved training and scaling strategies, we design a family of ResNet architectures, ResNet-RS, which are 1. 7x - 2. 7x faster than EfficientNets on TPUs, while achieving similar accuracies on ImageNet.

Ranked #1 on Semantic Object Interaction Classification on Kinetics-700

Action Classification Document Image Classification +2

29,671

Paper
Code

Scaling Local Self-Attention for Parameter Efficient Visual Backbones

7 code implementations • CVPR 2021 • Ashish Vaswani, Prajit Ramachandran, Aravind Srinivas, Niki Parmar, Blake Hechtman, Jonathon Shlens

Self-attention models have recently been shown to have encouraging improvements on accuracy-parameter trade-offs compared to baseline convolutional models such as ResNet-50.

Ranked #209 on Image Classification on ImageNet

Image Classification Instance Segmentation +4

29,671

Paper
Code

VideoGPT: Video Generation using VQ-VAE and Transformers

3 code implementations • 20 Apr 2021 • Wilson Yan, Yunzhi Zhang, Pieter Abbeel, Aravind Srinivas

We present VideoGPT: a conceptually simple architecture for scaling likelihood based generative modeling to natural videos.

Ranked #3 on Video Generation on UCF-101 16 frames, 128x128, Unconditional

Position Video Generation

874

Paper
Code

Decision Transformer: Reinforcement Learning via Sequence Modeling

16 code implementations • NeurIPS 2021 • Lili Chen, Kevin Lu, Aravind Rajeswaran, Kimin Lee, Aditya Grover, Michael Laskin, Pieter Abbeel, Aravind Srinivas, Igor Mordatch

In particular, we present Decision Transformer, an architecture that casts the problem of RL as conditional sequence modeling.

Ranked #3 on Offline RL on D4RL

Atari Games D4RL +5

2,513

Paper
Code

CURL: Contrastive Unsupervised Representation Learning for Reinforcement Learning

1 code implementation • ICML 2020 • Michael Laskin, Pieter Abbeel, Aravind Srinivas

CURL extracts high level features from raw pixels using a contrastive learning objective and performs off-policy control on top of the extracted features.

Contrastive Learning reinforcement-learning +2

555

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.