no code implementations • 28 Jan 2022 • Martin Bertran, Walter Talbott, Nitish Srivastava, Joshua Susskind
Learning generalizable policies from visual input in the presence of visual distractions is a challenging problem in reinforcement learning.
1 code implementation • 2 Dec 2021 • Nitish Srivastava, Walter Talbott, Martin Bertran Lopez, Shuangfei Zhai, Josh Susskind
Modeling the world can benefit robot learning by providing a rich training signal for shaping an agent's latent state space.
no code implementations • 29 Sep 2021 • Shuangfei Zhai, Walter Talbott, Nitish Srivastava, Chen Huang, Hanlin Goh, Ruixiang Zhang, Joshua M. Susskind
We introduce Dot Product Attention Free Transformer (DAFT), an efficient variant of Transformers (Vaswani et al., 2017) that eliminates the query-key dot product in self-attention.
Ranked #620 on Image Classification on ImageNet
6 code implementations • 28 May 2021 • Shuangfei Zhai, Walter Talbott, Nitish Srivastava, Chen Huang, Hanlin Goh, Ruixiang Zhang, Josh Susskind
We introduce Attention Free Transformer (AFT), an efficient variant of Transformers that eliminates the need for dot-product self-attention.
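The simplest variant described in the paper, AFT-simple, replaces the T×T query-key attention matrix with a per-feature softmax over keys and a sigmoid gate on the query. A minimal NumPy sketch of that formulation (the full AFT also adds learned pairwise position biases, omitted here):

```python
import numpy as np

def aft_simple(Q, K, V):
    """AFT-simple: Y_t = sigmoid(Q_t) * sum_t' softmax(K)_t' * V_t'.
    Q, K, V have shape (T, d). No T x T attention matrix is formed,
    so memory is O(T d) instead of O(T^2)."""
    K = K - K.max(axis=0, keepdims=True)                     # numerical stability
    attn = np.exp(K) / np.exp(K).sum(axis=0, keepdims=True)  # softmax over sequence axis, (T, d)
    pooled = (attn * V).sum(axis=0, keepdims=True)           # global value pooling, (1, d)
    gate = 1.0 / (1.0 + np.exp(-Q))                          # sigmoid(Q), (T, d)
    return gate * pooled                                     # (T, d)

rng = np.random.default_rng(0)
T, d = 5, 8
Y = aft_simple(rng.normal(size=(T, d)),
               rng.normal(size=(T, d)),
               rng.normal(size=(T, d)))
print(Y.shape)  # (5, 8)
```

Note that every position receives the same pooled context vector; only the query gate varies per position, which is what removes the quadratic cost.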
2 code implementations • 17 May 2021 • Yue Wu, Shuangfei Zhai, Nitish Srivastava, Joshua Susskind, Jian Zhang, Ruslan Salakhutdinov, Hanlin Goh
Offline Reinforcement Learning promises to learn effective policies from previously-collected, static datasets without the need for exploration.
1 code implementation • ICCV 2021 • Terrance DeVries, Miguel Angel Bautista, Nitish Srivastava, Graham W. Taylor, Joshua M. Susskind
In this paper, we introduce Generative Scene Networks (GSN), which learns to decompose scenes into a collection of many local radiance fields that can be rendered from a free moving camera.
Ranked #1 on Scene Generation on VizDoom
no code implementations • 1 Jan 2021 • Yue Wu, Shuangfei Zhai, Nitish Srivastava, Joshua M. Susskind, Jian Zhang, Ruslan Salakhutdinov, Hanlin Goh
Offline Reinforcement Learning promises to learn effective policies from previously-collected, static datasets without the need for exploration.
no code implementations • 27 Jun 2020 • Miguel Angel Bautista, Walter Talbott, Shuangfei Zhai, Nitish Srivastava, Joshua M. Susskind
State-of-the-art learning-based monocular 3D reconstruction methods learn priors over object categories on the training set, and as a result struggle to achieve reasonable generalization to object categories unseen during training.
2 code implementations • ICLR 2020 • Yao-Hung Hubert Tsai, Nitish Srivastava, Hanlin Goh, Ruslan Salakhutdinov
We introduce a new routing algorithm for capsule networks, in which a child capsule is routed to a parent based only on agreement between the parent's state and the child's vote.
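The routing rule described above can be sketched as an iterative inverted dot-product attention: each child scores its vote against every parent's current state and softmax-normalizes over parents. This is a simplified illustration (the paper additionally uses layer normalization and learned vote transforms, which are omitted here):

```python
import numpy as np

def softmax(z, axis=-1):
    z = z - z.max(axis=axis, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=axis, keepdims=True)

def route(votes, n_iters=3):
    """Agreement-based routing sketch.
    votes: (n_child, n_parent, d) -- child i's vote for parent j.
    Each child distributes its routing weight across parents via a
    softmax over agreements <parent_state, vote>; parent states are
    the routing-weighted sums of votes (unit-normalized here in
    place of the paper's layer norm)."""
    n_child, n_parent, d = votes.shape
    parents = np.zeros((n_parent, d))
    for _ in range(n_iters):
        agree = np.einsum('ijd,jd->ij', votes, parents)   # a[i, j] = <parent_j, vote_ij>
        r = softmax(agree, axis=1)                        # each child's weights sum to 1 over parents
        parents = np.einsum('ij,ijd->jd', r, votes)       # weighted sum of incoming votes
        parents /= np.linalg.norm(parents, axis=1, keepdims=True) + 1e-8
    return parents, r

rng = np.random.default_rng(1)
parents, r = route(rng.normal(size=(6, 3, 4)))
print(parents.shape, r.shape)  # (3, 4) (6, 3)
```

On the first iteration the parent states are zero, so routing starts uniform and sharpens as parents accumulate agreeing votes.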
no code implementations • 6 Dec 2019 • Nitish Srivastava, Hanlin Goh, Ruslan Salakhutdinov
The pose encodes where the entity is, while the feature encodes what it is.
no code implementations • 25 Mar 2015 • Elman Mansimov, Nitish Srivastava, Ruslan Salakhutdinov
We propose a new way of incorporating the temporal information present in videos into spatial Convolutional Neural Networks (ConvNets) trained on images, which avoids training spatio-temporal ConvNets from scratch.
no code implementations • 13 Mar 2015 • Shengxin Zha, Florian Luisier, Walter Andrews, Nitish Srivastava, Ruslan Salakhutdinov
Our proposed late fusion of CNN- and motion-based features can further increase the mean average precision (mAP) on MED'14 from 34.95% to 38.74%.
10 code implementations • 16 Feb 2015 • Nitish Srivastava, Elman Mansimov, Ruslan Salakhutdinov
We further evaluate the representations by finetuning them for a supervised learning problem: human action recognition on the UCF-101 and HMDB-51 datasets.
no code implementations • Journal of Machine Learning Research 2014 • Nitish Srivastava, Geoffrey Hinton, Alex Krizhevsky, Ilya Sutskever, Ruslan Salakhutdinov
The key idea is to randomly drop units (along with their connections) from the neural network during training.
no code implementations • NeurIPS 2014 • Yichuan Tang, Nitish Srivastava, Ruslan Salakhutdinov
Our model is a proper graphical model where the 2D Similarity transformation is a part of the top-down process.
no code implementations • NeurIPS 2013 • Nitish Srivastava, Ruslan R. Salakhutdinov
The tree structure can be used to impose a generative prior over classification parameters.
Ranked #183 on Image Classification on CIFAR-100
no code implementations • 26 Sep 2013 • Nitish Srivastava, Ruslan R. Salakhutdinov, Geoffrey E. Hinton
We introduce a Deep Boltzmann Machine model suitable for modeling and extracting latent semantic representations from a large unstructured collection of documents.
no code implementations • NeurIPS 2012 • Nitish Srivastava, Ruslan R. Salakhutdinov
Our experimental results on bi-modal data consisting of images and text show that the Multimodal DBM can learn a good generative model of the joint space of image and text inputs that is useful for information retrieval from both unimodal and multimodal queries.
11 code implementations • 3 Jul 2012 • Geoffrey E. Hinton, Nitish Srivastava, Alex Krizhevsky, Ilya Sutskever, Ruslan R. Salakhutdinov
When a large feedforward neural network is trained on a small training set, it typically performs poorly on held-out test data.
Ranked #205 on Image Classification on CIFAR-10