Search Results for author: Sai Rajeswar

Found 19 papers, 9 papers with code

Multimodal foundation world models for generalist embodied agents

1 code implementation26 Jun 2024 Pietro Mazzaglia, Tim Verbelen, Bart Dhoedt, Aaron Courville, Sai Rajeswar

Furthermore, by introducing a data-free RL strategy, it lays the groundwork for foundation model-based RL for generalist embodied agents.

Benchmarking Reinforcement Learning (RL)

RepLiQA: A Question-Answering Dataset for Benchmarking LLMs on Unseen Reference Content

no code implementations17 Jun 2024 Joao Monteiro, Pierre-Andre Noel, Etienne Marcotte, Sai Rajeswar, Valentina Zantedeschi, David Vazquez, Nicolas Chapados, Christopher Pal, Perouz Taslakian

We run a large-scale benchmark comprising several state-of-the-art LLMs to uncover differences in performance across models of various types and sizes in a context-conditional language modeling setting.

Benchmarking General Knowledge +2

VCR: Visual Caption Restoration

1 code implementation10 Jun 2024 Tianyu Zhang, Suyuchen Wang, Lu Li, Ge Zhang, Perouz Taslakian, Sai Rajeswar, Jie Fu, Bang Liu, Yoshua Bengio

This task stems from the observation that text embedded in images is intrinsically different from common visual elements and natural language due to the need to align the modalities of vision, text, and text embedded in images.

Language Modelling Masked Language Modeling +3

Efficient Dynamics Modeling in Interactive Environments with Koopman Theory

no code implementations20 Jun 2023 Arnab Kumar Mondal, Siba Smarak Panigrahi, Sai Rajeswar, Kaleem Siddiqi, Siamak Ravanbakhsh

We approach this problem from the lens of Koopman theory, where the nonlinear dynamics of the environment can be linearized in a high-dimensional latent space.

Reinforcement Learning (RL)

Choreographer: Learning and Adapting Skills in Imagination

1 code implementation23 Nov 2022 Pietro Mazzaglia, Tim Verbelen, Bart Dhoedt, Alexandre Lacoste, Sai Rajeswar

Unsupervised skill learning aims to learn a rich repertoire of behaviors without external supervision, providing artificial agents with the ability to control and influence the environment.

Unsupervised Reinforcement Learning

Mastering the Unsupervised Reinforcement Learning Benchmark from Pixels

1 code implementation24 Sep 2022 Sai Rajeswar, Pietro Mazzaglia, Tim Verbelen, Alexandre Piché, Bart Dhoedt, Aaron Courville, Alexandre Lacoste

In this work, we study the URLB and propose a new method to solve it, using unsupervised model-based RL, for pre-training the agent, and a task-aware fine-tuning strategy combined with a new proposed hybrid planner, Dyna-MPC, to adapt the agent for downstream tasks.

reinforcement-learning Reinforcement Learning (RL) +1

Touch-based Curiosity for Sparse-Reward Tasks

1 code implementation1 Apr 2021 Sai Rajeswar, Cyril Ibrahim, Nitin Surya, Florian Golemo, David Vazquez, Aaron Courville, Pedro O. Pinheiro

Robots in many real-world settings have access to force/torque sensors in their gripper and tactile sensing is often necessary in tasks that involve contact-rich motion.

Pix2Shape: Towards Unsupervised Learning of 3D Scenes from Images using a View-based Representation

1 code implementation23 Mar 2020 Sai Rajeswar, Fahim Mannan, Florian Golemo, Jérôme Parent-Lévesque, David Vazquez, Derek Nowrouzezahrai, Aaron Courville

We propose Pix2Shape, an approach to solve this problem with four components: (i) an encoder that infers the latent 3D representation from an image, (ii) a decoder that generates an explicit 2. 5D surfel-based reconstruction of a scene from the latent code (iii) a differentiable renderer that synthesizes a 2D image from the surfel representation, and (iv) a critic network trained to discriminate between images generated by the decoder-renderer and those from a training distribution.

Decoder

Adversarial Computation of Optimal Transport Maps

1 code implementation24 Jun 2019 Jacob Leygonie, Jennifer She, Amjad Almahairi, Sai Rajeswar, Aaron Courville

We show that during training, our generator follows the $W_2$-geodesic between the initial and the target distributions.

Pix2Scene: Learning Implicit 3D Representations from Images

no code implementations ICLR 2019 Sai Rajeswar, Fahim Mannan, Florian Golemo, David Vazquez, Derek Nowrouzezahrai, Aaron Courville

Modelling 3D scenes from 2D images is a long-standing problem in computer vision with implications in, e. g., simulation and robotics.

W2GAN: RECOVERING AN OPTIMAL TRANSPORT MAP WITH A GAN

no code implementations27 Sep 2018 Leygonie Jacob*, Jennifer She*, Amjad Almahairi, Sai Rajeswar, Aaron Courville

In this work we address the converse question: is it possible to recover an optimal map in a GAN fashion?

Augmented CycleGAN: Learning Many-to-Many Mappings from Unpaired Data

3 code implementations ICML 2018 Amjad Almahairi, Sai Rajeswar, Alessandro Sordoni, Philip Bachman, Aaron Courville

Learning inter-domain mappings from unpaired data can improve performance in structured prediction tasks, such as image segmentation, by reducing the need for paired data.

Image Segmentation Semantic Segmentation +1

Hierarchical Adversarially Learned Inference

no code implementations ICLR 2018 Mohamed Ishmael Belghazi, Sai Rajeswar, Olivier Mastropietro, Negar Rostamzadeh, Jovana Mitrovic, Aaron Courville

We propose a novel hierarchical generative model with a simple Markovian structure and a corresponding inference model.

Attribute

MINE: Mutual Information Neural Estimation

21 code implementations12 Jan 2018 Mohamed Ishmael Belghazi, Aristide Baratin, Sai Rajeswar, Sherjil Ozair, Yoshua Bengio, Aaron Courville, R. Devon Hjelm

We argue that the estimation of mutual information between high dimensional continuous random variables can be achieved by gradient descent over neural networks.

General Classification

Adversarial Generation of Natural Language

no code implementations WS 2017 Sai Rajeswar, Sandeep Subramanian, Francis Dutil, Christopher Pal, Aaron Courville

Generative Adversarial Networks (GANs) have gathered a lot of attention from the computer vision community, yielding impressive results for image generation.

Image Generation Language Modelling +1

A hypothesize-and-verify framework for Text Recognition using Deep Recurrent Neural Networks

no code implementations26 Feb 2015 Anupama Ray, Sai Rajeswar, Santanu Chaudhury

This paper proposes a hybrid text recognizer using a deep recurrent neural network with multiple layers of abstraction and long range context along with a language model to verify the performance of the deep neural network.

Language Modelling Segmentation

Cannot find the paper you are looking for? You can Submit a new open access paper.