Search Results for author: Wilka Carvalho

Found 6 papers, 4 papers with code

Predictive representations: building blocks of intelligence

no code implementations • 9 Feb 2024 • Wilka Carvalho, Momchil S. Tomov, William de Cothi, Caswell Barry, Samuel J. Gershman

Adaptive behavior often requires predicting future events.

reinforcement-learning

Paper
Add Code

Composing Task Knowledge with Modular Successor Feature Approximators

1 code implementation • 28 Jan 2023 • Wilka Carvalho, Angelos Filos, Richard L. Lewis, Honglak Lee, Satinder Singh

Recently, the Successor Features and Generalized Policy Improvement (SF&GPI) framework has been proposed as a method for learning, composing, and transferring predictive knowledge and behavior.

444

Paper
Code

Feature-Attending Recurrent Modules for Generalization in Reinforcement Learning

1 code implementation • 15 Dec 2021 • Wilka Carvalho, Andrew Lampinen, Kyriacos Nikiforou, Felix Hill, Murray Shanahan

Many important tasks are defined in terms of object.

Object reinforcement-learning +1

Paper
Code

Successor Feature Landmarks for Long-Horizon Goal-Conditioned Reinforcement Learning

1 code implementation • NeurIPS 2021 • Christopher Hoang, Sungryull Sohn, Jongwook Choi, Wilka Carvalho, Honglak Lee

SFL leverages the ability of successor features (SF) to capture transition dynamics, using it to drive exploration by estimating state-novelty and to enable high-level planning by abstracting the state-space as a non-parametric landmark-based graph.

Efficient Exploration reinforcement-learning +1

Paper
Code

Reinforcement Learning for Sparse-Reward Object-Interaction Tasks in a First-person Simulated 3D Environment

no code implementations • 28 Oct 2020 • Wilka Carvalho, Anthony Liang, Kimin Lee, Sungryull Sohn, Honglak Lee, Richard L. Lewis, Satinder Singh

In this work, we show that one can learn object-interaction tasks from scratch without supervision by learning an attentive object-model as an auxiliary task during task learning with an object-centric relational RL agent.

Object Reinforcement Learning (RL) +1

Paper
Add Code

Detecting Backdoor Attacks on Deep Neural Networks by Activation Clustering

1 code implementation • 9 Nov 2018 • Bryant Chen, Wilka Carvalho, Nathalie Baracaldo, Heiko Ludwig, Benjamin Edwards, Taesung Lee, Ian Molloy, Biplav Srivastava

While machine learning (ML) models are being increasingly trusted to make decisions in different and varying areas, the safety of systems using such models has become an increasing concern.

Clustering

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.