Search Results for author: Majid Abdolshah

Found 13 papers, 1 papers with code

BO-Muse: A human expert and AI teaming framework for accelerated experimental design

no code implementations • 3 Mar 2023 • Sunil Gupta, Alistair Shilton, Arun Kumar A V, Shannon Ryan, Majid Abdolshah, Hung Le, Santu Rana, Julian Berk, Mahad Rashid, Svetha Venkatesh

In this paper we introduce BO-Muse, a new approach to human-AI teaming for the optimization of expensive black-box functions.

Experimental Design

Paper
Add Code

Semi-Supervised Semantic Segmentation under Label Noise via Diverse Learning Groups

no code implementations • ICCV 2023 • Peixia Li, Pulak Purkait, Thalaiyasingam Ajanthan, Majid Abdolshah, Ravi Garg, Hisham Husain, Chenchen Xu, Stephen Gould, Wanli Ouyang, Anton Van Den Hengel

Each learning group consists of a teacher network, a student network and a novel filter module.

Semi-Supervised Semantic Segmentation

Paper
Add Code

Learning to Constrain Policy Optimization with Virtual Trust Region

no code implementations • 20 Apr 2022 • Hung Le, Thommen Karimpanal George, Majid Abdolshah, Dung Nguyen, Kien Do, Sunil Gupta, Svetha Venkatesh

We introduce a constrained optimization method for policy gradient reinforcement learning, which uses a virtual trust region to regulate each policy update.

Atari Games Policy Gradient Methods

Paper
Add Code

Episodic Policy Gradient Training

1 code implementation • 3 Dec 2021 • Hung Le, Majid Abdolshah, Thommen K. George, Kien Do, Dung Nguyen, Svetha Venkatesh

We introduce a novel training procedure for policy gradient methods wherein episodic memory is used to optimize the hyperparameters of reinforcement learning algorithms on-the-fly.

Policy Gradient Methods Scheduling

Paper
Code

Balanced Q-learning: Combining the Influence of Optimistic and Pessimistic Targets

no code implementations • 3 Nov 2021 • Thommen George Karimpanal, Hung Le, Majid Abdolshah, Santu Rana, Sunil Gupta, Truyen Tran, Svetha Venkatesh

The optimistic nature of the Q-learning target leads to an overestimation bias, which is an inherent problem associated with standard $Q-$learning.

Q-Learning

Paper
Add Code

Model-Based Episodic Memory Induces Dynamic Hybrid Controls

no code implementations • NeurIPS 2021 • Hung Le, Thommen Karimpanal George, Majid Abdolshah, Truyen Tran, Svetha Venkatesh

Episodic control enables sample efficiency in reinforcement learning by recalling past experiences from an episodic memory.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

Neural Latent Traversal with Semantic Constraints

no code implementations • 29 Sep 2021 • Majid Abdolshah, Hung Le, Thommen Karimpanal George, Vuong Le, Sunil Gupta, Santu Rana, Svetha Venkatesh

Whilst Generative Adversarial Networks (GANs) generate visually appealing high resolution images, the latent representations (or codes) of these models do not allow controllable changes on the semantic attributes of the generated images.

Paper
Add Code

Reachability Traces for Curriculum Design in Reinforcement Learning

no code implementations • 29 Sep 2021 • Thommen Karimpanal George, Majid Abdolshah, Hung Le, Santu Rana, Sunil Gupta, Truyen Tran, Svetha Venkatesh

The objective in goal-based reinforcement learning is to learn a policy to reach a particular goal state within the environment.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

Plug and Play, Model-Based Reinforcement Learning

no code implementations • 20 Aug 2021 • Majid Abdolshah, Hung Le, Thommen Karimpanal George, Sunil Gupta, Santu Rana, Svetha Venkatesh

This is achieved by representing the global transition dynamics as a union of local transition functions, each with respect to one active object in the scene.

Model-based Reinforcement Learning Object +3

Paper
Add Code

A New Representation of Successor Features for Transfer across Dissimilar Environments

no code implementations • 18 Jul 2021 • Majid Abdolshah, Hung Le, Thommen Karimpanal George, Sunil Gupta, Santu Rana, Svetha Venkatesh

Transfer in reinforcement learning is usually achieved through generalisation across tasks.

Gaussian Processes Reinforcement Learning (RL)

Paper
Add Code

Cost-aware Multi-objective Bayesian optimisation

no code implementations • 9 Sep 2019 • Majid Abdolshah, Alistair Shilton, Santu Rana, Sunil Gupta, Svetha Venkatesh

We introduce a cost-aware multi-objective Bayesian optimisation with non-uniform evaluation cost over objective functions by defining cost-aware constraints over the search space.

Bayesian Optimisation

Paper
Add Code

Stable Bayesian Optimisation via Direct Stability Quantification

no code implementations • 21 Feb 2019 • Alistair Shilton, Sunil Gupta, Santu Rana, Svetha Venkatesh, Majid Abdolshah, Dang Nguyen

In this paper we consider the problem of finding stable maxima of expensive (to evaluate) functions.

Bayesian Optimisation

Paper
Add Code

Multi-objective Bayesian optimisation with preferences over objectives

no code implementations • NeurIPS 2019 • Majid Abdolshah, Alistair Shilton, Santu Rana, Sunil Gupta, Svetha Venkatesh

We present a multi-objective Bayesian optimisation algorithm that allows the user to express preference-order constraints on the objectives of the type "objective A is more important than objective B".

Bayesian Optimisation

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.