Search Results for author: Gregory Palmer

Found 11 papers, 3 papers with code

Deep Reinforcement Learning for Autonomous Cyber Operations: A Survey

no code implementations11 Oct 2023 Gregory Palmer, Chris Parry, Daniel J. B. Harrold, Chris Willis

An overview of state-of-the-art approaches for scaling DRL to domains that confront learners with the curse of dimensionality, and; iv.)

Benchmarking Real-Time Strategy Games +1

Market Making with Scaled Beta Policies

1 code implementation7 Jul 2022 Joseph Jerome, Gregory Palmer, Rahul Savani

This paper introduces a new representation for the actions of a market maker in an order-driven market.

Management

Data Valuation for Offline Reinforcement Learning

no code implementations19 May 2022 Amir Abolfazli, Gregory Palmer, Daniel Kudenko

The success of deep reinforcement learning (DRL) hinges on the availability of training data, which is typically obtained via a large number of environment interactions.

Data Valuation reinforcement-learning +1

Identifying Cause-and-Effect Relationships of Manufacturing Errors using Sequence-to-Sequence Learning

no code implementations5 May 2022 Jeff Reimer, Yandong Wang, Sofiane Laridi, Juergen Urdich, Sören Wilmsmeier, Gregory Palmer

In car-body production the pre-formed sheet metal parts of the body are assembled on fully-automated production lines.

Robust Federated Learning Against Adversarial Attacks for Speech Emotion Recognition

no code implementations9 Mar 2022 Yi Chang, Sofiane Laridi, Zhao Ren, Gregory Palmer, Björn W. Schuller, Marco Fisichella

The proposed framework consists of i) federated learning for data privacy, and ii) adversarial training at the training stage and randomisation at the testing stage for model robustness.

Federated Learning Speech Emotion Recognition

A deep learning approach to identify unhealthy advertisements in street view images

no code implementations9 Jul 2020 Gregory Palmer, Mark Green, Emma Boyland, Yales Stefano Rios Vasconcelos, Rahul Savani, Alex Singleton

Our project presents a novel implementation for the incidental classification of street view images for identifying unhealthy advertisements, providing a means through which to identify areas that can benefit from tougher advertisement restriction policies for tackling social inequalities.

The Automated Inspection of Opaque Liquid Vaccines

no code implementations21 Feb 2020 Gregory Palmer, Benjamin Schnieders, Rahul Savani, Karl Tuyls, Joscha-David Fossel, Harry Flore

We train 3D-ConvNets to predict the likelihood of 20-frame video samples containing anomalies.

Negative Update Intervals in Deep Multi-Agent Reinforcement Learning

1 code implementation13 Sep 2018 Gregory Palmer, Rahul Savani, Karl Tuyls

For instance, hysteretic Q-learning addresses miscoordination while leaving agents vulnerable towards misleading stochastic rewards.

Multi-agent Reinforcement Learning Q-Learning +2

Lenient Multi-Agent Deep Reinforcement Learning

1 code implementation14 Jul 2017 Gregory Palmer, Karl Tuyls, Daan Bloembergen, Rahul Savani

We find that LDQN agents are more likely to converge to the optimal policy in a stochastic reward CMOTP compared to standard and scheduled-HDQN agents.

Multi-agent Reinforcement Learning reinforcement-learning +1

Cannot find the paper you are looking for? You can Submit a new open access paper.