Search Results for author: Gregory Palmer

Found 11 papers, 3 papers with code

Deep Reinforcement Learning for Autonomous Cyber Operations: A Survey

no code implementations • 11 Oct 2023 • Gregory Palmer, Chris Parry, Daniel J. B. Harrold, Chris Willis

An overview of state-of-the-art approaches for scaling DRL to domains that confront learners with the curse of dimensionality.

Benchmarking Real-Time Strategy Games +1

Cutting Through the Noise: An Empirical Comparison of Psychoacoustic and Envelope-based Features for Machinery Fault Detection

no code implementations • 3 Nov 2022 • Peter Wißbrock, Yvonne Richter, David Pelkmann, Zhao Ren, Gregory Palmer

We train a state-of-the-art one-class classifier on samples from healthy motors and use a threshold to separate the faulty ones for fault detection.

Fault Detection One-class classifier
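
The thresholding idea in the snippet above can be illustrated with a minimal sketch. This is not the paper's classifier: it swaps in a simple z-score model fitted to a single scalar feature, and the function names, sample values, and threshold of 3.0 are all illustrative assumptions.

```python
from statistics import mean, stdev

def fit_healthy(samples):
    """Fit a stand-in one-class model: mean and standard deviation of healthy data."""
    return mean(samples), stdev(samples)

def anomaly_score(x, model):
    """Distance from the healthy mean, in units of healthy standard deviations."""
    mu, sigma = model
    return abs(x - mu) / sigma

def is_faulty(x, model, threshold=3.0):
    """Flag a sample as faulty when its score exceeds the (assumed) threshold."""
    return anomaly_score(x, model) > threshold

# Hypothetical scalar feature values recorded from healthy motors only.
healthy = [0.9, 1.0, 1.1, 1.0, 0.95, 1.05]
model = fit_healthy(healthy)

near_healthy = is_faulty(1.02, model)  # close to the healthy mean
far_from_healthy = is_faulty(1.6, model)  # far outside the healthy spread
```

Only healthy samples are needed for fitting, which is the defining property of one-class approaches; the threshold then trades missed faults against false alarms.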

Developing an AI-enabled IIoT platform -- Lessons learned from early use case validation

no code implementations • 10 Jul 2022 • Holger Eichelberger, Gregory Palmer, Svenja Reimer, Tat Trong Vu, Hieu Do, Sofiane Laridi, Alexander Weber, Claudia Niederée, Thomas Hildebrandt

For a broader adoption of AI in industrial production, adequate infrastructure capabilities are crucial.

Market Making with Scaled Beta Policies

1 code implementation • 7 Jul 2022 • Joseph Jerome, Gregory Palmer, Rahul Savani

This paper introduces a new representation for the actions of a market maker in an order-driven market.

Management

Data Valuation for Offline Reinforcement Learning

no code implementations • 19 May 2022 • Amir Abolfazli, Gregory Palmer, Daniel Kudenko

The success of deep reinforcement learning (DRL) hinges on the availability of training data, which is typically obtained via a large number of environment interactions.

Data Valuation reinforcement-learning +1

Identifying Cause-and-Effect Relationships of Manufacturing Errors using Sequence-to-Sequence Learning

no code implementations • 5 May 2022 • Jeff Reimer, Yandong Wang, Sofiane Laridi, Juergen Urdich, Sören Wilmsmeier, Gregory Palmer

In car-body production the pre-formed sheet metal parts of the body are assembled on fully-automated production lines.

Robust Federated Learning Against Adversarial Attacks for Speech Emotion Recognition

no code implementations • 9 Mar 2022 • Yi Chang, Sofiane Laridi, Zhao Ren, Gregory Palmer, Björn W. Schuller, Marco Fisichella

The proposed framework consists of i) federated learning for data privacy, and ii) adversarial training at the training stage and randomisation at the testing stage for model robustness.

Federated Learning Speech Emotion Recognition

A deep learning approach to identify unhealthy advertisements in street view images

no code implementations • 9 Jul 2020 • Gregory Palmer, Mark Green, Emma Boyland, Yales Stefano Rios Vasconcelos, Rahul Savani, Alex Singleton

Our project presents a novel implementation for the incidental classification of street view images for identifying unhealthy advertisements, providing a means through which to identify areas that can benefit from tougher advertisement restriction policies for tackling social inequalities.

The Automated Inspection of Opaque Liquid Vaccines

no code implementations • 21 Feb 2020 • Gregory Palmer, Benjamin Schnieders, Rahul Savani, Karl Tuyls, Joscha-David Fossel, Harry Flore

We train 3D-ConvNets to predict the likelihood of 20-frame video samples containing anomalies.

Negative Update Intervals in Deep Multi-Agent Reinforcement Learning

1 code implementation • 13 Sep 2018 • Gregory Palmer, Rahul Savani, Karl Tuyls

For instance, hysteretic Q-learning addresses miscoordination while leaving agents vulnerable towards misleading stochastic rewards.

Multi-agent Reinforcement Learning Q-Learning +2
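
For readers unfamiliar with the method named in the snippet above, hysteretic Q-learning is commonly described as using a larger learning rate for positive temporal-difference errors than for negative ones. The following is a minimal tabular sketch under that description, not the deep variant studied in the paper; the helper name, state labels, and the values of `alpha`, `beta`, and `gamma` are illustrative assumptions.

```python
from collections import defaultdict

def hysteretic_update(q, state, action, reward, next_state, actions,
                      alpha=0.1, beta=0.01, gamma=0.9):
    """Apply one tabular hysteretic Q-learning step and return the TD error.

    Positive errors use the larger rate alpha; negative errors use the
    smaller rate beta, so low returns caused by teammates' exploration
    are discounted. Assumes next_state is non-terminal.
    """
    best_next = max(q[(next_state, a)] for a in actions)
    delta = reward + gamma * best_next - q[(state, action)]
    rate = alpha if delta >= 0 else beta
    q[(state, action)] += rate * delta
    return delta

q = defaultdict(float)  # unseen (state, action) pairs default to 0.0
actions = [0, 1]
hysteretic_update(q, "s", 0, 1.0, "t", actions)   # positive error, rate alpha
hysteretic_update(q, "s", 0, -1.0, "t", actions)  # negative error, rate beta
value = q[("s", 0)]
```

Because increases are absorbed at the full rate while decreases are damped, a single lucky high reward can leave an estimate inflated for a long time, which is the vulnerability to misleading stochastic rewards that the snippet mentions.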

Lenient Multi-Agent Deep Reinforcement Learning

1 code implementation • 14 Jul 2017 • Gregory Palmer, Karl Tuyls, Daan Bloembergen, Rahul Savani

We find that LDQN agents are more likely to converge to the optimal policy in a stochastic reward CMOTP compared to standard and scheduled-HDQN agents.

Multi-agent Reinforcement Learning reinforcement-learning +1
