no code implementations • 11 Oct 2023 • Gregory Palmer, Chris Parry, Daniel J. B. Harrold, Chris Willis
An overview of state-of-the-art approaches for scaling DRL to domains that confront learners with the curse of dimensionality, and; iv.)
no code implementations • 3 Nov 2022 • Peter Wißbrock, Yvonne Richter, David Pelkmann, Zhao Ren, Gregory Palmer
We train a state-of-the-art one-class-classifier, on samples from healthy motors and separate the faulty ones for fault detection using a threshold.
no code implementations • 10 Jul 2022 • Holger Eichelberger, Gregory Palmer, Svenja Reimer, Tat Trong Vu, Hieu Do, Sofiane Laridi, Alexander Weber, Claudia Niederée, Thomas Hildebrandt
For a broader adoption of AI in industrial production, adequate infrastructure capabilities are crucial.
1 code implementation • 7 Jul 2022 • Joseph Jerome, Gregory Palmer, Rahul Savani
This paper introduces a new representation for the actions of a market maker in an order-driven market.
no code implementations • 19 May 2022 • Amir Abolfazli, Gregory Palmer, Daniel Kudenko
The success of deep reinforcement learning (DRL) hinges on the availability of training data, which is typically obtained via a large number of environment interactions.
no code implementations • 5 May 2022 • Jeff Reimer, Yandong Wang, Sofiane Laridi, Juergen Urdich, Sören Wilmsmeier, Gregory Palmer
In car-body production the pre-formed sheet metal parts of the body are assembled on fully-automated production lines.
no code implementations • 9 Mar 2022 • Yi Chang, Sofiane Laridi, Zhao Ren, Gregory Palmer, Björn W. Schuller, Marco Fisichella
The proposed framework consists of i) federated learning for data privacy, and ii) adversarial training at the training stage and randomisation at the testing stage for model robustness.
no code implementations • 9 Jul 2020 • Gregory Palmer, Mark Green, Emma Boyland, Yales Stefano Rios Vasconcelos, Rahul Savani, Alex Singleton
Our project presents a novel implementation for the incidental classification of street view images for identifying unhealthy advertisements, providing a means through which to identify areas that can benefit from tougher advertisement restriction policies for tackling social inequalities.
no code implementations • 21 Feb 2020 • Gregory Palmer, Benjamin Schnieders, Rahul Savani, Karl Tuyls, Joscha-David Fossel, Harry Flore
We train 3D-ConvNets to predict the likelihood of 20-frame video samples containing anomalies.
1 code implementation • 13 Sep 2018 • Gregory Palmer, Rahul Savani, Karl Tuyls
For instance, hysteretic Q-learning addresses miscoordination while leaving agents vulnerable towards misleading stochastic rewards.
1 code implementation • 14 Jul 2017 • Gregory Palmer, Karl Tuyls, Daan Bloembergen, Rahul Savani
We find that LDQN agents are more likely to converge to the optimal policy in a stochastic reward CMOTP compared to standard and scheduled-HDQN agents.
Multi-agent Reinforcement Learning reinforcement-learning +1