Search Results for author: Ercument Ilhan

Found 4 papers, 3 papers with code

AdaptEx: A Self-Service Contextual Bandit Platform

no code implementations • 8 Aug 2023 • William Black, Ercument Ilhan, Andrea Marchini, Vilda Markeviciute

This paper presents AdaptEx, a self-service contextual bandit platform that is widely used at Expedia Group and leverages multi-armed bandit algorithms to personalize user experiences at scale.

Action Advising with Advice Imitation in Deep Reinforcement Learning

2 code implementations • 17 Apr 2021 • Ercument Ilhan, Jeremy Gow, Diego Perez-Liebana

Action advising is a peer-to-peer knowledge exchange technique built on the teacher-student paradigm to alleviate the sample inefficiency problem in deep reinforcement learning.

Atari Games, Behavioural cloning, +2

Learning on a Budget via Teacher Imitation

1 code implementation • 17 Apr 2021 • Ercument Ilhan, Jeremy Gow, Diego Perez-Liebana

However, due to practical concerns, the number of these interactions is limited by a budget; it is therefore crucial to perform them at the most appropriate moments.

Atari Games, Reinforcement Learning (RL)

Student-Initiated Action Advising via Advice Novelty

1 code implementation • 1 Oct 2020 • Ercument Ilhan, Jeremy Gow, Diego Perez-Liebana

Action advising is a budget-constrained knowledge exchange mechanism between teacher-student peers that can help tackle exploration and sample inefficiency problems in deep reinforcement learning (RL).

Atari Games, Reinforcement Learning (RL)
