Deep Q-Network

Introduced by Mnih et al. in Playing Atari with Deep Reinforcement Learning

A DQN, or Deep Q-Network, approximates the action-value function (Q-function) in a Q-Learning framework with a neural network. In the Atari case, the network takes several stacked frames of the game as input and outputs an estimated action value for each possible action.
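The input/output contract described above can be sketched as follows. This is a minimal illustration, not the architecture from the paper: a single random linear layer stands in for the neural network, and the frame count, frame size, action count, and all function names are assumptions chosen for the example.

```python
import numpy as np

rng = np.random.default_rng(0)

# Assumed shapes for illustration only.
NUM_FRAMES, HEIGHT, WIDTH = 4, 84, 84
NUM_ACTIONS = 6

# Stand-in for the network: one random linear layer (hypothetical).
weights = rng.normal(scale=0.01, size=(NUM_FRAMES * HEIGHT * WIDTH, NUM_ACTIONS))


def q_values(stacked_frames):
    """Map a stack of frames to one estimated action value per action."""
    return stacked_frames.reshape(-1) @ weights


def greedy_action(stacked_frames):
    """Pick the action with the highest estimated action value."""
    return int(np.argmax(q_values(stacked_frames)))


state = rng.random((NUM_FRAMES, HEIGHT, WIDTH))  # fake stacked frames
q = q_values(state)
action = greedy_action(state)
```

A single forward pass yields the values of all actions at once, so choosing the greedy action needs only one evaluation of the network per state.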

It is usually used in conjunction with Experience Replay, which stores the episode steps in a replay memory for off-policy learning; training samples are drawn from this memory at random. Additionally, the Q-Network is usually optimized towards a frozen target network whose weights are replaced with the latest Q-Network weights every $k$ steps (where $k$ is a hyperparameter). Experience Replay tackles the autocorrelation between consecutive samples that would occur with on-line learning, and having a replay memory makes the problem more like a supervised learning problem. The target network makes training more stable by preventing short-term oscillations from a moving target.
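A minimal sketch of how a replay memory and a periodically synced target network fit together is given below. It rests on loud assumptions: a linear Q-function replaces the neural network, random transitions replace a real environment, and every name and hyperparameter value (`ReplayMemory`, `td_targets`, `train_step`, `K`, and so on) is hypothetical.

```python
import random
from collections import deque

import numpy as np


class ReplayMemory:
    """Fixed-size store of (state, action, reward, next_state, done) steps."""

    def __init__(self, capacity):
        self.buffer = deque(maxlen=capacity)

    def __len__(self):
        return len(self.buffer)

    def push(self, state, action, reward, next_state, done):
        self.buffer.append((state, action, reward, next_state, done))

    def sample(self, batch_size):
        # A uniform random draw breaks the correlation between consecutive steps.
        batch = random.sample(list(self.buffer), batch_size)
        states, actions, rewards, next_states, dones = zip(*batch)
        return (np.array(states), np.array(actions), np.array(rewards),
                np.array(next_states), np.array(dones, dtype=np.float64))


def td_targets(rewards, next_states, dones, target_weights, gamma):
    """Bootstrapped targets computed from the frozen target weights."""
    next_q = next_states @ target_weights  # shape: (batch, num_actions)
    return rewards + gamma * (1.0 - dones) * next_q.max(axis=1)


def train_step(online_weights, target_weights, batch, gamma, lr):
    """One gradient step on the mean squared TD error of a linear Q-function."""
    states, actions, rewards, next_states, dones = batch
    targets = td_targets(rewards, next_states, dones, target_weights, gamma)
    q_taken = (states @ online_weights)[np.arange(len(actions)), actions]
    td_error = q_taken - targets  # targets are treated as constants
    grad = np.zeros(online_weights.T.shape)  # shape: (num_actions, state_dim)
    np.add.at(grad, actions, states * td_error[:, None])
    return online_weights - lr * grad.T / len(actions)


random.seed(0)
rng = np.random.default_rng(0)
STATE_DIM, NUM_ACTIONS = 8, 3                    # assumed toy sizes
GAMMA, LR, BATCH_SIZE, K = 0.99, 0.01, 32, 50    # K = target sync period

online_weights = rng.normal(scale=0.1, size=(STATE_DIM, NUM_ACTIONS))
target_weights = online_weights.copy()
memory = ReplayMemory(capacity=1000)
sync_count = 0

for step in range(1, 201):
    # Fake transition standing in for one environment step.
    memory.push(rng.random(STATE_DIM), int(rng.integers(NUM_ACTIONS)),
                float(rng.random()), rng.random(STATE_DIM),
                bool(rng.random() < 0.1))
    if len(memory) >= BATCH_SIZE:
        batch = memory.sample(BATCH_SIZE)
        online_weights = train_step(online_weights, target_weights,
                                    batch, GAMMA, LR)
    if step % K == 0:
        # Refresh the frozen target with the latest online weights.
        target_weights = online_weights.copy()
        sync_count += 1
```

Between syncs the targets depend only on `target_weights`, so the regression target stays fixed while `online_weights` changes; this is the stabilizing effect attributed to the frozen target network.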

Source: Playing Atari with Deep Reinforcement Learning

Latest Papers

PAPER DATE
Hamilton-Jacobi Deep Q-Learning for Deterministic Continuous-Time Systems with Lipschitz Continuous Controls
Jeongho Kim, Jaeuk Shin, Insoon Yang
2020-10-27
Adversarial Attacks on Deep Algorithmic Trading Policies
Yaser Faghan, Nancirose Piazza, Vahid Behzadan, Ali Fathi
2020-10-22
Connections between Relational Event Model and Inverse Reinforcement Learning for Characterizing Group Interaction Sequences
Congyu Wu
2020-10-19
Chance-Constrained Control with Lexicographic Deep Reinforcement Learning
Alessandro Giuseppi, Antonio Pietrabissa
2020-10-19
Multi-Agent Reinforcement Learning in NOMA-aided UAV Networks for Cellular Offloading
Ruikang Zhong, Xiao Liu, Yuanwei Liu, Yue Chen
2020-10-18
Value-based Bayesian Meta-reinforcement Learning and Traffic Signal Control
Yayi Zou, Zhiwei Qin
2020-10-01
Strategy and Benchmark for Converting Deep Q-Networks to Event-Driven Spiking Neural Networks
Weihao Tan, Devdhar Patel, Robert Kozma
2020-09-30
Lineage Evolution Reinforcement Learning
Zeyu Zhang, Guisheng Yin
2020-09-26
A New Approach for Tactical Decision Making in Lane Changing: Sample Efficient Deep Q Learning with a Safety Feedback Reward
M. Ugur Yavas, N. Kemal Ure, Tufan Kumbasar
2020-09-24
Tactical Decision Making for Emergency Vehicles based on a Combinational Learning Method
Haoyi Niu, Jianming Hu
2020-09-09
An adaptive synchronization approach for weights of deep reinforcement learning
S. Amirreza Badran, Mansoor Rezghi
2020-08-16
Chrome Dino Run using Reinforcement Learning
Divyanshu Marwah, Sneha Srivastava, Anusha Gupta, Shruti Verma
2020-08-15
Reinforcement Learning with Quantum Variational Circuits
Owen Lockwood, Mei Si
2020-08-15
Convex Q-Learning, Part 1: Deterministic Optimal Control
Prashant G. Mehta, Sean P. Meyn
2020-08-08
Deep Q-Network Based Multi-agent Reinforcement Learning with Binary Action Agents
Abdul Mueed Hafiz, Ghulam Mohiuddin Bhat
2020-08-06
UAV Target Tracking in Urban Environments Using Deep Reinforcement Learning
Sarthak Bhagat, Sujit PB
2020-07-21
Mixture of Step Returns in Bootstrapped DQN
Po-Han Chiang, Hsuan-Kung Yang, Zhang-Wei Hong, Chun-Yi Lee
2020-07-16
Analysis of Q-learning with Adaptation and Momentum Restart for Gradient Descent
Bowen Weng, Huaqing Xiong, Yingbin Liang, Wei Zhang
2020-07-15
Simulating multi-exit evacuation using deep reinforcement learning
Dong Xu, Xiao Huang, Joseph Mango, Xiang Li, Zhenlong Li
2020-07-11
SUNRISE: A Simple Unified Framework for Ensemble Learning in Deep Reinforcement Learning
Kimin Lee, Michael Laskin, Aravind Srinivas, Pieter Abbeel
2020-07-09
Auto-MAP: A DQN Framework for Exploring Distributed Execution Plans for DNN Workloads
Siyu Wang, Yi Rong, Shiqing Fan, Zhen Zheng, LanSong Diao, Guoping Long, Jun Yang, Xiaoyong Liu, Wei Lin
2020-07-08
Cognitive Radio Network Throughput Maximization with Deep Reinforcement Learning
Kevin Shen Hoong Ong, Yang Zhang, Dusit Niyato
2020-07-07
Decentralized Deep Reinforcement Learning for Network Level Traffic Signal Control
Jin Guo
2020-07-02
Noise, overestimation and exploration in Deep Reinforcement Learning
Rafael Stekolshchik
2020-06-25
Reducing Overestimation Bias by Increasing Representation Dissimilarity in Ensemble Based Deep Q-Learning
Hassam Ullah Sheikh, Ladislau Bölöni
2020-06-24
RL Unplugged: Benchmarks for Offline Reinforcement Learning
Caglar Gulcehre, Ziyu Wang, Alexander Novikov, Tom Le Paine, Sergio Gomez Colmenarejo, Konrad Zolna, Rishabh Agarwal, Josh Merel, Daniel Mankowitz, Cosmin Paduraru, Gabriel Dulac-Arnold, Jerry Li, Mohammad Norouzi, Matt Hoffman, Ofir Nachum, George Tucker, Nicolas Heess, Nando de Freitas
2020-06-24
Efficient Ridesharing Dispatch Using Multi-Agent Reinforcement Learning
Oscar de Lima, Hansal Shah, Ting-Sheng Chu, Brian Fogelson
2020-06-18
Interaction Networks: Using a Reinforcement Learner to train other Machine Learning algorithms
Florian Dietz
2020-06-15
Balancing a CartPole System with Reinforcement Learning -- A Tutorial
Swagat Kumar
2020-06-08
Conservative Q-Learning for Offline Reinforcement Learning
Aviral Kumar, Aurick Zhou, George Tucker, Sergey Levine
2020-06-08
Acme: A Research Framework for Distributed Reinforcement Learning
Matt Hoffman, Bobak Shahriari, John Aslanides, Gabriel Barth-Maron, Feryal Behbahani, Tamara Norman, Abbas Abdolmaleki, Albin Cassirer, Fan Yang, Kate Baumli, Sarah Henderson, Alex Novikov, Sergio Gómez Colmenarejo, Serkan Cabi, Caglar Gulcehre, Tom Le Paine, Andrew Cowie, Ziyu Wang, Bilal Piot, Nando de Freitas
2020-06-01
Learning to Charge RF-Energy Harvesting Devices in WiFi Networks
Yizhou Luo, Kwan-Wu Chin
2020-05-25
Prototypical Q Networks for Automatic Conversational Diagnosis and Few-Shot New Disease Adaption
Hongyin Luo, Shang-Wen Li, James Glass
2020-05-19
Local and Global Explanations of Agent Behavior: Integrating Strategy Summaries with Saliency Maps
Tobias Huber, Katharina Weitz, Elisabeth André, Ofra Amir
2020-05-18
Risk-Aware High-level Decisions for Automated Driving at Occluded Intersections with Reinforcement Learning
Danial Kamran, Carlos Fernandez Lopez, Martin Lauer, Christoph Stiller
2020-04-09
An Application of Deep Reinforcement Learning to Algorithmic Trading
Thibaut Théate, Damien Ernst
2020-04-07
Uniform State Abstraction For Reinforcement Learning
John Burden, Daniel Kudenko
2020-04-06
Importance of using appropriate baselines for evaluation of data-efficiency in deep reinforcement learning for Atari
Kacper Kielak
2020-03-23
Interpretable Multi Time-scale Constraints in Model-free Deep Reinforcement Learning for Autonomous Driving
Gabriel Kalweit, Maria Huegle, Moritz Werling, Joschka Boedecker
2020-03-20
Robust Deep Reinforcement Learning against Adversarial Perturbations on State Observations
Huan Zhang, Hongge Chen, Chaowei Xiao, Bo Li, Mingyan Liu, Duane Boning, Cho-Jui Hsieh
2020-03-19
Simultaneous Navigation and Radio Mapping for Cellular-Connected UAV with Deep Reinforcement Learning
Yong Zeng, Xiaoli Xu, Shi Jin, Rui Zhang
2020-03-17
Application of Deep Q-Network in Portfolio Management
Ziming Gao, Yuan Gao, Yi Hu, Zhengyong Jiang, Jionglong Su
2020-03-13
Dynamic Experience Replay
Jieliang Luo, Hui Li
2020-03-04
Optimistic Exploration even with a Pessimistic Initialisation
Tabish Rashid, Bei Peng, Wendelin Böhmer, Shimon Whiteson
2020-02-26
Disentangling Controllable Object through Video Prediction Improves Visual Reinforcement Learning
Yuanyi Zhong, Alexander Schwing, Jian Peng
2020-02-21
Langevin DQN
Vikranth Dwaracherla, Benjamin Van Roy
2020-02-17
Reinforced active learning for image segmentation
Arantxa Casanova, Pedro O. Pinheiro, Negar Rostamzadeh, Christopher J. Pal
2020-02-16
A Multimodal Dialogue System for Conversational Image Editing
Tzu-Hsiang Lin, Trung Bui, Doo Soon Kim, Jean Oh
2020-02-16
Fast Reinforcement Learning for Anti-jamming Communications
Pei-Gen Ye, Yuan-Gen Wang, Jin Li, Liang Xiao
2020-02-13
Safe Wasserstein Constrained Deep Q-Learning
Aaron Kandel, Scott J. Moura
2020-02-07
Deep RBF Value Functions for Continuous Control
Kavosh Asadi, Ronald E. Parr, George D. Konidaris, Michael L. Littman
2020-02-05
Interpretable End-to-end Urban Autonomous Driving with Latent Deep Reinforcement Learning
Jianyu Chen, Shengbo Eben Li, Masayoshi Tomizuka
2020-01-23
Deep Interactive Reinforcement Learning for Path Following of Autonomous Underwater Vehicle
Qilei Zhang, Jinying Lin, Qixin Sha, Bo He, Guangliang Li
2020-01-10
Deep Randomized Least Squares Value Iteration
Guy Adam, Tom Zahavy, Oron Anschel, Nahum Shimkin
2020-01-01
SLM Lab: A Comprehensive Benchmark and Modular Software Framework for Reproducible Deep Reinforcement Learning
Keng Wah Loon, Laura Graesser, Milan Cvitkovic
2019-12-28
Exploiting the potential of deep reinforcement learning for classification tasks in high-dimensional and unstructured data
Johan S. Obando-Ceron, Victor Romero Cano, Walter Mayor Toro
2019-12-20
Soft Q-network
Jingbin Liu, Xinyang Gu, Shuai Liu, Dexiang Zhang
2019-12-20
Learning Sparse Representations Incrementally in Deep Reinforcement Learning
J. Fernando Hernandez-Garcia, Richard S. Sutton
2019-12-09
Reconciling λ-Returns with Experience Replay
Brett Daley, Christopher Amato
2019-12-01
Placement Optimization of Aerial Base Stations with Deep Reinforcement Learning
Jin Qiu, Jiangbin Lyu, Liqun Fu
2019-11-19
Minimalistic Attacks: How Little it Takes to Fool a Deep Reinforcement Learning Policy
Xinghua Qu, Zhu Sun, Yew-Soon Ong, Abhishek Gupta, Pengfei Wei
2019-11-10
An End-to-End Deep RL Framework for Task Arrangement in Crowdsourcing Platforms
Caihua Shan, Nikos Mamoulis, Reynold Cheng, Guoliang Li, Xiang Li, Yuqiu Qian
2019-11-04
Task-Oriented Language Grounding for Language Input with Multiple Sub-Goals of Non-Linear Order
Vladislav Kurenkov, Bulat Maksudov, Adil Khan
2019-10-27
Momentum in Reinforcement Learning
Nino Vieillard, Bruno Scherrer, Olivier Pietquin, Matthieu Geist
2019-10-21
Resource Allocation in Mobility-Aware Federated Learning Networks: A Deep Reinforcement Learning Approach
Huy T. Nguyen, Nguyen Cong Luong, Jun Zhao, Chau Yuen, Dusit Niyato
2019-10-21
Reverse Experience Replay
Egor Rotinov
2019-10-19
Knowledge Induced Deep Q-Network for a Slide-to-Wall Object Grasping
Hengyue Liang, Xibai Lou, Changhyun Choi
2019-10-09
Multi-step Greedy Reinforcement Learning Algorithms
Manan Tomar, Yonathan Efroni, Mohammad Ghavamzadeh
2019-10-07
Deep Q-Network for Angry Birds
Ekaterina Nikonova, Jakub Gemrot
2019-10-04
I'm sorry Dave, I'm afraid I can't do that, Deep Q-learning from forbidden action
Mathieu Seurin, Philippe Preux, Olivier Pietquin
2019-10-04
Benchmarking Batch Deep Reinforcement Learning Algorithms
Scott Fujimoto, Edoardo Conti, Mohammad Ghavamzadeh, Joelle Pineau
2019-10-03
AI Assisted Annotator using Reinforcement Learning
V. Ratna Saripalli, Gopal Avinash, Dibyajyoti Pati, Michael Potter, Charles W. Anderson
2019-10-02
Deep Reinforcement Learning with Modulated Hebbian plus Q Network Architecture
Pawel Ladosz, Eseoghene Ben-Iwhiwhu, Jeffery Dick, Yang Hu, Nicholas Ketz, Soheil Kolouri, Jeffrey L. Krichmar, Praveen Pilly, Andrea Soltoggio
2019-09-21
Split Deep Q-Learning for Robust Object Singulation
Iason Sarantopoulos, Marios Kiatos, Zoe Doulgeri, Sotiris Malassiotis
2019-09-17
Reinforcement Learning with Non-Markovian Rewards
Mridul Agarwal, Vaneet Aggarwal
2019-09-06
An Optimistic Perspective on Offline Reinforcement Learning
Rishabh Agarwal, Dale Schuurmans, Mohammad Norouzi
2019-07-10
Towards Empathic Deep Q-Learning
Bart Bussmann, Jacqueline Heinerman, Joel Lehman
2019-06-26
Learning Causal State Representations of Partially Observable Environments
Amy Zhang, Zachary C. Lipton, Luis Pineda, Kamyar Azizzadenesheli, Anima Anandkumar, Laurent Itti, Joelle Pineau, Tommaso Furlanello
2019-06-25
Sequential Triggers for Watermarking of Deep Reinforcement Learning Policies
Vahid Behzadan, William Hsu
2019-06-03
Analysis and Improvement of Adversarial Training in DQN Agents With Adversarially-Guided Exploration (AGE)
Vahid Behzadan, William Hsu
2019-06-03
RL-Based Method for Benchmarking the Adversarial Resilience and Robustness of Deep Reinforcement Learning Policies
Vahid Behzadan, William Hsu
2019-06-03
Learning distant cause and effect using only local and immediate credit assignment
David Rawlinson, Abdelrahman Ahmed, Gideon Kowadlo
2019-05-28
Prioritized Sequence Experience Replay
Marc Brittain, Josh Bertram, Xuxi Yang, Peng Wei
2019-05-25
Adaptive Symmetric Reward Noising for Reinforcement Learning
Refael Vivanti, Talya D. Sohlberg-Baris, Shlomo Cohen, Orna Cohen
2019-05-24
Deep Q-Learning with Q-Matrix Transfer Learning for Novel Fire Evacuation Environment
Jivitesh Sharma, Per-Arne Andersen, Ole-Chrisoffer Granmo, Morten Goodwin
2019-05-23
Mastering the Game of Sungka from Random Play
Darwin Bautista, Raimarc Dionido
2019-05-17
Comprehensible Context-driven Text Game Playing
Xusen Yin, Jonathan May
2019-05-06
Learning agents with prioritization and parameter noise in continuous state and action space
Rajesh Devaraddi, G. Srinivasaraghavan
2019-05-01
Beyond Games: Bringing Exploration to Robots in Real-world
Deepak Pathak, Dhiraj Gandhi, Abhinav Gupta
2019-05-01
Inducing Cooperation via Learning to reshape rewards in semi-cooperative multi-agent reinforcement learning
David Earl Hostallero, Daewoo Kim, Kyunghwan Son, Yung Yi
2019-05-01
Recurrent Experience Replay in Distributed Reinforcement Learning
Steven Kapturowski, Georg Ostrovski, Will Dabney, John Quan, Remi Munos
2019-05-01
Generative Adversarial Imagination for Sample Efficient Deep Reinforcement Learning
Kacper Kielak
2019-04-30
Deep Q Learning Driven CT Pancreas Segmentation with Geometry-Aware U-Net
Yunze Man, Yangsibo Huang, Junyi Feng, Xi Li, Fei Wu
2019-04-19
Personalized Cancer Chemotherapy Schedule: a numerical comparison of performance and robustness in model-based and model-free scheduling methodologies
Jesus Tordesillas, Juncal Arbelaiz
2019-04-02
Lane Change Decision-making through Deep Reinforcement Learning with Rule-based Constraints
Junjie Wang, Qichao Zhang, Dongbin Zhao, Yaran Chen
2019-03-30
DQN with model-based exploration: efficient learning on environments with sparse rewards
Stephen Zhen Gou, Yuyang Liu
2019-03-22
Deep Reinforcement Learning with Decorrelation
Borislav Mavrin, Hengshuai Yao, Linglong Kong
2019-03-18
Reinforcement Learning with Dynamic Boltzmann Softmax Updates
Ling Pan, Qingpeng Cai, Qi Meng, Wei Chen, Longbo Huang, Tie-Yan Liu
2019-03-14
Sample-Efficient Model-Free Reinforcement Learning with Off-Policy Critics
Denis Steckelmacher, Hélène Plisnier, Diederik M. Roijers, Ann Nowé
2019-03-11
DeepPool: Distributed Model-free Algorithm for Ride-sharing using Deep Reinforcement Learning
Abubakr Alabbasi, Arnob Ghosh, Vaneet Aggarwal
2019-03-09
MinAtar: An Atari-Inspired Testbed for Thorough and Reproducible Reinforcement Learning Experiments
Kenny Young, Tian Tian
2019-03-07
Reward Shaping via Meta-Learning
Haosheng Zou, Tongzheng Ren, Dong Yan, Hang Su, Jun Zhu
2019-01-27
Distillation Strategies for Proximal Policy Optimization
Sam Green, Craig M. Vineyard, Çetin Kaya Koç
2019-01-23
Understanding Multi-Step Deep Reinforcement Learning: A Systematic Study of the DQN Target
J. Fernando Hernandez-Garcia, Richard S. Sutton
2019-01-22
A Theoretical Analysis of Deep Q-Learning
Jianqing Fan, Zhaoran Wang, Yuchen Xie, Zhuoran Yang
2019-01-01
Generative Adversarial User Model for Reinforcement Learning Based Recommendation System
Xinshi Chen, Shuang Li, Hui Li, Shaohua Jiang, Yuan Qi, Le Song
2018-12-27
Parallelized Interactive Machine Learning on Autonomous Vehicles
Xi Chen, Caylin Hickey
2018-12-23
Learning to Navigate the Web
Izzeddin Gur, Ulrich Rueckert, Aleksandra Faust, Dilek Hakkani-Tur
2018-12-21
Double Deep Q-Learning for Optimal Execution
Brian Ning, Franco Ho Ting Lin, Sebastian Jaimungal
2018-12-17
Decentralized Computation Offloading for Multi-User Mobile Edge Computing: A Deep Reinforcement Learning Approach
Zhao Chen, Xiaodong Wang
2018-12-16
Off-Policy Deep Reinforcement Learning without Exploration
Scott Fujimoto, David Meger, Doina Precup
2018-12-07
Power Allocation in Multi-user Cellular Networks With Deep Q Learning Approach
Fan Meng, Peng Chen, Lenan Wu
2018-12-07
Active Deep Q-learning with Demonstration
Si-An Chen, Voot Tangkaratt, Hsuan-Tien Lin, Masashi Sugiyama
2018-12-06
Bach2Bach: Generating Music Using A Deep Reinforcement Learning Approach
Nikhil Kotecha
2018-12-03
Deep Reinforcement Learning for Intelligent Transportation Systems
Xiao-Yang Liu, Zihan Ding, Sem Borst, Anwar Walid
2018-12-03
Macro action selection with deep reinforcement learning in StarCraft
Sijia Xu, Hongyu Kuang, Zhi Zhuang, Renjie Hu, Yang Liu, Huyang Sun
2018-12-02
Deep Multi-Agent Reinforcement Learning with Relevance Graphs
Aleksandra Malysheva, Tegg Taekyong Sung, Chae-Bong Sohn, Daniel Kudenko, Aleksei Shpilman
2018-11-30
Urban Driving with Multi-Objective Deep Reinforcement Learning
Changjian Li, Krzysztof Czarnecki
2018-11-21
An initial attempt of combining visual selective attention with deep reinforcement learning
Liu Yuezhang, Ruohan Zhang, Dana H. Ballard
2018-11-11
Reconciling λ-Returns with Experience Replay
Brett Daley, Christopher Amato
2018-10-23
Successor Uncertainties: Exploration and Uncertainty in Temporal Difference Learning
David Janz, Jiri Hron, Przemysław Mazur, Katja Hofmann, José Miguel Hernández-Lobato, Sebastian Tschiatschek
2018-10-15
Empowerment-driven Exploration using Mutual Information Estimation
Navneet Madhu Kumar
2018-10-11
Parametrized Deep Q-Networks Learning: Reinforcement Learning with Discrete-Continuous Hybrid Action Space
Jiechao Xiong, Qing Wang, Zhuoran Yang, Peng Sun, Lei Han, Yang Zheng, Haobo Fu, Tong Zhang, Ji Liu, Han Liu
2018-10-10
Generalization and Regularization in DQN
Jesse Farebrother, Marlos C. Machado, Michael Bowling
2018-09-29
Coordinated Heterogeneous Distributed Perception based on Latent Space Representation
Timo Korthals, Jürgen Leitner, Ulrich Rückert
2018-09-12
Learn What Not to Learn: Action Elimination with Deep Reinforcement Learning
Tom Zahavy, Matan Haroush, Nadav Merlis, Daniel J. Mankowitz, Shie Mannor
2018-09-06
Model-Based Regularization for Deep Reinforcement Learning with Transcoder Networks
Felix Leibfried, Peter Vrancx
2018-09-06
Reinforcement Learning using Augmented Neural Networks
Jack Shannon, Marek Grzes
2018-06-20
Surprising Negative Results for Generative Adversarial Tree Search
Kamyar Azizzadenesheli, Brandon Yang, Weitang Liu, Zachary C Lipton, Animashree Anandkumar
2018-06-15
Qualitative Measurements of Policy Discrepancy for Return-Based Deep Q-Network
Wenjia Meng, Qian Zheng, Long Yang, Pengfei Li, Gang Pan
2018-06-14
Implicit Quantile Networks for Distributional Reinforcement Learning
Will Dabney, Georg Ostrovski, David Silver, Rémi Munos
2018-06-14
Learning to Search in Long Documents Using Document Structure
Mor Geva, Jonathan Berant
2018-06-09
Randomized Value Functions via Multiplicative Normalizing Flows
Ahmed Touati, Harsh Satija, Joshua Romoff, Joelle Pineau, Pascal Vincent
2018-06-06
Sample-Efficient Deep Reinforcement Learning via Episodic Backward Update
Su Young Lee, Sungik Choi, Sae-Young Chung
2018-05-31
Episodic Memory Deep Q-Networks
Zichuan Lin, Tianqi Zhao, Guangwen Yang, Lintao Zhang
2018-05-19
Optimized Computation Offloading Performance in Virtual Edge Computing Systems via Deep Reinforcement Learning
Xianfu Chen, Honggang Zhang, Celimuge Wu, Shiwen Mao, Yusheng Ji, Mehdi Bennis
2018-05-16
Advances in Experience Replay
Tracy Wan, Neil Xu
2018-05-15
MOVI: A Model-Free Approach to Dynamic Fleet Management
Takuma Oda, Carlee Joe-Wong
2018-04-13
Reinforcement Learning based QoS/QoE-aware Service Function Chaining in Software-Driven 5G Slices
Xi Chen, Zonghang Li, Yupeng Zhang, Ruiming Long, Hongfang Yu, Xiaojiang Du, Mohsen Guizani
2018-04-06
Natural Gradient Deep Q-learning
Ethan Knight, Osher Lerner
2018-03-20
Weighted Double Deep Multiagent Reinforcement Learning in Stochastic Cooperative Environments
Yan Zheng, Jianye Hao, Zongzhang Zhang
2018-02-23
Efficient Exploration through Bayesian Deep Q-Networks
Kamyar Azizzadenesheli, Animashree Anandkumar
2018-02-13
Faster Deep Q-learning using Neural Episodic Control
Daichi Nishio, Satoshi Yamane
2018-01-06
Parametrized Deep Q-Networks Learning: Playing Online Battle Arena with Discrete-Continuous Hybrid Action Space
Jiechao Xiong, Qing Wang, Zhuoran Yang, Peng Sun, Yang Zheng, Lei Han, Haobo Fu, Xiangru Lian, Carson Eisenach, Haichuan Yang, Emmanuel Ekwedike, Bei Peng, Haoyue Gao, Tong Zhang, Ji Liu, Han Liu
2018-01-01
Faster Reinforcement Learning with Expert State Sequences
Xiaoxiao Guo, Shiyu Chang, Mo Yu, Miao Liu, Gerald Tesauro
2018-01-01
Autonomous Vehicle Fleet Coordination With Deep Reinforcement Learning
Cane Punma
2018-01-01
A Deep Policy Inference Q-Network for Multi-Agent Systems
Zhang-Wei Hong, Shih-Yang Su, Tzu-Yun Shann, Yi-Hsiang Chang, Chun-Yi Lee
2017-12-21
Deep Neuroevolution: Genetic Algorithms Are a Competitive Alternative for Training Deep Neural Networks for Reinforcement Learning
Felipe Petroski Such, Vashisht Madhavan, Edoardo Conti, Joel Lehman, Kenneth O. Stanley, Jeff Clune
2017-12-18
Uncertainty Estimates for Efficient Neural Network-based Dialogue Policy Optimisation
Christopher Tegho, Paweł Budzianowski, Milica Gašić
2017-11-30
A Benchmarking Environment for Reinforcement Learning Based Task Oriented Dialogue Management
Iñigo Casanueva, Paweł Budzianowski, Pei-Hao Su, Nikola Mrkšić, Tsung-Hsien Wen, Stefan Ultes, Lina Rojas-Barahona, Steve Young, Milica Gašić
2017-11-29
Implementing the Deep Q-Network
Melrose Roderick, James MacGlashan, Stefanie Tellex
2017-11-20
TreeQN and ATreeC: Differentiable Tree-Structured Models for Deep Reinforcement Learning
Gregory Farquhar, Tim Rocktäschel, Maximilian Igl, Shimon Whiteson
2017-10-31
Distributional Reinforcement Learning with Quantile Regression
Will Dabney, Mark Rowland, Marc G. Bellemare, Rémi Munos
2017-10-27
Rainbow: Combining Improvements in Deep Reinforcement Learning
Matteo Hessel, Joseph Modayil, Hado van Hasselt, Tom Schaul, Georg Ostrovski, Will Dabney, Dan Horgan, Bilal Piot, Mohammad Azar, David Silver
2017-10-06
Deep Reinforcement Learning with Surrogate Agent-Environment Interface
Song Wang, Yu Jing
2017-09-12
Pre-training Neural Networks with Human Demonstrations for Deep Reinforcement Learning
Gabriel V. de la Cruz Jr, Yunshu Du, Matthew E. Taylor
2017-09-12
Formulation of Deep Reinforcement Learning Architecture Toward Autonomous Driving for On-Ramp Merge
Pin Wang, Ching-Yao Chan
2017-09-07
LADDER: A Human-Level Bidding Agent for Large-Scale Real-Time Online Auctions
Yu Wang, Jiayi Liu, Yuxiang Liu, Jun Hao, Yang He, Jinghe Hu, Weipeng P. Yan, Mantian Li
2017-08-18
3DCNN-DQN-RNN: A Deep Reinforcement Learning Framework for Semantic Parsing of Large-scale 3D Point Clouds
Fangyu Liu, Shuaipeng Li, Liqiang Zhang, Chenghu Zhou, Rongtian Ye, Yuebin Wang, Jiwen Lu
2017-07-21
Noisy Networks for Exploration
Meire Fortunato, Mohammad Gheshlaghi Azar, Bilal Piot, Jacob Menick, Ian Osband, Alex Graves, Vlad Mnih, Remi Munos, Demis Hassabis, Olivier Pietquin, Charles Blundell, Shane Legg
2017-06-30
Parameter Space Noise for Exploration
Matthias Plappert, Rein Houthooft, Prafulla Dhariwal, Szymon Sidor, Richard Y. Chen, Xi Chen, Tamim Asfour, Pieter Abbeel, Marcin Andrychowicz
2017-06-06
Explaining Transition Systems through Program Induction
Svetlin Penkov, Subramanian Ramamoorthy
2017-05-23
Shallow Updates for Deep Reinforcement Learning
Nir Levine, Tom Zahavy, Daniel J. Mankowitz, Aviv Tamar, Shie Mannor
2017-05-21
The Reactor: A fast and sample-efficient Actor-Critic agent for Reinforcement Learning
Audrunas Gruslys, Will Dabney, Mohammad Gheshlaghi Azar, Bilal Piot, Marc Bellemare, Remi Munos
2017-04-15
Deep Q-learning from Demonstrations
Todd Hester, Matej Vecerik, Olivier Pietquin, Marc Lanctot, Tom Schaul, Bilal Piot, Dan Horgan, John Quan, Andrew Sendonaris, Gabriel Dulac-Arnold, Ian Osband, John Agapiou, Joel Z. Leibo, Audrunas Gruslys
2017-04-12
Tactics of Adversarial Attack on Deep Reinforcement Learning Agents
Yen-Chen Lin, Zhang-Wei Hong, Yuan-Hong Liao, Meng-Li Shih, Ming-Yu Liu, Min Sun
2017-03-08
Count-Based Exploration with Neural Density Models
Georg Ostrovski, Marc G. Bellemare, Aaron van den Oord, Remi Munos
2017-03-03
Sigmoid-Weighted Linear Units for Neural Network Function Approximation in Reinforcement Learning
Stefan Elfwing, Eiji Uchibe, Kenji Doya
2017-02-10
Autonomous Braking System via Deep Reinforcement Learning
Hyunmin Chae, Chang Mook Kang, ByeoungDo Kim, Jaekyum Kim, Chung Choo Chung, Jun Won Choi
2017-02-08
Vulnerability of Deep Reinforcement Learning to Policy Induction Attacks
Vahid Behzadan, Arslan Munir
2017-01-16
Deep Reinforcement Learning for Multi-Domain Dialogue Systems
Heriberto Cuayáhuitl, Seunghak Yu, Ashley Williamson, Jacob Carse
2016-11-26
Memory Lens: How Much Memory Does an Agent Use?
Christoph Dann, Katja Hofmann, Sebastian Nowozin
2016-11-21
Averaged-DQN: Variance Reduction and Stabilization for Deep Reinforcement Learning
Oron Anschel, Nir Baram, Nahum Shimkin
2016-11-07
Deep Reinforcement Learning From Raw Pixels in Doom
Danijar Hafner
2016-10-07
Opponent Modeling in Deep Reinforcement Learning
He He, Jordan Boyd-Graber, Kevin Kwok, Hal Daumé III
2016-09-18
Deep Reinforcement Learning Discovers Internal Models
Nir Baram, Tom Zahavy, Shie Mannor
2016-06-16
Deep Reinforcement Learning With Macro-Actions
Ishan P. Durugkar, Clemens Rosenbaum, Stefan Dernbach, Sridhar Mahadevan
2016-06-15
Classifying Options for Deep Reinforcement Learning
Kai Arulkumaran, Nat Dilokthanakul, Murray Shanahan, Anil Anthony Bharath
2016-04-27
Deep Exploration via Bootstrapped DQN
Ian Osband, Charles Blundell, Alexander Pritzel, Benjamin Van Roy
2016-02-15
How to Discount Deep Reinforcement Learning: Towards New Dynamic Strategies
Vincent François-Lavet, Raphael Fonteneau, Damien Ernst
2015-12-07
Deep Attention Recurrent Q-Network
Ivan Sorokin, Alexey Seleznev, Mikhail Pavlov, Aleksandr Fedorov, Anastasiia Ignateva
2015-12-05
State of the Art Control of Atari Games Using Shallow Reinforcement Learning
Yitao Liang, Marlos C. Machado, Erik Talvitie, Michael Bowling
2015-12-04
Policy Distillation
Andrei A. Rusu, Sergio Gomez Colmenarejo, Caglar Gulcehre, Guillaume Desjardins, James Kirkpatrick, Razvan Pascanu, Volodymyr Mnih, Koray Kavukcuoglu, Raia Hadsell
2015-11-19
Prioritized Experience Replay
Tom Schaul, John Quan, Ioannis Antonoglou, David Silver
2015-11-18
Generating Text with Deep Reinforcement Learning
Hongyu Guo
2015-10-30
Deep Reinforcement Learning with Double Q-learning
Hado van Hasselt, Arthur Guez, David Silver
2015-09-22
Massively Parallel Methods for Deep Reinforcement Learning
Arun Nair, Praveen Srinivasan, Sam Blackwell, Cagdas Alcicek, Rory Fearon, Alessandro De Maria, Vedavyas Panneershelvam, Mustafa Suleyman, Charles Beattie, Stig Petersen, Shane Legg, Volodymyr Mnih, Koray Kavukcuoglu, David Silver
2015-07-15
Deep Learning for Real-Time Atari Game Play Using Offline Monte-Carlo Tree Search Planning
Xiaoxiao Guo, Satinder Singh, Honglak Lee, Richard L. Lewis, Xiaoshi Wang
2014-12-01
Playing Atari with Deep Reinforcement Learning
Volodymyr Mnih, Koray Kavukcuoglu, David Silver, Alex Graves, Ioannis Antonoglou, Daan Wierstra, Martin Riedmiller
2013-12-19
