no code implementations • 11 Mar 2024 • Narim Jeong, Donghwan Lee
We hope that our analysis will deepen the current understanding of soft Q-learning by establishing connections with switching system models and may even pave the way for new frameworks in the finite-time analysis of other reinforcement learning algorithms.
no code implementations • 24 Feb 2024 • Donghwan Lee
This paper analyzes multi-step TD-learning algorithms within the 'deadly triad' scenario, characterized by linear function approximation, off-policy learning, and bootstrapping.
no code implementations • 19 Feb 2024 • Han-Dong Lim, HyeAnn Lee, Donghwan Lee
Reinforcement learning has witnessed significant advancements, particularly with the emergence of model-based approaches.
no code implementations • 4 Jan 2024 • Donghwan Lee, Do-Wan Kim
The main goal of this paper is to develop a new linear matrix inequality (LMI) condition for the asymptotic stability of continuous-time Takagi-Sugeno (T-S) fuzzy systems.
no code implementations • 11 Oct 2023 • Behrad Moniri, Donghwan Lee, Hamed Hassani, Edgar Dobriban
However, with a constant gradient descent step size, this spike only carries information from the linear component of the target function and therefore learning non-linear components is impossible.
no code implementations • 10 Oct 2023 • HyeAnn Lee, Donghwan Lee
The goal of this paper is to propose a new Q-learning algorithm with a dummy adversarial player, called dummy adversarial Q-learning (DAQ), which effectively regulates the overestimation bias in standard Q-learning.
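The DAQ algorithm itself is not reproduced here; as background for the overestimation bias it targets, a minimal tabular Q-learning update can be sketched as follows (the state/action sizes, step size, and reward values are illustrative assumptions, not the paper's setting). The max over next-state actions in the target is the standard source of the overestimation bias:

```python
import numpy as np

def q_learning_step(Q, s, a, r, s_next, alpha=0.1, gamma=0.99):
    """One tabular Q-learning update.

    The max over next actions in the bootstrapped target is what
    induces the overestimation bias discussed in the abstract above.
    """
    td_target = r + gamma * np.max(Q[s_next])
    Q[s, a] += alpha * (td_target - Q[s, a])
    return Q

# Toy example: 2 states, 2 actions, a single transition (hypothetical values).
Q = np.zeros((2, 2))
Q = q_learning_step(Q, s=0, a=1, r=1.0, s_next=1)
```

Under noisy reward estimates, `np.max` systematically overshoots the true maximum expected value, which is the effect that bias-regulation schemes such as DAQ aim to control.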
no code implementations • 1 Oct 2023 • Han-Dong Lim, Donghwan Lee
The goal of this paper is to investigate distributed temporal difference (TD) learning for a networked multi-agent Markov decision process.
no code implementations • 13 Sep 2023 • Donghwan Lee, Do Wan Kim
Moreover, we establish that the proposed methods offer necessary and sufficient conditions for the local exponential stability of T-S fuzzy systems.
no code implementations • 31 Jul 2023 • Donghwan Lee, Han-Dong Lim, Do Wan Kim
The main goal of this paper is to investigate continuous-time distributed dynamic programming (DP) algorithms for networked multi-agent Markov decision problems (MAMDPs).
no code implementations • 16 Jun 2023 • Han-Dong Lim, Donghwan Lee
Temporal-difference (TD) learning is widely regarded as one of the most popular algorithms in reinforcement learning (RL).
no code implementations • 9 Jun 2023 • Donghwan Lee
The objective of this paper is to investigate the finite-time analysis of a Q-learning algorithm applied to two-player zero-sum Markov games.
no code implementations • 9 Jun 2023 • Xinmeng Huang, Kan Xu, Donghwan Lee, Hamed Hassani, Hamsa Bastani, Edgar Dobriban
MOLAR improves the dependence of the estimation error on the data dimension, compared to independent least squares estimates.
no code implementations • CVPR 2023 • Jaehoon Choi, Dongki Jung, Taejae Lee, SangWook Kim, Youngdong Jung, Dinesh Manocha, Donghwan Lee
We present a new pipeline for acquiring a textured mesh in the wild with a single smartphone, which offers access to images, depth maps, and valid poses.
no code implementations • 20 Feb 2023 • Han-Dong Lim, Donghwan Lee
Off-policy learning ability is an important feature of reinforcement learning (RL) for practical applications.
1 code implementation • 31 Jan 2023 • Donghwan Lee, Behrad Moniri, Xinmeng Huang, Edgar Dobriban, Hamed Hassani
Evaluating the performance of machine learning models under distribution shift is challenging, especially when we only have unlabeled data from the shifted (target) domain, along with labeled data from the original (source) domain.
no code implementations • 25 Jul 2022 • Han-Dong Lim, Donghwan Lee
Q-learning has long been one of the most popular reinforcement learning algorithms, and theoretical analysis of Q-learning has been an active research topic for decades.
no code implementations • 1 Jun 2022 • Xinmeng Huang, Donghwan Lee, Edgar Dobriban, Hamed Hassani
In modern machine learning, users often have to collaborate to learn the distribution of the data.
1 code implementation • 31 May 2022 • Martin Humenberger, Yohann Cabon, Noé Pion, Philippe Weinzaepfel, Donghwan Lee, Nicolas Guérin, Torsten Sattler, Gabriela Csurka
In order to investigate the consequences for visual localization, this paper focuses on understanding the role of image retrieval for multiple visual localization paradigms.
no code implementations • 22 Apr 2022 • Donghwan Lee, Do Wan Kim
TD-learning is a fundamental algorithm in the field of reinforcement learning (RL), employed to evaluate a given policy by estimating the corresponding value function of a Markov decision process.
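The policy-evaluation role of TD-learning described above can be sketched with a minimal tabular TD(0) update (the toy state space, step size, and reward are illustrative assumptions, not this paper's analysis setting):

```python
import numpy as np

def td0_step(V, s, r, s_next, alpha=0.1, gamma=0.9):
    """One TD(0) update for policy evaluation: move the value
    estimate V(s) toward the bootstrapped target r + gamma * V(s')."""
    V[s] += alpha * (r + gamma * V[s_next] - V[s])
    return V

# Evaluate a fixed policy on a toy 3-state chain from one observed
# transition (state 0 -> state 1 with reward 1.0; hypothetical values).
V = np.zeros(3)
V = td0_step(V, s=0, r=1.0, s_next=1)
```

Repeating such updates along trajectories generated by the policy drives `V` toward that policy's value function, which is the quantity the paper's control-theoretic analysis concerns.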
no code implementations • 13 Mar 2022 • Hyungtae Lim, Suyong Yeon, Soohyun Ryu, Yonghan Lee, Youngji Kim, JaeSeong Yun, Euigon Jung, Donghwan Lee, Hyun Myung
As verified on indoor and outdoor 3D LiDAR datasets, our proposed method yields robust global registration performance compared with other methods, even for distant point cloud pairs.
no code implementations • 10 Mar 2022 • Jaehoon Choi, Dongki Jung, Yonghan Lee, Deokhwa Kim, Dinesh Manocha, Donghwan Lee
Given these metric poses and monocular sequences, we propose a self-supervised learning method for pre-trained supervised monocular depth networks to enable metrically scaled depth estimation.
1 code implementation • 3 Mar 2022 • Donghwan Lee, Xinmeng Huang, Hamed Hassani, Edgar Dobriban
We find that detecting mis-calibration is only possible when the conditional probabilities of the classes are sufficiently smooth functions of the predictions.
no code implementations • 11 Feb 2022 • Han-Dong Lim, Do Wan Kim, Donghwan Lee
This paper develops a new Q-learning algorithm that converges when linear function approximation is used.
no code implementations • 29 Dec 2021 • Donghwan Lee, Do Wan Kim
The goal of this manuscript is to conduct a control-theoretic analysis of Temporal Difference (TD) learning algorithms.
no code implementations • 9 Sep 2021 • Donghwan Lee, Han-Dong Lim, Jihoon Park, Okyong Choi
Sutton, Szepesvári, and Maei introduced the first gradient temporal-difference (GTD) learning algorithms compatible with both linear function approximation and off-policy training.
no code implementations • ICCV 2021 • Dongki Jung, Jaehoon Choi, Yonghan Lee, Deokhwa Kim, Changick Kim, Dinesh Manocha, Donghwan Lee
We present a novel approach for estimating depth from a monocular camera as it moves through complex and crowded indoor environments, e.g., a department store or a metro station.
no code implementations • CVPR 2021 • Donghwan Lee, Soohyun Ryu, Suyong Yeon, Yonghan Lee, Deokhwa Kim, Cheolho Han, Yohann Cabon, Philippe Weinzaepfel, Nicolas Guérin, Gabriela Csurka, Martin Humenberger
In this paper, we introduce 5 new indoor datasets for visual localization in challenging real-world environments.
no code implementations • 14 Mar 2021 • Donghwan Lee, Niao He, Seungjae Lee, Panagiota Karava, Jianghai Hu
The building sector consumes more energy than any other sector in the world, and there has been considerable research interest in the energy consumption and comfort management of buildings.
no code implementations • 17 Feb 2021 • Donghwan Lee, Jianghai Hu, Niao He
Based on these two systems, we derive a new finite-time error bound of asynchronous Q-learning when a constant stepsize is used.
no code implementations • NeurIPS 2020 • Donghwan Lee, Niao He
This paper develops a novel and unified framework to analyze the convergence of a large family of Q-learning algorithms from the switching system perspective.
no code implementations • 10 Nov 2020 • Jaehoon Choi, Dongki Jung, Yonghan Lee, Deokhwa Kim, Dinesh Manocha, Donghwan Lee
We present a novel algorithm for self-supervised monocular depth completion.
1 code implementation • 6 Oct 2020 • Jaehoon Choi, Dongki Jung, Donghwan Lee, Changick Kim
In this paper, we propose SAFENet that is designed to leverage semantic information to overcome the limitations of the photometric loss.
no code implementations • L4DC 2020 • Donghwan Lee, Niao He
The use of target networks is a common practice in deep reinforcement learning for stabilizing the training; however, theoretical understanding of this technique is still limited.
no code implementations • 4 Dec 2019 • Donghwan Lee, Niao He
In this paper, we introduce a unified framework for analyzing a large family of Q-learning algorithms, based on switching system perspectives and ODE-based stochastic approximation.
no code implementations • 1 Dec 2019 • Donghwan Lee, Niao He, Parameswaran Kamalaruban, Volkan Cevher
This article reviews recent advances in multi-agent reinforcement learning algorithms for large-scale control systems and communication networks, which learn to communicate and cooperate.
no code implementations • 24 Apr 2019 • Donghwan Lee, Niao He
The use of target networks has been a popular and key component of recent deep Q-learning algorithms for reinforcement learning, yet little is known from the theory side.
no code implementations • 13 Dec 2018 • Hyung-Jin Yoon, Huaiyu Chen, Kehan Long, Heling Zhang, Aditya Gahlawat, Donghwan Lee, Naira Hovakimyan
The encoding is useful for sharing local visual observations with other agents under communication resource constraints.
no code implementations • 17 Sep 2018 • Hyung-Jin Yoon, Donghwan Lee, Naira Hovakimyan
The objective is to study an online Hidden Markov model (HMM) estimation-based Q-learning algorithm for partially observable Markov decision processes (POMDPs) with finite state and action sets.