Search Results for author: Daniel Wontae Nam

Found 6 papers, 1 papers with code

Binary Classifier Optimization for Large Language Model Alignment

no code implementations6 Apr 2024 Seungjae Jung, Gunsoo Han, Daniel Wontae Nam, Kyoung-Woon On

In the process of this discovery, we identified two techniques for effective alignment: reward shift and underlying distribution matching.

Language Modelling Large Language Model

Hexa: Self-Improving for Knowledge-Grounded Dialogue System

no code implementations10 Oct 2023 DaeJin Jo, Daniel Wontae Nam, Gunsoo Han, Kyoung-Woon On, Taehwan Kwon, Seungeun Rho, Sungwoong Kim

A common practice in knowledge-grounded dialogue generation is to explicitly utilize intermediate steps (e. g., web-search, memory retrieval) with modular approaches.

Dialogue Generation Retrieval

LECO: Learnable Episodic Count for Task-Specific Intrinsic Reward

1 code implementation11 Oct 2022 DaeJin Jo, Sungwoong Kim, Daniel Wontae Nam, Taehwan Kwon, Seungeun Rho, Jongmin Kim, Donghoon Lee

In order to resolve these issues, in this paper, we propose a learnable hash-based episodic count, which we name LECO, that efficiently performs as a task-specific intrinsic reward in hard exploration problems.

Efficient Exploration reinforcement-learning

GMAC: A Distributional Perspective on Actor-Critic Framework

no code implementations24 May 2021 Daniel Wontae Nam, Younghoon Kim, Chan Y. Park

In this paper, we devise a distributional framework on actor-critic as a solution to distributional instability, action type restriction, and conflation between samples and statistics.

Atari Games

A Distributional Perspective on Actor-Critic Framework

no code implementations1 Jan 2021 Daniel Wontae Nam, Younghoon Kim, Chan Youn Park

Recent distributional reinforcement learning methods, despite their successes, still contain fundamental problems that can lead to inaccurate representations of value distributions, such as distributional instability, action type restriction, and biased approximation.

Distributional Reinforcement Learning

Cannot find the paper you are looking for? You can Submit a new open access paper.