Search Results for author: Youngsoo Jang

Found 7 papers, 1 paper with code

Variational Inference for Sequential Data with Future Likelihood Estimates

no code implementations • ICML 2020 • Geon-Hyeong Kim, Youngsoo Jang, Hongseok Yang, Kee-Eung Kim

The estimated future likelihoods form the core of our new low-variance gradient estimator.

Reinforcement Learning from Reflective Feedback (RLRF): Aligning and Improving LLMs via Fine-Grained Self-Reflection

no code implementations • 21 Mar 2024 • Kyungjae Lee, Dasol Hwang, Sunghyun Park, Youngsoo Jang, Moontae Lee

Despite the promise of RLHF in aligning LLMs with human preferences, it often leads to superficial alignment, prioritizing stylistic changes over improving downstream performance of LLMs.

Mathematical Reasoning

LobsDICE: Offline Learning from Observation via Stationary Distribution Correction Estimation

2 code implementations • 28 Feb 2022 • Geon-Hyeong Kim, Jongmin Lee, Youngsoo Jang, Hongseok Yang, Kee-Eung Kim

We consider the problem of learning from observation (LfO), in which the agent aims to mimic the expert's behavior from the state-only demonstrations by experts.

Imitation Learning

Offline Reinforcement Learning for Large Scale Language Action Spaces

no code implementations • ICLR 2022 • Youngsoo Jang, Jongmin Lee, Kee-Eung Kim

GPT-Critic is essentially free from the issue of diverging from human language since it learns from the sentences sampled from the pre-trained language model.

Language Modelling • Offline RL • +2

Monte-Carlo Planning and Learning with Language Action Value Estimates

no code implementations • ICLR 2021 • Youngsoo Jang, Seokin Seo, Jongmin Lee, Kee-Eung Kim

Interactive Fiction (IF) games provide a useful testbed for language-based reinforcement learning agents, posing significant challenges of natural language understanding, commonsense reasoning, and non-myopic planning in the combinatorial search space.

Natural Language Understanding • reinforcement-learning • +1

End-to-End Neural Pipeline for Goal-Oriented Dialogue Systems using GPT-2

no code implementations • ACL 2020 • Donghoon Ham, Jeong-Gwan Lee, Youngsoo Jang, Kee-Eung Kim

The goal-oriented dialogue system needs to be optimized for tracking the dialogue flow and carrying out an effective conversation under various situations to meet the user goal.

Goal-Oriented Dialogue Systems

PyOpenDial: A Python-based Domain-Independent Toolkit for Developing Spoken Dialogue Systems with Probabilistic Rules

no code implementations • IJCNLP 2019 • Youngsoo Jang, Jongmin Lee, Jaeyoung Park, Kyeng-Hun Lee, Pierre Lison, Kee-Eung Kim

We present PyOpenDial, a Python-based domain-independent, open-source toolkit for spoken dialogue systems.

Dialogue Management • Dialogue State Tracking • +5
