1 code implementation • 24 Feb 2025 • Hogun Kee, Wooseok Oh, Minjae Kang, Hyemin Ahn, Songhwai Oh
In this paper, we present the tidiness score-guided Monte Carlo tree search (TSMCTS), a novel framework designed to address the tabletop tidying up problem using only an RGB-D camera.
no code implementations • 25 Oct 2024 • Hojun Chung, Junseo Lee, Minsoo Kim, Dohyeong Kim, Songhwai Oh
Training agents that are robust to environmental changes remains a significant challenge in deep reinforcement learning (RL).
no code implementations • 8 Oct 2024 • Minsoo Kim, Obin Kwon, Howoong Jun, Songhwai Oh
We propose a novel visual localization and navigation framework for real-world environments directly integrating observed visual information into the bird-eye-view map.
1 code implementation • 24 Sep 2024 • Dohyeong Kim, Hyeokjin Kwon, Junseok Kim, Gunmin Lee, Songhwai Oh
As the complexity of tasks addressed through reinforcement learning (RL) increases, the definition of reward functions also has become highly complicated.
no code implementations • 2 Jul 2024 • Hyeokjin Kwon, Gunmin Lee, Junseo Lee, Songhwai Oh
By exploiting a constraint reward (CoR), our framework guides the agent to balance performance goals of reward sum with safety constraints.
no code implementations • 29 May 2024 • Dohyeong Kim, Taehyun Cho, Seungyub Han, Hojun Chung, Kyungjae Lee, Songhwai Oh
Furthermore, the proposed method has been evaluated on continuous control tasks and showed the best performance among other RCRL algorithms satisfying the constraints.
no code implementations • 1 Mar 2024 • Dohyeong Kim, Mineui Hong, Jeongho Park, Songhwai Oh
We address these challenges straightforwardly by treating the maximization of multiple objectives as a constrained optimization problem (COP), where the constraints are defined to improve the original objectives.
Multi-Objective Reinforcement Learning
reinforcement-learning
+1
no code implementations • NeurIPS 2023 • Mineui Hong, Minjae Kang, Songhwai Oh
To this end, we propose a method to utilize a diffusion-based generative sequence model to plan a series of milestones in a latent space and to have an agent to follow the milestones to accomplish a given task.
no code implementations • 1 Dec 2023 • Dohyeong Kim, Songhwai Oh
In this paper, we propose a trust region-based safe RL method with CVaR constraints, called TRC.
no code implementations • 1 Dec 2023 • Dohyeong Kim, Songhwai Oh
This paper aims to solve a safe reinforcement learning (RL) problem with risk measure-based constraints.
no code implementations • CVPR 2023 • Minyoung Hwang, Jaeyeon Jeong, Minsoo Kim, Yoonseon Oh, Songhwai Oh
We show that an exploitation policy, which moves the agent toward a well-chosen local goal among unvisited but observable states, outperforms a method which moves the agent to a previously visited state.
Ranked #2 on
Visual Navigation
on R2R
1 code implementation • CVPR 2023 • Obin Kwon, Jeongho Park, Songhwai Oh
We propose a novel type of map for visual navigation, a renderable neural radiance map (RNR-Map), which is designed to contain the overall visual information of a 3D environment.
1 code implementation • NeurIPS 2023 • Dohyeong Kim, Kyungjae Lee, Songhwai Oh
In safety-critical robotic tasks, potential failures must be reduced, and multiple constraints must be met, such as avoiding collisions, limiting energy consumption, and maintaining balance.
Distributional Reinforcement Learning
reinforcement-learning
+3
no code implementations • 14 Mar 2022 • Seunggyu Chang, Jungchan Cho, Songhwai Oh
To provide sufficient information for estimating a complete texture map, the proposed model simultaneously generates multi-view hallucinations in the image domain and an estimated texture map in the texture domain.
1 code implementation • ICCV 2021 • Obin Kwon, Nuri Kim, Yunho Choi, Hwiyeon Yoo, Jeongho Park, Songhwai Oh
We present a novel graph-structured memory for visual navigation, called visual graph memory (VGM), which consists of unsupervised image representations obtained from navigation history.
no code implementations • 16 Dec 2020 • Hyemin Ahn, Obin Kwon, Kyoungdo Kim, Jaeyeon Jeong, Howoong Jun, Hongjung Lee, Dongheui Lee, Songhwai Oh
We also suggest a relevant dataset and model which can be a baseline, and show that our model trained with the proposed dataset can also be applied to the real world based on the CycleGAN.
no code implementations • NeurIPS 2020 • Kyungjae Lee, Hongjun Yang, Sungbin Lim, Songhwai Oh
In simulation, the proposed estimator shows favorable performance compared to existing robust estimators for various $p$ values and, for MAB problems, the proposed perturbation strategy outperforms existing exploration methods.
no code implementations • 11 Nov 2019 • Hyemin Ahn, Jaehun Kim, Kihyun Kim, Songhwai Oh
The trained dance pose generator, which is a generative autoregressive model, is able to synthesize a dance sequence longer than 5, 000 pose frames.
no code implementations • ICCV 2019 • Chanho Ahn, Eunwoo Kim, Songhwai Oh
To this end, we propose an efficient approach to exploit a compact but accurate model in a backbone architecture for each instance of all tasks.
no code implementations • CVPR 2019 • Eunwoo Kim, Chanho Ahn, Philip H. S. Torr, Songhwai Oh
To this end, we propose a novel network architecture producing multiple networks of different configurations, termed deep virtual networks (DVNs), for different tasks.
no code implementations • 31 Jan 2019 • Kyungjae Lee, Sungyub Kim, Sungbin Lim, Sungjoon Choi, Songhwai Oh
By controlling the entropic index, we can generate various types of entropy, including the SG entropy, and a different entropy results in a different class of the optimal policy in Tsallis MDPs.
1 code implementation • 28 May 2018 • Nuri Kim, Donghoon Lee, Songhwai Oh
Recent object detectors find instances while categorizing candidate regions.
2 code implementations • 28 May 2018 • Hyemin Ahn, Sungjoon Choi, Nuri Kim, Geonho Cha, Songhwai Oh
To handle the inherent ambiguity in human language commands, a suitable question which can resolve the ambiguity is generated.
no code implementations • NeurIPS 2018 • Kyungjae Lee, Sungjoon Choi, Songhwai Oh
Third, we propose a maximum causal Tsallis entropy imitation learning (MCTEIL) algorithm with a sparse mixture density network (sparse MDN) by modeling mixture weights using a sparsemax distribution.
no code implementations • 22 Mar 2018 • Geonho Cha, Minsik Lee, Jungchan Cho, Songhwai Oh
In this paper, to resolve this issue, we propose a multiple-partial-hypothesis-based framework for the problem of estimating 3D human pose from a single image, which can be fine-tuned in an end-to-end fashion.
no code implementations • 12 Jan 2018 • Donghoon Lee, Ming-Hsuan Yang, Songhwai Oh
Single image reflection separation is an ill-posed problem since two scenes, a transmitted scene and a reflected scene, need to be inferred from a single observation.
no code implementations • CVPR 2018 • Eunwoo Kim, Chanho Ahn, Songhwai Oh
A nested sparse network consists of multiple levels of networks with a different sparsity ratio associated with each level, and higher level networks share parameters with lower level networks to enable stable nested learning.
1 code implementation • 15 Oct 2017 • Hyemin Ahn, Timothy Ha, Yunho Choi, Hwiyeon Yoo, Songhwai Oh
We demonstrate that the network can generate human-like actions which can be transferred to a Baxter robot, such that the robot performs an action based on a provided sentence.
no code implementations • 19 Sep 2017 • Kyungjae Lee, Sungjoon Choi, Songhwai Oh
The proposed sparse MDP is compared to soft MDPs which utilize causal entropy regularization.
1 code implementation • 3 Sep 2017 • Sungjoon Choi, Kyungjae Lee, Sungbin Lim, Songhwai Oh
The proposed uncertainty-aware learning from demonstration method outperforms other compared methods in terms of safety using a complex real-world driving dataset.
1 code implementation • ECCV 2018 • Donghoon Lee, Sangdoo Yun, Sungjoon Choi, Hwiyeon Yoo, Ming-Hsuan Yang, Songhwai Oh
We introduce a new problem of generating an image based on a small number of key local patches without any geometric prior.
no code implementations • 12 Aug 2016 • Sungjoon Choi, Kyungjae Lee, Andy Park, Songhwai Oh
The performance of KDMRL is extensively evaluated in two sets of experiments: grid world and track driving experiments.
no code implementations • CVPR 2016 • Minsik Lee, Jungchan Cho, Songhwai Oh
Recently, there have been many progresses for the problem of non-rigid structure reconstruction based on 2D trajectories, but it is still challenging to deal with complex deformations or restricted view ranges.
no code implementations • ICCV 2015 • Donghoon Lee, Ming-Hsuan Yang, Songhwai Oh
In this paper, we consider the problem of estimating the gaze direction of a person from a low-resolution image.
no code implementations • CVPR 2015 • Eunwoo Kim, Minsik Lee, Songhwai Oh
The proposed method is applied to a number of low-rank matrix approximation problems to demonstrate its efficiency in the presence of heavy corruptions and to show its effectiveness and robustness compared to the existing methods.
no code implementations • CVPR 2014 • Minsik Lee, Chong-Ho Choi, Songhwai Oh
Recovering a non-rigid 3D structure from a series of 2D observations is still a difficult problem to solve accurately.
no code implementations • CVPR 2013 • Minsik Lee, Jungchan Cho, Chong-Ho Choi, Songhwai Oh
Non-rigid structure from motion is a fundamental problem in computer vision, which is yet to be solved satisfactorily.
no code implementations • IEEE Transactions on Automatic Control 2009 • Songhwai Oh, Stuart Russell, Shankar Sastry
This paper presents Markov chain Monte Carlo data association (MCMCDA) for solving data association problems arising in multiple-target tracking in a cluttered environment.