Search Results for author: Songhwai Oh

Found 38 papers, 10 papers with code

Tidiness Score-Guided Monte Carlo Tree Search for Visual Tabletop Rearrangement

1 code implementation24 Feb 2025 Hogun Kee, Wooseok Oh, Minjae Kang, Hyemin Ahn, Songhwai Oh

In this paper, we present the tidiness score-guided Monte Carlo tree search (TSMCTS), a novel framework designed to address the tabletop tidying up problem using only an RGB-D camera.

Adversarial Environment Design via Regret-Guided Diffusion Models

no code implementations25 Oct 2024 Hojun Chung, Junseo Lee, Minsoo Kim, Dohyeong Kim, Songhwai Oh

Training agents that are robust to environmental changes remains a significant challenge in deep reinforcement learning (RL).

Deep Reinforcement Learning Diversity +2

RNR-Nav: A Real-World Visual Navigation System Using Renderable Neural Radiance Maps

no code implementations8 Oct 2024 Minsoo Kim, Obin Kwon, Howoong Jun, Songhwai Oh

We propose a novel visual localization and navigation framework for real-world environments directly integrating observed visual information into the bird-eye-view map.

Visual Localization Visual Navigation

Stage-Wise Reward Shaping for Acrobatic Robots: A Constrained Multi-Objective Reinforcement Learning Approach

1 code implementation24 Sep 2024 Dohyeong Kim, Hyeokjin Kwon, Junseok Kim, Gunmin Lee, Songhwai Oh

As the complexity of tasks addressed through reinforcement learning (RL) increases, the definition of reward functions also has become highly complicated.

Multi-Objective Reinforcement Learning

Spectral-Risk Safe Reinforcement Learning with Convergence Guarantees

no code implementations29 May 2024 Dohyeong Kim, Taehyun Cho, Seungyub Han, Hojun Chung, Kyungjae Lee, Songhwai Oh

Furthermore, the proposed method has been evaluated on continuous control tasks and showed the best performance among other RCRL algorithms satisfying the constraints.

Bilevel Optimization continuous-control +4

Conflict-Averse Gradient Aggregation for Constrained Multi-Objective Reinforcement Learning

no code implementations1 Mar 2024 Dohyeong Kim, Mineui Hong, Jeongho Park, Songhwai Oh

We address these challenges straightforwardly by treating the maximization of multiple objectives as a constrained optimization problem (COP), where the constraints are defined to improve the original objectives.

Multi-Objective Reinforcement Learning reinforcement-learning +1

Diffused Task-Agnostic Milestone Planner

no code implementations NeurIPS 2023 Mineui Hong, Minjae Kang, Songhwai Oh

To this end, we propose a method to utilize a diffusion-based generative sequence model to plan a series of milestones in a latent space and to have an agent to follow the milestones to accomplish a given task.

Decision Making Offline RL +1

Meta-Explore: Exploratory Hierarchical Vision-and-Language Navigation Using Scene Object Spectrum Grounding

no code implementations CVPR 2023 Minyoung Hwang, Jaeyeon Jeong, Minsoo Kim, Yoonseon Oh, Songhwai Oh

We show that an exploitation policy, which moves the agent toward a well-chosen local goal among unvisited but observable states, outperforms a method which moves the agent to a previously visited state.

Vision and Language Navigation Visual Navigation

Renderable Neural Radiance Map for Visual Navigation

1 code implementation CVPR 2023 Obin Kwon, Jeongho Park, Songhwai Oh

We propose a novel type of map for visual navigation, a renderable neural radiance map (RNR-Map), which is designed to contain the overall visual information of a 3D environment.

Descriptive Visual Localization +1

Trust Region-Based Safe Distributional Reinforcement Learning for Multiple Constraints

1 code implementation NeurIPS 2023 Dohyeong Kim, Kyungjae Lee, Songhwai Oh

In safety-critical robotic tasks, potential failures must be reduced, and multiple constraints must be met, such as avoiding collisions, limiting energy consumption, and maintaining balance.

Distributional Reinforcement Learning reinforcement-learning +3

Texture Generation Using Dual-Domain Feature Flow with Multi-View Hallucinations

no code implementations14 Mar 2022 Seunggyu Chang, Jungchan Cho, Songhwai Oh

To provide sufficient information for estimating a complete texture map, the proposed model simultaneously generates multi-view hallucinations in the image domain and an estimated texture map in the texture domain.

Texture Synthesis

Visual Graph Memory With Unsupervised Representation for Visual Navigation

1 code implementation ICCV 2021 Obin Kwon, Nuri Kim, Yunho Choi, Hwiyeon Yoo, Jeongho Park, Songhwai Oh

We present a novel graph-structured memory for visual navigation, called visual graph memory (VGM), which consists of unsupervised image representations obtained from navigation history.

Navigate Visual Navigation

Visually Grounding Language Instruction for History-Dependent Manipulation

no code implementations16 Dec 2020 Hyemin Ahn, Obin Kwon, Kyoungdo Kim, Jaeyeon Jeong, Howoong Jun, Hongjung Lee, Dongheui Lee, Songhwai Oh

We also suggest a relevant dataset and model which can be a baseline, and show that our model trained with the proposed dataset can also be applied to the real world based on the CycleGAN.

Optimal Algorithms for Stochastic Multi-Armed Bandits with Heavy Tailed Rewards

no code implementations NeurIPS 2020 Kyungjae Lee, Hongjun Yang, Sungbin Lim, Songhwai Oh

In simulation, the proposed estimator shows favorable performance compared to existing robust estimators for various $p$ values and, for MAB problems, the proposed perturbation strategy outperforms existing exploration methods.

Multi-Armed Bandits

Generative Autoregressive Networks for 3D Dancing Move Synthesis from Music

no code implementations11 Nov 2019 Hyemin Ahn, Jaehun Kim, Kihyun Kim, Songhwai Oh

The trained dance pose generator, which is a generative autoregressive model, is able to synthesize a dance sequence longer than 5, 000 pose frames.

Deep Elastic Networks with Model Selection for Multi-Task Learning

no code implementations ICCV 2019 Chanho Ahn, Eunwoo Kim, Songhwai Oh

To this end, we propose an efficient approach to exploit a compact but accurate model in a backbone architecture for each instance of all tasks.

Image Classification Model Selection +1

Deep Virtual Networks for Memory Efficient Inference of Multiple Tasks

no code implementations CVPR 2019 Eunwoo Kim, Chanho Ahn, Philip H. S. Torr, Songhwai Oh

To this end, we propose a novel network architecture producing multiple networks of different configurations, termed deep virtual networks (DVNs), for different tasks.

Tsallis Reinforcement Learning: A Unified Framework for Maximum Entropy Reinforcement Learning

no code implementations31 Jan 2019 Kyungjae Lee, Sungyub Kim, Sungbin Lim, Sungjoon Choi, Songhwai Oh

By controlling the entropic index, we can generate various types of entropy, including the SG entropy, and a different entropy results in a different class of the optimal policy in Tsallis MDPs.

MuJoCo reinforcement-learning +2

Interactive Text2Pickup Network for Natural Language based Human-Robot Collaboration

2 code implementations28 May 2018 Hyemin Ahn, Sungjoon Choi, Nuri Kim, Geonho Cha, Songhwai Oh

To handle the inherent ambiguity in human language commands, a suitable question which can resolve the ambiguity is generated.

Object Position

Maximum Causal Tsallis Entropy Imitation Learning

no code implementations NeurIPS 2018 Kyungjae Lee, Sungjoon Choi, Songhwai Oh

Third, we propose a maximum causal Tsallis entropy imitation learning (MCTEIL) algorithm with a sparse mixture density network (sparse MDN) by modeling mixture weights using a sparsemax distribution.

Imitation Learning

Deep Pose Consensus Networks

no code implementations22 Mar 2018 Geonho Cha, Minsik Lee, Jungchan Cho, Songhwai Oh

In this paper, to resolve this issue, we propose a multiple-partial-hypothesis-based framework for the problem of estimating 3D human pose from a single image, which can be fine-tuned in an end-to-end fashion.

Generative Single Image Reflection Separation

no code implementations12 Jan 2018 Donghoon Lee, Ming-Hsuan Yang, Songhwai Oh

Single image reflection separation is an ill-posed problem since two scenes, a transmitted scene and a reflected scene, need to be inferred from a single observation.

NestedNet: Learning Nested Sparse Structures in Deep Neural Networks

no code implementations CVPR 2018 Eunwoo Kim, Chanho Ahn, Songhwai Oh

A nested sparse network consists of multiple levels of networks with a different sparsity ratio associated with each level, and higher level networks share parameters with lower level networks to enable stable nested learning.

Knowledge Distillation Scheduling

Text2Action: Generative Adversarial Synthesis from Language to Action

1 code implementation15 Oct 2017 Hyemin Ahn, Timothy Ha, Yunho Choi, Hwiyeon Yoo, Songhwai Oh

We demonstrate that the network can generate human-like actions which can be transferred to a Baxter robot, such that the robot performs an action based on a provided sentence.

Decoder Generative Adversarial Network +1

Uncertainty-Aware Learning from Demonstration using Mixture Density Networks with Sampling-Free Variance Modeling

1 code implementation3 Sep 2017 Sungjoon Choi, Kyungjae Lee, Sungbin Lim, Songhwai Oh

The proposed uncertainty-aware learning from demonstration method outperforms other compared methods in terms of safety using a complex real-world driving dataset.

Autonomous Driving

Unsupervised Holistic Image Generation from Key Local Patches

1 code implementation ECCV 2018 Donghoon Lee, Sangdoo Yun, Sungjoon Choi, Hwiyeon Yoo, Ming-Hsuan Yang, Songhwai Oh

We introduce a new problem of generating an image based on a small number of key local patches without any geometric prior.

Decoder Image Generation

Density Matching Reward Learning

no code implementations12 Aug 2016 Sungjoon Choi, Kyungjae Lee, Andy Park, Songhwai Oh

The performance of KDMRL is extensively evaluated in two sets of experiments: grid world and track driving experiments.

Autonomous Navigation Reinforcement Learning

Consensus of Non-Rigid Reconstructions

no code implementations CVPR 2016 Minsik Lee, Jungchan Cho, Songhwai Oh

Recently, there have been many progresses for the problem of non-rigid structure reconstruction based on 2D trajectories, but it is still challenging to deal with complex deformations or restricted view ranges.

Elastic-Net Regularization of Singular Values for Robust Subspace Learning

no code implementations CVPR 2015 Eunwoo Kim, Minsik Lee, Songhwai Oh

The proposed method is applied to a number of low-rank matrix approximation problems to demonstrate its efficiency in the presence of heavy corruptions and to show its effectiveness and robustness compared to the existing methods.

A Procrustean Markov Process for Non-Rigid Structure Recovery

no code implementations CVPR 2014 Minsik Lee, Chong-Ho Choi, Songhwai Oh

Recovering a non-rigid 3D structure from a series of 2D observations is still a difficult problem to solve accurately.

Procrustean Normal Distribution for Non-rigid Structure from Motion

no code implementations CVPR 2013 Minsik Lee, Jungchan Cho, Chong-Ho Choi, Songhwai Oh

Non-rigid structure from motion is a fundamental problem in computer vision, which is yet to be solved satisfactorily.

Markov Chain Monte Carlo Data Association for Multiple-Target Tracking

no code implementations IEEE Transactions on Automatic Control 2009 Songhwai Oh, Stuart Russell, Shankar Sastry

This paper presents Markov chain Monte Carlo data association (MCMCDA) for solving data association problems arising in multiple-target tracking in a cluttered environment.

Cannot find the paper you are looking for? You can Submit a new open access paper.