Search Results for author: Daiki Kuyoshi

Found 2 papers, 1 papers with code

Extrinsicaly Rewarded Soft Q Imitation Learning with Discriminator

no code implementations30 Jan 2024 Ryoma Furuyama, Daiki Kuyoshi, Satoshi Yamane

In order to make this algorithm more robust to distribution shift, we propose more efficient and robust algorithm by adding to this method a reward function based on adversarial inverse reinforcement learning that rewards the agent for performing actions in status similar to the demo.

Imitation Learning Q-Learning +1

Discriminator Soft Actor Critic without Extrinsic Rewards

1 code implementation19 Jan 2020 Daichi Nishio, Daiki Kuyoshi, Toi Tsuneda, Satoshi Yamane

The methods based on reinforcement learning, such as inverse reinforcement learning and generative adversarial imitation learning (GAIL), can learn from only a few expert data.

Imitation Learning Q-Learning +2

Cannot find the paper you are looking for? You can Submit a new open access paper.