Search Results for author: Satoshi Yamane

Found 7 papers, 2 papers with code

Extrinsicaly Rewarded Soft Q Imitation Learning with Discriminator

no code implementations30 Jan 2024 Ryoma Furuyama, Daiki Kuyoshi, Satoshi Yamane

In order to make this algorithm more robust to distribution shift, we propose more efficient and robust algorithm by adding to this method a reward function based on adversarial inverse reinforcement learning that rewards the agent for performing actions in status similar to the demo.

Imitation Learning Q-Learning +1

Staged Depthwise Correlation and Feature Fusion for Siamese Object Tracking

no code implementations15 Oct 2023 Dianbo Ma, Jianqiang Xiao, Ziyan Gao, Satoshi Yamane

In this work, we propose a novel staged depthwise correlation and feature fusion network, named DCFFNet, to further optimize the feature extraction for visual tracking.

Object Tracking Visual Tracking

Discriminator Soft Actor Critic without Extrinsic Rewards

1 code implementation19 Jan 2020 Daichi Nishio, Daiki Kuyoshi, Toi Tsuneda, Satoshi Yamane

The methods based on reinforcement learning, such as inverse reinforcement learning and generative adversarial imitation learning (GAIL), can learn from only a few expert data.

Imitation Learning Q-Learning +2

Improving Minimal Gated Unit for Sequential Data

no code implementations21 May 2019 Kazuki Takamura, Satoshi Yamane

As a result of the experiment, the effectiveness of the proposed method was confirmed.

Machine Translation speech-recognition +2

Random Projection in Neural Episodic Control

1 code implementation3 Apr 2019 Daichi Nishio, Satoshi Yamane

End-to-end deep reinforcement learning has enabled agents to learn with little preprocessing by humans.

Reinforcement Learning (RL)

Faster Deep Q-learning using Neural Episodic Control

no code implementations6 Jan 2018 Daichi Nishio, Satoshi Yamane

The research on deep reinforcement learning which estimates Q-value by deep learning has been attracted the interest of researchers recently.

Q-Learning reinforcement-learning +1

Cannot find the paper you are looking for? You can Submit a new open access paper.