Search Results for author: Rei Sato

Found 5 papers, 4 papers with code

Stepwise Alignment for Constrained Language Model Policy Optimization

no code implementations • 17 Apr 2024 • Akifumi Wachi, Thien Q Tran, Rei Sato, Takumi Tanabe, Yohei Akimoto

This paper formulates a human value alignment as a language model policy optimization problem to maximize reward under a safety constraint and then proposes an algorithm called Stepwise Alignment for Constrained Policy Optimization (SACPO).

Computational Efficiency Language Modelling

Paper
Add Code

Few-Shot Image-to-Semantics Translation for Policy Transfer in Reinforcement Learning

1 code implementation • 31 Jan 2023 • Rei Sato, Kazuto Fukuchi, Jun Sakuma, Youhei Akimoto

We investigate policy transfer using image-to-semantics translation to mitigate learning difficulties in vision-based robotics control agents.

Active Learning Computational Efficiency +3

Paper
Code

Max-Min Off-Policy Actor-Critic Method Focusing on Worst-Case Robustness to Model Misspecification

1 code implementation • 7 Nov 2022 • Takumi Tanabe, Rei Sato, Kazuto Fukuchi, Jun Sakuma, Youhei Akimoto

In this study, we focus on scenarios involving a simulation environment with uncertainty parameters and the set of their possible values, called the uncertainty parameter set.

Paper
Code

AdvantageNAS: Efficient Neural Architecture Search with Credit Assignment

1 code implementation • 11 Dec 2020 • Rei Sato, Jun Sakuma, Youhei Akimoto

In this paper, we propose a novel search strategy for one-shot and sparse propagation NAS, namely AdvantageNAS, which further reduces the time complexity of NAS by reducing the number of search iterations.

Neural Architecture Search

Paper
Code

Scaling Hypothesis of Spatial Search on Fractal Lattice Using Quantum Walk

1 code implementation • 30 Aug 2019 • Rei Sato, Tetsuro Nikuni, Shohei Watabe

We investigate a quantum spatial search problem on fractal lattices, such as Sierpinski carpets and Menger sponges.

Quantum Physics

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.