Search Results for author: Samuel Yang-Zhao

Found 4 papers, 0 papers with code

Dynamic Knowledge Injection for AIXI Agents

no code implementations18 Dec 2023 Samuel Yang-Zhao, Kee Siong Ng, Marcus Hutter

Prior approximations of AIXI, a Bayesian optimality notion for general reinforcement learning, can only approximate AIXI's Bayesian environment model using an a-priori defined set of models.

General Reinforcement Learning

A Direct Approximation of AIXI Using Logical State Abstractions

no code implementations13 Oct 2022 Samuel Yang-Zhao, Tianyu Wang, Kee Siong Ng

We propose a practical integration of logical state abstraction with AIXI, a Bayesian optimality notion for reinforcement learning agents, to significantly expand the model class that AIXI agents can be approximated over to complex history-dependent and structured environments.

Factored Conditional Filtering: Tracking States and Estimating Parameters in High-Dimensional Spaces

no code implementations5 Jun 2022 Dawei Chen, Samuel Yang-Zhao, John Lloyd, Kee Siong Ng

This paper introduces the factored conditional filter, a new filtering algorithm for simultaneously tracking states and estimating parameters in high-dimensional state spaces.

Conditions on Features for Temporal Difference-Like Methods to Converge

no code implementations28 May 2019 Marcus Hutter, Samuel Yang-Zhao, Sultan J. Majeed

The convergence of many reinforcement learning (RL) algorithms with linear function approximation has been investigated extensively but most proofs assume that these methods converge to a unique solution.

reinforcement-learning Reinforcement Learning (RL) +1

Cannot find the paper you are looking for? You can Submit a new open access paper.