Search Results for author: Zeyu Jia

Found 6 papers, 0 papers with code

Offline Reinforcement Learning: Role of State Aggregation and Trajectory Data

no code implementations 25 Mar 2024 Zeyu Jia, Alexander Rakhlin, Ayush Sekhari, Chen-Yu Wei

We revisit the problem of offline reinforcement learning with value function realizability but without Bellman completeness.

reinforcement-learning

Linear Reinforcement Learning with Ball Structure Action Space

no code implementations 14 Nov 2022 Zeyu Jia, Randy Jia, Dhruv Madeka, Dean P. Foster

We study the problem of Reinforcement Learning (RL) with linear function approximation, i.e., assuming the optimal action-value function is linear in a known $d$-dimensional feature mapping.

reinforcement-learning Reinforcement Learning (RL)

Intrinsic Dimension Estimation Using Wasserstein Distances

no code implementations 8 Jun 2021 Adam Block, Zeyu Jia, Yury Polyanskiy, Alexander Rakhlin

It has long been thought that high-dimensional data encountered in many practical machine learning tasks have low-dimensional structure, i.e., the manifold hypothesis holds.

BIG-bench Machine Learning

Model-Based Reinforcement Learning with Value-Targeted Regression

no code implementations ICML 2020 Alex Ayoub, Zeyu Jia, Csaba Szepesvari, Mengdi Wang, Lin F. Yang

We propose a model-based RL algorithm based on the optimism principle: in each episode, the set of models that are 'consistent' with the data collected is constructed.

Model-based Reinforcement Learning regression

Provably Benefits of Deep Hierarchical RL

no code implementations 25 Sep 2019 Zeyu Jia, Simon S. Du, Ruosong Wang, Mengdi Wang, Lin F. Yang

Modern complex sequential decision-making problems often require both low-level policy and high-level planning.

Decision Making Hierarchical Reinforcement Learning

Feature-Based Q-Learning for Two-Player Stochastic Games

no code implementations 2 Jun 2019 Zeyu Jia, Lin F. Yang, Mengdi Wang

Consider a two-player zero-sum stochastic game where the transition function can be embedded in a given feature space.

Q-Learning Vocal Bursts Valence Prediction
