Search Results for author: Simon Shaolei Du

Found 8 papers, 2 papers with code

Offline Multi-task Transfer RL with Representational Penalization

no code implementations19 Feb 2024 Avinandan Bose, Simon Shaolei Du, Maryam Fazel

We study the problem of representation transfer in offline Reinforcement Learning (RL), where a learner has access to episodic data from a number of source tasks collected a priori, and aims to learn a shared representation to be used in finding a good policy for a target task.

Offline RL Reinforcement Learning (RL)

Variance Alignment Score: A Simple But Tough-to-Beat Data Selection Method for Multimodal Contrastive Learning

no code implementations3 Feb 2024 Yiping Wang, Yifang Chen, Wendan Yan, Kevin Jamieson, Simon Shaolei Du

In recent years, data selection has emerged as a core issue for large-scale visual-language model pretraining, especially on noisy web-curated datasets.

Contrastive Learning Experimental Design +1

Free from Bellman Completeness: Trajectory Stitching via Model-based Return-conditioned Supervised Learning

1 code implementation30 Oct 2023 Zhaoyi Zhou, Chuning Zhu, Runlong Zhou, Qiwen Cui, Abhishek Gupta, Simon Shaolei Du

Off-policy dynamic programming (DP) techniques such as $Q$-learning have proven to be important in sequential decision-making problems.

Decision Making Offline RL +1

Robust Offline Reinforcement Learning -- Certify the Confidence Interval

no code implementations28 Sep 2023 Jiarui Yao, Simon Shaolei Du

Currently, reinforcement learning (RL), especially deep RL, has received more and more attention in the research area.

reinforcement-learning Reinforcement Learning (RL)

A Benchmark for Low-Switching-Cost Reinforcement Learning

no code implementations13 Dec 2021 Shusheng Xu, Yancheng Liang, Yunfei Li, Simon Shaolei Du, Yi Wu

A ubiquitous requirement in many practical reinforcement learning (RL) applications, including medical treatment, recommendation system, education and robotics, is that the deployed policy that actually interacts with the environment cannot change frequently.

Atari Games reinforcement-learning +1

Deep Q-Learning with Low Switching Cost

no code implementations1 Jan 2021 Shusheng Xu, Simon Shaolei Du, Yi Wu

We initiate the study on deep reinforcement learning problems that require low switching cost, i. e., small number of policy switches during training.

Atari Games Q-Learning +2

Hypothesis Transfer Learning via Transformation Functions

no code implementations NeurIPS 2017 Simon Shaolei Du, Jayanth Koushik, Aarti Singh, Barnabas Poczos

We consider the Hypothesis Transfer Learning (HTL) problem where one incorporates a hypothesis trained on the source domain into the learning procedure of the target domain.

Transfer Learning

Cannot find the paper you are looking for? You can Submit a new open access paper.