Search Results for author: Vincent Liu

Found 17 papers, 2 papers with code

AlpaServe: Statistical Multiplexing with Model Parallelism for Deep Learning Serving

2 code implementations • 22 Feb 2023 • Zhuohan Li, Lianmin Zheng, Yinmin Zhong, Vincent Liu, Ying Sheng, Xin Jin, Yanping Huang, Zhifeng Chen, Hao Zhang, Joseph E. Gonzalez, Ion Stoica

Model parallelism is conventionally viewed as a method to scale a single large deep learning model beyond the memory limits of a single device.

2,983

Paper
Code

DABS: A Domain-Agnostic Benchmark for Self-Supervised Learning

1 code implementation • 23 Nov 2021 • Alex Tamkin, Vincent Liu, Rongfei Lu, Daniel Fein, Colin Schultz, Noah Goodman

Self-supervised learning algorithms, including BERT and SimCLR, have enabled significant strides in fields like natural language processing, computer vision, and speech processing.

Ranked #1 on Self-Supervised Learning on DABS

Self-Supervised Learning

104

Paper
Code

Attribute-aware Collaborative Filtering: Survey and Classification

no code implementations • 20 Oct 2018 • Wen-Hao Chen, Chin-Chi Hsu, Yi-An Lai, Vincent Liu, Mi-Yen Yeh, Shou-De Lin

Attribute-aware CF models aims at rating prediction given not only the historical rating from users to items, but also the information associated with users (e. g. age), items (e. g. price), or even ratings (e. g. rating time).

Attribute Classification +2

Paper
Add Code

The Utility of Sparse Representations for Control in Reinforcement Learning

no code implementations • 15 Nov 2018 • Vincent Liu, Raksha Kumaraswamy, Lei Le, Martha White

We investigate sparse representations for control in reinforcement learning.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

Recurrent Control Nets for Deep Reinforcement Learning

no code implementations • 6 Jan 2019 • Vincent Liu, Ademi Adeniji, Nathaniel Lee, Jason Zhao, Mario Srouji

Central Pattern Generators (CPGs) are biological neural circuits capable of producing coordinated rhythmic outputs in the absence of rhythmic input.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

Incrementally Learning Functions of the Return

no code implementations • 5 Jul 2019 • Brendan Bennett, Wesley Chung, Muhammad Zaheer, Vincent Liu

Temporal difference methods enable efficient estimation of value functions in reinforcement learning in an incremental fashion, and are of broader interest because they correspond learning as observed in biological systems.

Reinforcement Learning (RL)

Paper
Add Code

Performance metrics for intervention-triggering prediction models do not reflect an expected reduction in outcomes from using the model

no code implementations • 2 Jun 2020 • Alejandro Schuler, Aashish Bhardwaj, Vincent Liu

Clinical researchers often select among and evaluate risk prediction models using standard machine learning metrics based on confusion matrices.

valid

Paper
Add Code

Towards a practical measure of interference for reinforcement learning

no code implementations • 7 Jul 2020 • Vincent Liu, Adam White, Hengshuai Yao, Martha White

In this work, we provide a definition of interference for control in reinforcement learning.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

Training Recurrent Neural Networks Online by Learning Explicit State Variables

no code implementations • ICLR 2020 • Somjit Nath, Vincent Liu, Alan Chan, Xin Li, Adam White, Martha White

Recurrent neural networks (RNNs) allow an agent to construct a state-representation from a stream of experience, which is essential in partially observable problems.

Paper
Add Code

Exploiting Action Impact Regularity and Exogenous State Variables for Offline Reinforcement Learning

no code implementations • 15 Nov 2021 • Vincent Liu, James R. Wright, Martha White

Offline reinforcement learning -- learning a policy from a batch of data -- is known to be hard for general MDPs.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

Investigating the Properties of Neural Network Representations in Reinforcement Learning

no code implementations • 30 Mar 2022 • Han Wang, Erfan Miahi, Martha White, Marlos C. Machado, Zaheer Abbas, Raksha Kumaraswamy, Vincent Liu, Adam White

In this paper we investigate the properties of representations learned by deep reinforcement learning systems.

Q-Learning reinforcement-learning +2

Paper
Add Code

No More Pesky Hyperparameters: Offline Hyperparameter Tuning for RL

no code implementations • 18 May 2022 • Han Wang, Archit Sakhadeo, Adam White, James Bell, Vincent Liu, Xutong Zhao, Puer Liu, Tadashi Kozuno, Alona Fyshe, Martha White

The performance of reinforcement learning (RL) agents is sensitive to the choice of hyperparameters.

Reinforcement Learning (RL)

Paper
Add Code

Asymptotically Unbiased Off-Policy Policy Evaluation when Reusing Old Data in Nonstationary Environments

no code implementations • 23 Feb 2023 • Vincent Liu, Yash Chandak, Philip Thomas, Martha White

In this work, we consider the off-policy policy evaluation problem for contextual bandits and finite horizon reinforcement learning in the nonstationary setting.

Multi-Armed Bandits regression +2

Paper
Add Code

Measuring and Mitigating Interference in Reinforcement Learning

no code implementations • 10 Jul 2023 • Vincent Liu, Han Wang, Ruo Yu Tao, Khurram Javed, Adam White, Martha White

Lastly, we outline a class of algorithms which we call online-aware that are designed to mitigate interference, and show they do reduce interference according to our measure and that they improve stability and performance in several classic control environments.

reinforcement-learning

Paper
Add Code

When is Offline Policy Selection Sample Efficient for Reinforcement Learning?

no code implementations • 4 Dec 2023 • Vincent Liu, Prabhat Nagarajan, Andrew Patterson, Martha White

As a result, no OPS method can be more sample efficient than OPE in the worst case.

reinforcement-learning

Paper
Add Code

Under the Surface: Tracking the Artifactuality of LLM-Generated Data

no code implementations • 26 Jan 2024 • Debarati Das, Karin de Langis, Anna Martin-Boyle, Jaehyung Kim, Minhwa Lee, Zae Myung Kim, Shirley Anugrah Hayati, Risako Owan, Bin Hu, Ritik Parkar, Ryan Koo, Jonginn Park, Aahan Tyagi, Libby Ferland, Sanjali Roy, Vincent Liu, Dongyeop Kang

This work delves into the expanding role of large language models (LLMs) in generating artificial data.

Paper
Add Code

Switching the Loss Reduces the Cost in Batch Reinforcement Learning

no code implementations • 8 Mar 2024 • Alex Ayoub, Kaiwen Wang, Vincent Liu, Samuel Robertson, James McInerney, Dawen Liang, Nathan Kallus, Csaba Szepesvári

We propose training fitted Q-iteration with log-loss (FQI-LOG) for batch reinforcement learning (RL).

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.