Search Results for author: Yifang Chen

Found 17 papers, 1 paper with code

Variance Alignment Score: A Simple But Tough-to-Beat Data Selection Method for Multimodal Contrastive Learning

no code implementations • 3 Feb 2024 • Yiping Wang, Yifang Chen, Wendan Yan, Kevin Jamieson, Simon Shaolei Du

In recent years, data selection has emerged as a core issue for large-scale visual-language model pretraining, especially on noisy web-curated datasets.

Contrastive Learning Experimental Design +1

An Experimental Design Framework for Label-Efficient Supervised Finetuning of Large Language Models

no code implementations • 12 Jan 2024 • Gantavya Bhatt, Yifang Chen, Arnav M. Das, Jifan Zhang, Sang T. Truong, Stephen Mussmann, Yinglun Zhu, Jeffrey Bilmes, Simon S. Du, Kevin Jamieson, Jordan T. Ash, Robert D. Nowak

To mitigate the annotation cost of SFT and circumvent the computational bottlenecks of active learning, we propose using experimental design.

Active Learning Experimental Design +1

LabelBench: A Comprehensive Framework for Benchmarking Adaptive Label-Efficient Learning

1 code implementation • 16 Jun 2023 • Jifan Zhang, Yifang Chen, Gregory Canal, Stephen Mussmann, Arnav M. Das, Gantavya Bhatt, Yinglun Zhu, Jeffrey Bilmes, Simon Shaolei Du, Kevin Jamieson, Robert D. Nowak

Labeled data are critical to modern machine learning applications, but obtaining labels can be expensive.

Active Learning Benchmarking +1

Improved Active Multi-Task Representation Learning via Lasso

no code implementations • 5 Jun 2023 • Yiping Wang, Yifang Chen, Kevin Jamieson, Simon S. Du

In addition to our sample complexity results, we also characterize the potential of our $\nu^1$-based strategy in sample-cost-sensitive settings.

Representation Learning

Causal Bandits: Online Decision-Making in Endogenous Settings

no code implementations • 16 Nov 2022 • Jingwen Zhang, Yifang Chen, Amandeep Singh

To this end, in this paper, we consider the problem of online learning in linear stochastic contextual bandit problems with endogenous covariates.

Decision Making Multi-Armed Bandits

Improved Adaptive Algorithm for Scalable Active Learning with Weak Labeler

no code implementations • 4 Nov 2022 • Yifang Chen, Karthik Sankararaman, Alessandro Lazaric, Matteo Pirotta, Dmytro Karamshuk, Qifan Wang, Karishma Mandyam, Sinong Wang, Han Fang

We design a novel algorithmic template, Weak Labeler Active Cover (WL-AC), that is able to robustly leverage the lower quality weak labelers to reduce the query complexity while retaining the desired level of accuracy.

Active Learning

A Deep Bayesian Bandits Approach for Anticancer Therapy: Exploration via Functional Prior

no code implementations • 5 May 2022 • Mingyu Lu, Yifang Chen, Su-In Lee

Learning personalized cancer treatment with machine learning holds great promise to improve cancer patients' chance of survival.

BIG-bench Machine Learning Drug Response Prediction

Active Multi-Task Representation Learning

no code implementations • 2 Feb 2022 • Yifang Chen, Simon S. Du, Kevin Jamieson

To leverage the power of big data from source tasks and overcome the scarcity of the target task samples, representation learning based on multi-task pretraining has become a standard approach in many applications.

Active Learning Multi-Task Learning +1

Reward-Free RL is No Harder Than Reward-Aware RL in Linear Markov Decision Processes

no code implementations • 26 Jan 2022 • Andrew Wagenmaker, Yifang Chen, Max Simchowitz, Simon S. Du, Kevin Jamieson

We first develop a computationally efficient algorithm for reward-free RL in a $d$-dimensional linear MDP with sample complexity scaling as $\widetilde{\mathcal{O}}(d^2 H^5/\epsilon^2)$.

Reinforcement Learning (RL)

First-Order Regret in Reinforcement Learning with Linear Function Approximation: A Robust Estimation Approach

no code implementations • 7 Dec 2021 • Andrew Wagenmaker, Yifang Chen, Max Simchowitz, Simon S. Du, Kevin Jamieson

Obtaining first-order regret bounds -- regret bounds scaling not as the worst-case but with some measure of the performance of the optimal policy on a given instance -- is a core question in sequential decision-making.

Decision Making reinforcement-learning +1

Corruption Robust Active Learning

no code implementations • NeurIPS 2021 • Yifang Chen, Simon S. Du, Kevin Jamieson

We conduct theoretical studies on streaming-based active learning for binary classification under unknown adversarial label corruptions.

Active Learning Binary Classification

Improved Corruption Robust Algorithms for Episodic Reinforcement Learning

no code implementations • 13 Feb 2021 • Yifang Chen, Simon S. Du, Kevin Jamieson

We study episodic reinforcement learning under unknown adversarial corruptions in both the rewards and the transition probabilities of the underlying system.

reinforcement-learning Reinforcement Learning (RL)

More Practical and Adaptive Algorithms for Online Quantum State Learning

no code implementations • 1 Jun 2020 • Yifang Chen, Xin Wang

This regret bound depends only on the maximum rank $M$ of measurements rather than the number of qubits, which takes advantage of low-rank measurements.

Fair Contextual Multi-Armed Bandits: Theory and Experiments

no code implementations • 13 Dec 2019 • Yifang Chen, Alex Cuellar, Haipeng Luo, Jignesh Modi, Heramb Nemlekar, Stefanos Nikolaidis

We introduce a Multi-Armed Bandit algorithm with fairness constraints, where fairness is defined as a minimum rate that a task or a resource is assigned to a user.

Decision Making Fairness +1

Online and Bandit Algorithms for Nonstationary Stochastic Saddle-Point Optimization

no code implementations • 3 Dec 2019 • Abhishek Roy, Yifang Chen, Krishnakumar Balasubramanian, Prasant Mohapatra

We establish sub-linear regret bounds on the proposed notions of regret in both the online and bandit setting.

Multi-agent Reinforcement Learning

Multi-Armed Bandits with Fairness Constraints for Distributing Resources to Human Teammates

no code implementations • 30 Jun 2019 • Houston Claure, Yifang Chen, Jignesh Modi, Malte Jung, Stefanos Nikolaidis

How should a robot that collaborates with multiple people decide upon the distribution of resources (e.g., social attention or parts needed for an assembly)?

Fairness Multi-Armed Bandits

A New Algorithm for Non-stationary Contextual Bandits: Efficient, Optimal, and Parameter-free

no code implementations • 3 Feb 2019 • Yifang Chen, Chung-Wei Lee, Haipeng Luo, Chen-Yu Wei

We propose the first contextual bandit algorithm that is parameter-free, efficient, and optimal in terms of dynamic regret.

Multi-Armed Bandits
