no code implementations • 23 Apr 2024 • Yangchen Pan, Junfeng Wen, Chenjun Xiao, Philip Torr
In traditional statistical learning, data points are usually assumed to be independently and identically distributed (i. i. d.)
no code implementations • 12 Oct 2022 • Yi Sui, Junfeng Wen, Yenson Lau, Brendan Leigh Ross, Jesse C. Cresswell
In the traditional federated learning setting, a central server coordinates a network of clients to train one global model.
no code implementations • 17 Jun 2022 • Ramki Gummadi, Saurabh Kumar, Junfeng Wen, Dale Schuurmans
Approaches to policy optimization have been motivated from diverse principles, based on how the parametric model is interpreted (e. g. value versus policy representation) or how the learning objective is formulated, yet they share a common goal of maximizing expected return.
1 code implementation • 22 Nov 2021 • Shivam Kalra, Junfeng Wen, Jesse C. Cresswell, Maksims Volkovs, Hamid R. Tizhoosh
Institutions in highly regulated domains such as finance and healthcare often have restrictive rules around data sharing.
no code implementations • 13 Jun 2021 • Junfeng Wen, Saurabh Kumar, Ramki Gummadi, Dale Schuurmans
Actor-critic (AC) methods are ubiquitous in reinforcement learning.
1 code implementation • ICML 2020 • Junfeng Wen, Bo Dai, Lihong Li, Dale Schuurmans
We consider the problem of approximating the stationary distribution of an ergodic Markov chain given a set of sampled transitions.
no code implementations • ICLR 2019 • Chen Ma, Dylan R. Ashley, Junfeng Wen, Yoshua Bengio
Transfer in Reinforcement Learning (RL) refers to the idea of applying knowledge gained from previous tasks to solving related tasks.
no code implementations • ICML 2020 • Junfeng Wen, Russell Greiner, Dale Schuurmans
In many real-world applications, we want to exploit multiple source datasets of similar tasks to learn a model for a different but related target dataset -- e. g., recognizing characters of a new font using a set of different fonts.
no code implementations • 3 Dec 2018 • Junfeng Wen, Yanshuai Cao, Ruitong Huang
We demonstrate the superiority of our method to the previous ones in two different continual learning settings on popular benchmarks, as well as a new continual learning problem where tasks are designed to be more dissimilar.
no code implementations • 11 Apr 2018 • Chen Ma, Junfeng Wen, Yoshua Bengio
The objective of transfer reinforcement learning is to generalize from a set of previous tasks to unseen new tasks.
no code implementations • NeurIPS 2016 • Vignesh Ganapathiraman, Xinhua Zhang, Yao-Liang Yu, Junfeng Wen
Unsupervised learning of structured predictors has been a long standing pursuit in machine learning.