Search Results for author: Junfeng Wen

Found 10 papers, 2 papers with code

Universal Successor Representations for Transfer Reinforcement Learning

no code implementations11 Apr 2018 Chen Ma, Junfeng Wen, Yoshua Bengio

The objective of transfer reinforcement learning is to generalize from a set of previous tasks to unseen new tasks.

reinforcement-learning Reinforcement Learning (RL) +1

Few-Shot Self Reminder to Overcome Catastrophic Forgetting

no code implementations3 Dec 2018 Junfeng Wen, Yanshuai Cao, Ruitong Huang

We demonstrate the superiority of our method to the previous ones in two different continual learning settings on popular benchmarks, as well as a new continual learning problem where tasks are designed to be more dissimilar.

Continual Learning

Domain Aggregation Networks for Multi-Source Domain Adaptation

no code implementations ICML 2020 Junfeng Wen, Russell Greiner, Dale Schuurmans

In many real-world applications, we want to exploit multiple source datasets of similar tasks to learn a model for a different but related target dataset -- e. g., recognizing characters of a new font using a set of different fonts.

Domain Adaptation Sentiment Analysis

Universal Successor Features for Transfer Reinforcement Learning

no code implementations ICLR 2019 Chen Ma, Dylan R. Ashley, Junfeng Wen, Yoshua Bengio

Transfer in Reinforcement Learning (RL) refers to the idea of applying knowledge gained from previous tasks to solving related tasks.

reinforcement-learning Reinforcement Learning (RL) +1

Batch Stationary Distribution Estimation

1 code implementation ICML 2020 Junfeng Wen, Bo Dai, Lihong Li, Dale Schuurmans

We consider the problem of approximating the stationary distribution of an ergodic Markov chain given a set of sampled transitions.

Off-policy evaluation

Decentralized Federated Learning through Proxy Model Sharing

1 code implementation22 Nov 2021 Shivam Kalra, Junfeng Wen, Jesse C. Cresswell, Maksims Volkovs, Hamid R. Tizhoosh

Institutions in highly regulated domains such as finance and healthcare often have restrictive rules around data sharing.

Federated Learning whole slide images

A Parametric Class of Approximate Gradient Updates for Policy Optimization

no code implementations17 Jun 2022 Ramki Gummadi, Saurabh Kumar, Junfeng Wen, Dale Schuurmans

Approaches to policy optimization have been motivated from diverse principles, based on how the parametric model is interpreted (e. g. value versus policy representation) or how the learning objective is formulated, yet they share a common goal of maximizing expected return.

Find Your Friends: Personalized Federated Learning with the Right Collaborators

no code implementations12 Oct 2022 Yi Sui, Junfeng Wen, Yenson Lau, Brendan Leigh Ross, Jesse C. Cresswell

In the traditional federated learning setting, a central server coordinates a network of clients to train one global model.

Personalized Federated Learning

Cannot find the paper you are looking for? You can Submit a new open access paper.