Search Results for author: Junfeng Wen

Found 10 papers, 2 papers with code

Convex Two-Layer Modeling with Latent Structure

no code implementations • NeurIPS 2016 • Vignesh Ganapathiraman, Xinhua Zhang, Yao-Liang Yu, Junfeng Wen

Unsupervised learning of structured predictors has been a long standing pursuit in machine learning.

Graph Matching Vocal Bursts Valence Prediction

Paper
Add Code

Universal Successor Representations for Transfer Reinforcement Learning

no code implementations • 11 Apr 2018 • Chen Ma, Junfeng Wen, Yoshua Bengio

The objective of transfer reinforcement learning is to generalize from a set of previous tasks to unseen new tasks.

reinforcement-learning Reinforcement Learning (RL) +1

Paper
Add Code

Few-Shot Self Reminder to Overcome Catastrophic Forgetting

no code implementations • 3 Dec 2018 • Junfeng Wen, Yanshuai Cao, Ruitong Huang

We demonstrate the superiority of our method to the previous ones in two different continual learning settings on popular benchmarks, as well as a new continual learning problem where tasks are designed to be more dissimilar.

Continual Learning

Paper
Add Code

Domain Aggregation Networks for Multi-Source Domain Adaptation

no code implementations • ICML 2020 • Junfeng Wen, Russell Greiner, Dale Schuurmans

In many real-world applications, we want to exploit multiple source datasets of similar tasks to learn a model for a different but related target dataset -- e. g., recognizing characters of a new font using a set of different fonts.

Domain Adaptation Sentiment Analysis

Paper
Add Code

Universal Successor Features for Transfer Reinforcement Learning

no code implementations • ICLR 2019 • Chen Ma, Dylan R. Ashley, Junfeng Wen, Yoshua Bengio

Transfer in Reinforcement Learning (RL) refers to the idea of applying knowledge gained from previous tasks to solving related tasks.

reinforcement-learning Reinforcement Learning (RL) +1

Paper
Add Code

Batch Stationary Distribution Estimation

1 code implementation • ICML 2020 • Junfeng Wen, Bo Dai, Lihong Li, Dale Schuurmans

We consider the problem of approximating the stationary distribution of an ergodic Markov chain given a set of sampled transitions.

Off-policy evaluation

Paper
Code

Characterizing the Gap Between Actor-Critic and Policy Gradient

no code implementations • 13 Jun 2021 • Junfeng Wen, Saurabh Kumar, Ramki Gummadi, Dale Schuurmans

Actor-critic (AC) methods are ubiquitous in reinforcement learning.

Paper
Add Code

Decentralized Federated Learning through Proxy Model Sharing

1 code implementation • 22 Nov 2021 • Shivam Kalra, Junfeng Wen, Jesse C. Cresswell, Maksims Volkovs, Hamid R. Tizhoosh

Institutions in highly regulated domains such as finance and healthcare often have restrictive rules around data sharing.

Federated Learning whole slide images

Paper
Code

A Parametric Class of Approximate Gradient Updates for Policy Optimization

no code implementations • 17 Jun 2022 • Ramki Gummadi, Saurabh Kumar, Junfeng Wen, Dale Schuurmans

Approaches to policy optimization have been motivated from diverse principles, based on how the parametric model is interpreted (e. g. value versus policy representation) or how the learning objective is formulated, yet they share a common goal of maximizing expected return.

Paper
Add Code

Find Your Friends: Personalized Federated Learning with the Right Collaborators

no code implementations • 12 Oct 2022 • Yi Sui, Junfeng Wen, Yenson Lau, Brendan Leigh Ross, Jesse C. Cresswell

In the traditional federated learning setting, a central server coordinates a network of clients to train one global model.

Personalized Federated Learning

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.