Search Results for author: Zhengyu Yang

Found 13 papers, 5 papers with code

Towards Applicable Reinforcement Learning: Improving the Generalization and Sample Efficiency with Policy Ensemble

no code implementations • 19 May 2022 • Zhengyu Yang, Kan Ren, Xufang Luo, Minghuan Liu, Weiqing Liu, Jiang Bian, Weinan Zhang, Dongsheng Li

Considering the great performance of ensemble methods on both accuracy and generalization in supervised learning (SL), we design a robust and applicable method named Ensemble Proximal Policy Optimization (EPPO), which learns ensemble policies in an end-to-end manner.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

Curriculum Offline Imitating Learning

no code implementations • NeurIPS 2021 • Minghuan Liu, Hanye Zhao, Zhengyu Yang, Jian Shen, Weinan Zhang, Li Zhao, Tie-Yan Liu

However, IL is usually limited in the capability of the behavioral policy and tends to learn a mediocre behavior from the dataset collected by the mixture of policies.

Continuous Control Imitation Learning +2

Paper
Add Code

Curriculum Offline Imitation Learning

1 code implementation • 3 Nov 2021 • Minghuan Liu, Hanye Zhao, Zhengyu Yang, Jian Shen, Weinan Zhang, Li Zhao, Tie-Yan Liu

However, IL is usually limited in the capability of the behavioral policy and tends to learn a mediocre behavior from the dataset collected by the mixture of policies.

Continuous Control Imitation Learning +2

Paper
Code

SSFL: Tackling Label Deficiency in Federated Learning via Personalized Self-Supervision

no code implementations • 6 Oct 2021 • Chaoyang He, Zhengyu Yang, Erum Mushtaq, Sunwoo Lee, Mahdi Soltanolkotabi, Salman Avestimehr

In this paper we propose self-supervised federated learning (SSFL), a unified self-supervised and personalized federated learning framework, and a series of algorithms under this framework which work towards addressing these challenges.

Personalized Federated Learning Self-Supervised Learning

Paper
Add Code

Deep Ensemble Policy Learning

no code implementations • 29 Sep 2021 • Zhengyu Yang, Kan Ren, Xufang Luo, Weiqing Liu, Jiang Bian, Weinan Zhang, Dongsheng Li

Ensemble learning, which can consistently improve the prediction performance in supervised learning, has drawn increasing attentions in reinforcement learning (RL).

Ensemble Learning Reinforcement Learning (RL)

Paper
Add Code

SimMER: Simple Maximization of Entropy and Rank for Self-supervised Representation Learning

no code implementations • 29 Sep 2021 • Zhengyu Yang, Zijian Hu, Xuefeng Hu, Ram Nevatia

With both entropy and rank maximization, our method surpasses the state-of-the-art on CIFAR-10 and Mini-ImageNet under the standard linear evaluation protocol.

Contrastive Learning Representation Learning +1

Paper
Add Code

SimPLE: Similar Pseudo Label Exploitation for Semi-Supervised Classification

1 code implementation • CVPR 2021 • Zijian Hu, Zhengyu Yang, Xuefeng Hu, Ram Nevatia

Combining the Pair Loss with the techniques developed by the MixMatch family, our proposed SimPLE algorithm shows significant performance gains over previous algorithms on CIFAR-100 and Mini-ImageNet, and is on par with the state-of-the-art methods on CIFAR-10 and SVHN.

Ranked #1 on Semi-Supervised Image Classification on Mini-ImageNet, 4000 Labels

Classification General Classification +3

Paper
Code

To Follow or not to Follow: Selective Imitation Learning from Observations

no code implementations • 16 Dec 2019 • Youngwoon Lee, Edward S. Hu, Zhengyu Yang, Joseph J. Lim

Learning from demonstrations is a useful way to transfer a skill from one agent to another.

Imitation Learning

Paper
Add Code

IKEA Furniture Assembly Environment for Long-Horizon Complex Manipulation Tasks

1 code implementation • 17 Nov 2019 • Youngwoon Lee, Edward S. Hu, Zhengyu Yang, Alex Yin, Joseph J. Lim

The IKEA Furniture Assembly Environment is one of the first benchmarks for testing and accelerating the automation of complex manipulation tasks.

Industrial Robots reinforcement-learning +2

481

Paper
Code

Deep Landscape Forecasting for Real-time Bidding Advertising

2 code implementations • 7 May 2019 • Kan Ren, Jiarui Qin, Lei Zheng, Zhengyu Yang, Wei-Nan Zhang, Yong Yu

The problem is formulated as to forecast the probability distribution of market price for each ad auction.

Survival Analysis

Paper
Code

Deep Recurrent Survival Analysis

1 code implementation • 7 Sep 2018 • Kan Ren, Jiarui Qin, Lei Zheng, Zhengyu Yang, Wei-Nan Zhang, Lin Qiu, Yong Yu

By capturing the time dependency through modeling the conditional probability of the event for each sample, our method predicts the likelihood of the true event occurrence and estimates the survival rate over time, i. e., the probability of the non-occurrence of the event, for the censored data.

Survival Analysis

134

Paper
Code

Geometric Semantic Genetic Programming Algorithm and Slump Prediction

no code implementations • 18 Sep 2017 • Juncai Xu, Zhenzhong Shen, Qingwen Ren, Xin Xie, Zhengyu Yang

Research on the performance of recycled concrete as building material in the current world is an important subject.

Paper
Add Code

Slope Stability Analysis with Geometric Semantic Genetic Programming

no code implementations • 30 Aug 2017 • Juncai Xu, Zhenzhong Shen, Qingwen Ren, Xin Xie, Zhengyu Yang

Furthermore, a model for slope stability analysis is established on the basis of geometric semantics.

General Classification regression

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.