Search Results for author: Zhengyu Yang

Found 13 papers, 5 papers with code

Towards Applicable Reinforcement Learning: Improving the Generalization and Sample Efficiency with Policy Ensemble

no code implementations19 May 2022 Zhengyu Yang, Kan Ren, Xufang Luo, Minghuan Liu, Weiqing Liu, Jiang Bian, Weinan Zhang, Dongsheng Li

Considering the great performance of ensemble methods on both accuracy and generalization in supervised learning (SL), we design a robust and applicable method named Ensemble Proximal Policy Optimization (EPPO), which learns ensemble policies in an end-to-end manner.

reinforcement-learning Reinforcement Learning (RL)

Curriculum Offline Imitating Learning

no code implementations NeurIPS 2021 Minghuan Liu, Hanye Zhao, Zhengyu Yang, Jian Shen, Weinan Zhang, Li Zhao, Tie-Yan Liu

However, IL is usually limited in the capability of the behavioral policy and tends to learn a mediocre behavior from the dataset collected by the mixture of policies.

Continuous Control Imitation Learning +2

Curriculum Offline Imitation Learning

1 code implementation3 Nov 2021 Minghuan Liu, Hanye Zhao, Zhengyu Yang, Jian Shen, Weinan Zhang, Li Zhao, Tie-Yan Liu

However, IL is usually limited in the capability of the behavioral policy and tends to learn a mediocre behavior from the dataset collected by the mixture of policies.

Continuous Control Imitation Learning +2

SSFL: Tackling Label Deficiency in Federated Learning via Personalized Self-Supervision

no code implementations6 Oct 2021 Chaoyang He, Zhengyu Yang, Erum Mushtaq, Sunwoo Lee, Mahdi Soltanolkotabi, Salman Avestimehr

In this paper we propose self-supervised federated learning (SSFL), a unified self-supervised and personalized federated learning framework, and a series of algorithms under this framework which work towards addressing these challenges.

Personalized Federated Learning Self-Supervised Learning

Deep Ensemble Policy Learning

no code implementations29 Sep 2021 Zhengyu Yang, Kan Ren, Xufang Luo, Weiqing Liu, Jiang Bian, Weinan Zhang, Dongsheng Li

Ensemble learning, which can consistently improve the prediction performance in supervised learning, has drawn increasing attentions in reinforcement learning (RL).

Ensemble Learning Reinforcement Learning (RL)

SimMER: Simple Maximization of Entropy and Rank for Self-supervised Representation Learning

no code implementations29 Sep 2021 Zhengyu Yang, Zijian Hu, Xuefeng Hu, Ram Nevatia

With both entropy and rank maximization, our method surpasses the state-of-the-art on CIFAR-10 and Mini-ImageNet under the standard linear evaluation protocol.

Contrastive Learning Representation Learning +1

SimPLE: Similar Pseudo Label Exploitation for Semi-Supervised Classification

1 code implementation CVPR 2021 Zijian Hu, Zhengyu Yang, Xuefeng Hu, Ram Nevatia

Combining the Pair Loss with the techniques developed by the MixMatch family, our proposed SimPLE algorithm shows significant performance gains over previous algorithms on CIFAR-100 and Mini-ImageNet, and is on par with the state-of-the-art methods on CIFAR-10 and SVHN.

Classification General Classification +3

IKEA Furniture Assembly Environment for Long-Horizon Complex Manipulation Tasks

1 code implementation17 Nov 2019 Youngwoon Lee, Edward S. Hu, Zhengyu Yang, Alex Yin, Joseph J. Lim

The IKEA Furniture Assembly Environment is one of the first benchmarks for testing and accelerating the automation of complex manipulation tasks.

Industrial Robots reinforcement-learning +2

Deep Landscape Forecasting for Real-time Bidding Advertising

2 code implementations7 May 2019 Kan Ren, Jiarui Qin, Lei Zheng, Zhengyu Yang, Wei-Nan Zhang, Yong Yu

The problem is formulated as to forecast the probability distribution of market price for each ad auction.

Survival Analysis

Deep Recurrent Survival Analysis

1 code implementation7 Sep 2018 Kan Ren, Jiarui Qin, Lei Zheng, Zhengyu Yang, Wei-Nan Zhang, Lin Qiu, Yong Yu

By capturing the time dependency through modeling the conditional probability of the event for each sample, our method predicts the likelihood of the true event occurrence and estimates the survival rate over time, i. e., the probability of the non-occurrence of the event, for the censored data.

Survival Analysis

Geometric Semantic Genetic Programming Algorithm and Slump Prediction

no code implementations18 Sep 2017 Juncai Xu, Zhenzhong Shen, Qingwen Ren, Xin Xie, Zhengyu Yang

Research on the performance of recycled concrete as building material in the current world is an important subject.

Slope Stability Analysis with Geometric Semantic Genetic Programming

no code implementations30 Aug 2017 Juncai Xu, Zhenzhong Shen, Qingwen Ren, Xin Xie, Zhengyu Yang

Furthermore, a model for slope stability analysis is established on the basis of geometric semantics.

General Classification regression

Cannot find the paper you are looking for? You can Submit a new open access paper.