Search Results for author: Hao Tian

Found 29 papers, 11 papers with code

Reinforcement Learning with Evolutionary Trajectory Generator: A General Approach for Quadrupedal Locomotion

1 code implementation • 14 Sep 2021 • Haojie Shi, Bo Zhou, Hongsheng Zeng, Fan Wang, Yueqiang Dong, Jiangyong Li, Kang Wang, Hao Tian, Max Q.-H. Meng

However, due to the complex nonlinear dynamics in quadrupedal robots and reward sparsity, it is still difficult for RL to learn effective gaits from scratch, especially in challenging tasks such as walking over the balance beam.

ADER: Adapting between Exploration and Robustness for Actor-Critic Methods

no code implementations • 8 Sep 2021 • Bo Zhou, Kejiao Li, Hongsheng Zeng, Fan Wang, Hao Tian

Combining off-policy reinforcement learning methods with function approximators such as neural networks has been found to lead to overestimation of the value function and sub-optimal solutions.

Continuous Control

Do What Nature Did To Us: Evolving Plastic Recurrent Neural Networks For Task Generalization

1 code implementation • 8 Sep 2021 • Fan Wang, Hao Tian, Haoyi Xiong, Hua Wu, Yang Cao, Yu Kang, Haifeng Wang

While artificial neural networks (ANNs) have been widely adopted in machine learning, researchers are increasingly obsessed by the gaps between ANNs and biological neural networks (BNNs).

Meta-Learning

Action Set Based Policy Optimization for Safe Power Grid Management

no code implementations • 29 Jun 2021 • Bo Zhou, Hongsheng Zeng, Yuecheng Liu, Kejiao Li, Fan Wang, Hao Tian

At the planning stage, the search space is limited to the action set produced by the policy.

Decision Making

ERNIE-Tiny: A Progressive Distillation Framework for Pretrained Transformer Compression

no code implementations • 4 Jun 2021 • Weiyue Su, Xuyi Chen, Shikun Feng, Jiaxiang Liu, Weixin Liu, Yu Sun, Hao Tian, Hua Wu, Haifeng Wang

Specifically, the first stage, General Distillation, performs distillation with guidance from a pretrained teacher, general data, and a latent distillation loss.

Knowledge Distillation

The Flare and Warp of the Young Stellar Disk traced with LAMOST DR5 OB-type stars

no code implementations • 1 Feb 2021 • Yang Yu, Hai-Feng Wang, Wen-Yuan Cui, Lin-Lin Li, Chao Liu, Bo Zhang, Hao Tian, Zhen-Yan Huo, Jie Ju, Zhi-Cun Liu, Fang Wen, Shuai Feng

We present an analysis of the spatial density structure of the outer disk from 8 to 14 kpc using 13534 OB-type stars from LAMOST DR5. We observe similar flaring on the north and south sides of the disk, implying that the flaring structure is symmetrical about the Galactic plane, with the scale height at different Galactocentric distances ranging from 0.14 to 0.5 kpc.

Astrophysics of Galaxies

Exploring the Galactic Anticenter substructure with LAMOST & Gaia DR2

no code implementations • 7 Jan 2021 • Jing Li, Xiang-Xiang Xue, Chao Liu, Bo Zhang, Hans-Walter Rix, Jeffrey L. Carlin, Chengqun Yang, Rene A. Mendez, Jing Zhong, Hao Tian, Lan Zhang, Yan Xu, Yaqian Wu, Gang Zhao, Ruixiang Chang

Their location in [$\alpha$/M] vs. [M/H] space is more metal-poor than typical thin disk stars, with [$\alpha$/M] lower than the thick disk.

Astrophysics of Galaxies

ERNIE-M: Enhanced Multilingual Representation by Aligning Cross-lingual Semantics with Monolingual Corpora

1 code implementation • 31 Dec 2020 • Xuan Ouyang, Shuohuan Wang, Chao Pang, Yu Sun, Hao Tian, Hua Wu, Haifeng Wang

In this paper, we propose ERNIE-M, a new training method that encourages the model to align the representation of multiple languages with monolingual corpora, to overcome the constraint that the parallel corpus size places on the model performance.

Translation

Unsupervised Object Detection with LiDAR Clues

no code implementations • CVPR 2021 • Hao Tian, Yuntao Chen, Jifeng Dai, Zhaoxiang Zhang, Xizhou Zhu

We further identify another major issue, seldom noticed by the community, that the long-tailed and open-ended (sub-)category distribution should be accommodated.

Object Detection

SKEP: Sentiment Knowledge Enhanced Pre-training for Sentiment Analysis

2 code implementations • ACL 2020 • Hao Tian, Can Gao, Xinyan Xiao, Hao Liu, Bolei He, Hua Wu, Haifeng Wang, Feng Wu

In particular, the prediction of aspect-sentiment pairs is converted into multi-label classification, aiming to capture the dependency between words in a pair.

Multi-Label Classification, Sentiment Analysis

ivis Dimensionality Reduction Framework for Biomacromolecular Simulations

1 code implementation • 22 Apr 2020 • Hao Tian, Peng Tao

Molecular dynamics (MD) simulations have been widely applied to study macromolecules including proteins.

Dimensionality Reduction

ERNIE-GEN: An Enhanced Multi-Flow Pre-training and Fine-tuning Framework for Natural Language Generation

3 code implementations • 26 Jan 2020 • Dongling Xiao, Han Zhang, Yukun Li, Yu Sun, Hao Tian, Hua Wu, Haifeng Wang

Current pre-training works in natural language generation pay little attention to the problem of exposure bias on downstream tasks.

 Ranked #1 on Generative Question Answering on CoQA (using extra training data)

Abstractive Text Summarization, Dialogue Generation, +2

Efficient and Robust Reinforcement Learning with Uncertainty-based Value Expansion

no code implementations • 10 Dec 2019 • Bo Zhou, Hongsheng Zeng, Fan Wang, Yunxiang Li, Hao Tian

By integrating dynamics models into model-free reinforcement learning (RL) methods, model-based value expansion (MVE) algorithms have shown a significant advantage in sample efficiency as well as value estimation.

Learning to Recommend via Meta Parameter Partition

no code implementations • 4 Dec 2019 • Liang Zhao, Yang Wang, Daxiang Dong, Hao Tian

The fixed part, which captures user-invariant features, is shared by all users and is learned during the offline meta-learning stage.

Meta-Learning

MBCAL: Sample Efficient and Variance Reduced Reinforcement Learning for Recommender Systems

no code implementations • 6 Nov 2019 • Fan Wang, Xiaomin Fang, Lihang Liu, Hao Tian, Zhiming Peng

The proposed method takes advantage of the characteristics of recommender systems and draws ideas from the model-based reinforcement learning method for higher sample efficiency.

Model-based Reinforcement Learning, Recommendation Systems

ERNIE 2.0: A Continual Pre-training Framework for Language Understanding

3 code implementations • 29 Jul 2019 • Yu Sun, Shuohuan Wang, Yukun Li, Shikun Feng, Hao Tian, Hua Wu, Haifeng Wang

Recently, pre-trained models have achieved state-of-the-art results in various language understanding tasks, which indicates that pre-training on large-scale corpora may play a crucial role in natural language processing.

Chinese Named Entity Recognition, Chinese Reading Comprehension, +8

Sequential Evaluation and Generation Framework for Combinatorial Recommender System

no code implementations • 1 Feb 2019 • Fan Wang, Xiaomin Fang, Lihang Liu, Yaxue Chen, Jiucheng Tao, Zhiming Peng, Cihang Jin, Hao Tian

On one side of this framework, an evaluation model is trained to estimate the expected overall utility by fully considering the user and item information and the correlations among the co-exposed items.

Recommendation Systems

Transferring Grasp Configurations using Active Learning and Local Replanning

no code implementations • 22 Jul 2018 • Hao Tian, Changbo Wang, Dinesh Manocha, Xin-Yu Zhang

We compute a grasp space for each part of the example object using active learning.

Robotics
