Search Results for author: Dawei Feng

Found 11 papers, 1 papers with code

Uncertainty-Penalized Reinforcement Learning from Human Feedback with Diverse Reward LoRA Ensembles

no code implementations30 Dec 2023 Yuanzhao Zhai, Han Zhang, Yu Lei, Yue Yu, Kele Xu, Dawei Feng, Bo Ding, Huaimin Wang

Reinforcement learning from human feedback (RLHF) emerges as a promising paradigm for aligning large language models (LLMs).

Uncertainty Quantification

Dynamic Memory-based Curiosity: A Bootstrap Approach for Exploration

no code implementations24 Aug 2022 Zijian Gao, Yiying Li, Kele Xu, Yuanzhao Zhai, Dawei Feng, Bo Ding, XinJun Mao, Huaimin Wang

The curiosity arouses if memorized information can not deal with the current state, and the information gap between dual learners can be formulated as the intrinsic reward for agents, and then such state information can be consolidated into the dynamic memory.

Reinforcement Learning (RL)

Self-Supervised Exploration via Temporal Inconsistency in Reinforcement Learning

no code implementations24 Aug 2022 Zijian Gao, Kele Xu, Yuanzhao Zhai, Dawei Feng, Bo Ding, XinJun Mao, Huaimin Wang

Our method involves training a self-supervised prediction model, saving snapshots of the model parameters, and using nuclear norm to evaluate the temporal inconsistency between the predictions of different snapshots as intrinsic rewards.

reinforcement-learning Reinforcement Learning (RL)

Nuclear Norm Maximization Based Curiosity-Driven Learning

no code implementations21 May 2022 Chao Chen, Zijian Gao, Kele Xu, Sen yang, Yiying Li, Bo Ding, Dawei Feng, Huaimin Wang

To handle the sparsity of the extrinsic rewards in reinforcement learning, researchers have proposed intrinsic reward which enables the agent to learn the skills that might come in handy for pursuing the rewards in the future, such as encouraging the agent to visit novel states.

Atari Games

FINT: Field-aware INTeraction Neural Network For CTR Prediction

1 code implementation5 Jul 2021 Zhishan Zhao, Sen yang, Guohui Liu, Dawei Feng, Kele Xu

As a critical component for online advertising and marking, click-through rate (CTR) prediction has draw lots of attentions from both industry and academia field.

Click-Through Rate Prediction

Exploring Pre-trained Language Models for Event Extraction and Generation

no code implementations ACL 2019 Sen Yang, Dawei Feng, Linbo Qiao, Zhigang Kan, Dongsheng Li

Traditional approaches to the task of ACE event extraction usually depend on manually annotated data, which is often laborious to create and limited in size.

Event Extraction General Classification

Collaborative Deep Learning Across Multiple Data Centers

no code implementations16 Oct 2018 Kele Xu, Haibo Mi, Dawei Feng, Huaimin Wang, Chuan Chen, Zibin Zheng, Xu Lan

Valuable training data is often owned by independent organizations and located in multiple data centers.

Sample Dropout for Audio Scene Classification Using Multi-Scale Dense Connected Convolutional Neural Network

no code implementations12 Jun 2018 Dawei Feng, Kele Xu, Haibo Mi, Feifan Liao, Yan Zhou

In this paper, we explore the use of multi-scale Dense connected convolutional neural network (DenseNet) for the classification task, with the goal to improve the classification performance as multi-scale features can be extracted from the time-frequency representation of the audio signal.

Acoustic Scene Classification Classification +3

Mixup-Based Acoustic Scene Classification Using Multi-Channel Convolutional Neural Network

no code implementations18 May 2018 Kele Xu, Dawei Feng, Haibo Mi, Boqing Zhu, Dezhi Wang, Lilun Zhang, Hengxing Cai, Shuwen Liu

Audio scene classification, the problem of predicting class labels of audio scenes, has drawn lots of attention during the last several years.

Acoustic Scene Classification Classification +2

Cannot find the paper you are looking for? You can Submit a new open access paper.