Search Results for author: Pengfei Wei

Found 25 papers, 7 papers with code

Subdomain Adaptation with Manifolds Discrepancy Alignment

no code implementations6 May 2020 Pengfei Wei, Yiping Ke, Xinghua Qu, Tze-Yun Leong

Specifically, we propose to use low-dimensional manifold to represent subdomain, and align the local data distribution discrepancy in each manifold across domains.

Subdomain adaptation Transfer Learning

Hierarchical Reinforcement Learning in StarCraft II with Human Expertise in Subgoals Selection

no code implementations8 Aug 2020 Xinyi Xu, Tiancheng Huang, Pengfei Wei, Akshay Narayan, Tze-Yun Leong

This work is inspired by recent advances in hierarchical reinforcement learning (HRL) (Barto and Mahadevan 2003; Hengst 2010), and improvements in learning efficiency from heuristic-based subgoal selection, experience replay (Lin 1993; Andrychowicz et al. 2017), and task-based curriculum learning (Bengio et al. 2009; Zaremba and Sutskever 2014).

Decision Making Hierarchical Reinforcement Learning +4

MESA: Boost Ensemble Imbalanced Learning with MEta-SAmpler

2 code implementations NeurIPS 2020 Zhining Liu, Pengfei Wei, Jing Jiang, Wei Cao, Jiang Bian, Yi Chang

This makes MESA generally applicable to most of the existing learning models and the meta-sampler can be efficiently applied to new tasks.

imbalanced classification Meta-Learning

An Improved Transfer Model: Randomized Transferable Machine

no code implementations27 Nov 2020 Pengfei Wei, Xinghua Qu, Yew Soon Ong, Zejun Ma

Existing studies usually assume that the learned new feature representation is \emph{domain-invariant}, and thus train a transfer model $\mathcal{M}$ on the source domain.

Transfer Learning

Learning Disentangled Semantic Representation for Domain Adaptation

1 code implementation22 Dec 2020 Ruichu Cai, Zijian Li, Pengfei Wei, Jie Qiao, Kun Zhang, Zhifeng Hao

Different from previous efforts on the entangled feature space, we aim to extract the domain invariant semantic information in the latent disentangled semantic representation (DSR) of the data.

Domain Adaptation

Joint Intent Detection and Slot Filling with Wheel-Graph Attention Networks

no code implementations9 Feb 2021 Pengfei Wei, Bi Zeng, Wenxiong Liao

In this paper, we propose a new joint model with a wheel-graph attention network (Wheel-GAT) which is able to model interrelated connections directly for intent detection and slot filling.

Graph Attention Intent Detection +3

Adaptive Multi-Source Causal Inference

no code implementations31 May 2021 Thanh Vinh Vo, Pengfei Wei, Trong Nghia Hoang, Tze-Yun Leong

The proposed method can infer causal effects in the target population without prior knowledge of data discrepancy between the additional data sources and the target.

Causal Inference Transfer Learning

Graph Domain Adaptation: A Generative View

no code implementations14 Jun 2021 Ruichu Cai, Fengzhu Wu, Zijian Li, Pengfei Wei, Lingling Yi, Kun Zhang

Based on this assumption, we propose a disentanglement-based unsupervised domain adaptation method for the graph-structured data, which applies variational graph auto-encoders to recover these latent variables and disentangles them via three supervised learning modules.

Disentanglement Graph Classification +2

Synthesising Audio Adversarial Examples for Automatic Speech Recognition

no code implementations29 Sep 2021 Xinghua Qu, Pengfei Wei, Mingyong Gao, Zhu Sun, Yew-Soon Ong, Zejun Ma

Adversarial examples in automatic speech recognition (ASR) are naturally sounded by humans yet capable of fooling well trained ASR models to transcribe incorrectly.

Audio Synthesis Automatic Speech Recognition +2

Uncertainty Regularized Policy Learning for Offline Reinforcement Learning

no code implementations29 Sep 2021 Han Zheng, Jing Jiang, Pengfei Wei, Guodong Long, Xuan Song, Chengqi Zhang

URPL adds an uncertainty regularization term in the policy learning objective to enforce to learn a more stable policy under the offline setting.

D4RL Offline RL +2

Adaptive Q-learning for Interaction-Limited Reinforcement Learning

no code implementations29 Sep 2021 Han Zheng, Xufang Luo, Pengfei Wei, Xuan Song, Dongsheng Li, Jing Jiang

Specifically, we explicitly consider the difference between the online and offline data and apply an adaptive update scheme accordingly, i. e., a pessimistic update strategy for the offline dataset and a greedy or no pessimistic update scheme for the online dataset.

Offline RL Q-Learning +2

Language Adaptive Cross-lingual Speech Representation Learning with Sparse Sharing Sub-networks

no code implementations9 Mar 2022 Yizhou Lu, Mingkun Huang, Xinghua Qu, Pengfei Wei, Zejun Ma

It makes room for language specific modeling by pruning out unimportant parameters for each language, without requiring any manually designed language specific component.

Representation Learning speech-recognition +1

Unsupervised Video Domain Adaptation for Action Recognition: A Disentanglement Perspective

1 code implementation NeurIPS 2023 Pengfei Wei, Lingdong Kong, Xinghua Qu, Yi Ren, Zhiqiang Xu, Jing Jiang, Xiang Yin

Specifically, we consider the generation of cross-domain videos from two sets of latent factors, one encoding the static information and another encoding the dynamic information.

Action Recognition Disentanglement +1

Importance Prioritized Policy Distillation

1 code implementation KDD 2022 Xinghua Qu, Yew-Soon Ong, Abhishek Gupta, Pengfei Wei, Zhu Sun, Zejun Ma

Given such an issue, we denote the \emph{frame importance} as its contribution to the expected reward on a particular frame, and hypothesize that adapting such frame importance could benefit the performance of the distilled student policy.

Atari Games Decision Making +1

Multi-objective optimization via evolutionary algorithm (MOVEA) for high-definition transcranial electrical stimulation of the human brain

1 code implementation10 Nov 2022 Mo Wang, Kexin Lou, Zeming Liu, Pengfei Wei, Quanying Liu

In this paper, we propose a general framework called multi-objective optimization via evolutionary algorithms (MOVEA) to address the non-convex optimization problem in designing TES strategies without predefined direction.

Evolutionary Algorithms

Virtual Try-On with Pose-Garment Keypoints Guided Inpainting

1 code implementation ICCV 2023 Zhi Li, Pengfei Wei, Xiang Yin, Zejun Ma, Alex C. Kot

In our method, human pose and garment keypoints are extracted from source images and constructed as graphs to predict the garment keypoints at the target pose.

Virtual Try-on

Adaptive Policy Learning for Offline-to-Online Reinforcement Learning

no code implementations14 Mar 2023 Han Zheng, Xufang Luo, Pengfei Wei, Xuan Song, Dongsheng Li, Jing Jiang

In this paper, we consider an offline-to-online setting where the agent is first learned from the offline dataset and then trained online, and propose a framework called Adaptive Policy Learning for effectively taking advantage of offline and online data.

Continuous Control Offline RL +2

Mega-TTS 2: Boosting Prompting Mechanisms for Zero-Shot Speech Synthesis

no code implementations14 Jul 2023 Ziyue Jiang, Jinglin Liu, Yi Ren, Jinzheng He, Zhenhui Ye, Shengpeng Ji, Qian Yang, Chen Zhang, Pengfei Wei, Chunfeng Wang, Xiang Yin, Zejun Ma, Zhou Zhao

However, the prompting mechanisms of zero-shot TTS still face challenges in the following aspects: 1) previous works of zero-shot TTS are typically trained with single-sentence prompts, which significantly restricts their performance when the data is relatively sufficient during the inference stage.

In-Context Learning Language Modelling +3

SR-R$^2$KAC: Improving Single Image Defocus Deblurring

no code implementations30 Jul 2023 Peng Tang, Zhiqiang Xu, Pengfei Wei, Xiaobin Hu, Peilin Zhao, Xin Cao, Chunlai Zhou, Tobias Lasser

To further alleviate the contingent effect of recursive stacking, i. e., ringing artifacts, we add identity shortcuts between atrous convolutions to simulate residual deconvolutions.

Deblurring Image Defocus Deblurring

Cannot find the paper you are looking for? You can Submit a new open access paper.