Search Results for author: Yijie Peng

Found 18 papers, 4 papers with code

Forward Learning for Gradient-based Black-box Saliency Map Generation

no code implementations22 Mar 2024 Zeliang Zhang, Mingqian Feng, Jinyang Jiang, Rongyi Zhu, Yijie Peng, Chenliang Xu

Gradient-based saliency maps are widely used to explain deep neural network decisions.

Approximated Likelihood Ratio: A Forward-Only and Parallel Framework for Boosting Neural Network Training

no code implementations18 Mar 2024 Zeliang Zhang, Jinyang Jiang, Zhuo Liu, Susan Liang, Yijie Peng, Chenliang Xu

In this paper, we introduce an approximation technique for the likelihood ratio (LR) method to alleviate computational and memory demands in gradient estimation.

Deep Reinforcement Learning for Solving Management Problems: Towards A Large Management Mode

no code implementations1 Mar 2024 Jinyang Jiang, Xiaotian Liu, Tao Ren, Qinghao Wang, Yi Zheng, Yufu Du, Yijie Peng, Cheng Zhang

We introduce a deep reinforcement learning (DRL) approach for solving management problems including inventory management, dynamic pricing, and recommendation.

Decision Making Management

RiskMiner: Discovering Formulaic Alphas via Risk Seeking Monte Carlo Tree Search

no code implementations11 Feb 2024 Tao Ren, Ruihan Zhou, Jinyang Jiang, Jiafeng Liang, Qinghao Wang, Yijie Peng

Existing alpha mining only employs the neural network agent, unable to utilize the structural information of the solution space.

Sample-Efficient Clustering and Conquer Procedures for Parallel Large-Scale Ranking and Selection

no code implementations3 Feb 2024 Zishi Zhang, Yijie Peng

We propose novel "clustering and conquer" procedures for the parallel large-scale ranking and selection (R&S) problem, which leverage correlation information for clustering to break the bottleneck of sample efficiency.

Clustering Neural Architecture Search

AlphaRank: An Artificial Intelligence Approach for Ranking and Selection Problems

no code implementations1 Feb 2024 Ruihan Zhou, L. Jeff Hong, Yijie Peng

We introduce AlphaRank, an artificial intelligence approach to address the fixed-budget ranking and selection (R&S) problems.

One Forward is Enough for Neural Network Training via Likelihood Ratio Method

no code implementations15 May 2023 Jinyang Jiang, Zeliang Zhang, Chenliang Xu, Zhaofei Yu, Yijie Peng

While backpropagation (BP) is the mainstream approach for gradient computation in neural network training, its heavy reliance on the chain rule of differentiation constrains the designing flexibility of network architecture and training pipelines.

Efficient Learning for Selecting Top-m Context-Dependent Designs

no code implementations6 May 2023 Gongbo Zhang, Sihua Chen, Kuihua Huang, Yijie Peng

We consider a simulation optimization problem for a context-dependent decision-making, which aims to determine the top-m designs for all contexts.

Decision Making

A Novel Noise Injection-based Training Scheme for Better Model Robustness

no code implementations17 Feb 2023 Zeliang Zhang, Jinyang Jiang, Minjie Chen, Zhiyuan Wang, Yijie Peng, Zhaofei Yu

Noise injection-based method has been shown to be able to improve the robustness of artificial neural networks in previous work.

Adversarial Robustness Computational Efficiency

An Efficient Dynamic Sampling Policy For Monte Carlo Tree Search

no code implementations26 Apr 2022 Gongbo Zhang, Yijie Peng, Yilong Xu

We consider the popular tree-based search strategy within the framework of reinforcement learning, the Monte Carlo Tree Search (MCTS), in the context of finite-horizon Markov decision process.

reinforcement-learning Reinforcement Learning (RL)

Noise Optimization for Artificial Neural Networks

1 code implementation6 Feb 2021 Li Xiao, Zeliang Zhang, Yijie Peng

Adding noises to artificial neural network(ANN) has been shown to be able to improve robustness in previous work.

Context-dependent Ranking and Selection under a Bayesian Framework

no code implementations10 Dec 2020 Haidong Li, Henry Lam, Zhe Liang, Yijie Peng

We consider a context-dependent ranking and selection problem.

Methodology

Efficient Learning for Clustering and Optimizing Context-Dependent Designs

1 code implementation10 Dec 2020 Haidong Li, Henry Lam, Yijie Peng

We consider a simulation optimization problem for a context-dependent decision-making.

Decision Making Methodology

Training Artificial Neural Networks by Generalized Likelihood Ratio Method: Exploring Brain-like Learning to Improve Robustness

1 code implementation31 Jan 2019 Li Xiao, Yijie Peng, Jeff Hong, Zewu Ke, Shuhuai Yang

In this work, we propose a generalized likelihood ratio method capable of training the artificial neural networks with some biological brain-like mechanisms,. e. g., (a) learning by the loss value, (b) learning via neurons with discontinuous activation and loss functions.

General Classification

Ranking and Selection as Stochastic Control

no code implementations7 Oct 2017 Yijie Peng, Edwin K. P. Chong, Chun-Hung Chen, Michael C. Fu

Under a Bayesian framework, we formulate the fully sequential sampling and selection decision in statistical ranking and selection as a stochastic control problem, and derive the associated Bellman equation.

Cannot find the paper you are looking for? You can Submit a new open access paper.