Search Results for author: Pengfei Li

Found 20 papers, 6 papers with code

Expert-Calibrated Learning for Online Optimization with Switching Costs

no code implementations18 Apr 2022 Pengfei Li, Jianyi Yang, Shaolei Ren

Nonetheless, by using the standard practice of training an ML model as a standalone optimizer and plugging it into an ML-augmented algorithm, the average cost performance can be highly unsatisfactory.

CUP: A Conservative Update Policy Algorithm for Safe Reinforcement Learning

1 code implementation15 Feb 2022 Long Yang, Jiaming Ji, Juntao Dai, Yu Zhang, Pengfei Li, Gang Pan

Although using bounds as surrogate functions to design safe RL algorithms have appeared in some existing works, we develop them at least three aspects: (i) We provide a rigorous theoretical analysis to extend the surrogate functions to generalized advantage estimator (GAE).

reinforcement-learning Safe Exploration +1

Semi-supervised Implicit Scene Completion from Sparse LiDAR

1 code implementation29 Nov 2021 Pengfei Li, Yongliang Shi, Tianyu Liu, Hao Zhao, Guyue Zhou, Ya-Qin Zhang

Recent advances show that semi-supervised implicit representation learning can be achieved through physical constraints like Eikonal equations.

Representation Learning

Rapid Assessments of Light-Duty Gasoline Vehicle Emissions Using On-Road Remote Sensing and Machine Learning

no code implementations1 Oct 2021 Yan Xia, Linhui Jiang, Lu Wang, Xue Chen, Jianjie Ye, Tangyan Hou, Liqiang Wang, Yibo Zhang, Mengying Li, Zhen Li, Zhe Song, Yaping Jiang, Weiping Liu, Pengfei Li, Daniel Rosenfeld, John H. Seinfeld, Shaocai Yu

Our results show that the ORRS measurements, assisted by the machine-learning-based ensemble model developed here, can realize day-to-day supervision of on-road vehicle-specific emissions.

1st Place Solution to ICDAR 2021 RRC-ICTEXT End-to-end Text Spotting and Aesthetic Assessment on Integrated Circuit

no code implementations8 Apr 2021 Qiyao Wang, Pengfei Li, Li Zhu, Yi Niu

For the text spotting task, we detect the characters on integrated circuit and classify them based on yolov5 detection model.

Text Spotting

A cautionary tale in fitting galaxy rotation curves with Bayesian techniques: does Newton's constant vary from galaxy to galaxy?

no code implementations27 Jan 2021 Pengfei Li, Federico Lelli, Stacy McGaugh, James Schombert, Kyu-Hyun Chae

The application of Bayesian techniques to astronomical data is generally non-trivial because the fitting parameters can be strongly degenerated and the formal uncertainties are themselves uncertain.

Astrophysics of Galaxies Cosmology and Nongalactic Astrophysics Instrumentation and Methods for Astrophysics

Towards Reducing Severe Defocus Spread Effects for Multi-Focus Image Fusion via an Optimization Based Strategy

1 code implementation29 Dec 2020 Shuang Xu, Lizhen Ji, Zhe Wang, Pengfei Li, Kai Sun, Chunxia Zhang, Jiangshe Zhang

According to the idea that each local region in the fused image should be similar to the sharpest one among source images, this paper presents an optimization-based approach to reduce defocus spread effects.

SSIM

CARE: Commonsense-Aware Emotional Response Generation with Latent Concepts

no code implementations15 Dec 2020 Peixiang Zhong, Di Wang, Pengfei Li, Chen Zhang, Hao Wang, Chunyan Miao

Experimental results on two large-scale datasets support our hypothesis and show that our model can produce more accurate and commonsense-aware emotional responses and achieve better human ratings than state-of-the-art models that only specialize in one aspect.

Response Generation

On Convergence of Gradient Expected Sarsa($λ$)

no code implementations14 Dec 2020 Long Yang, Gang Zheng, Yu Zhang, Qian Zheng, Pengfei Li, Gang Pan

We study the convergence of $\mathtt{Expected~Sarsa}(\lambda)$ with linear function approximation.

DIDFuse: Deep Image Decomposition for Infrared and Visible Image Fusion

2 code implementations20 Mar 2020 Zixiang Zhao, Shuang Xu, Chun-Xia Zhang, Junmin Liu, Pengfei Li, Jiangshe Zhang

Infrared and visible image fusion, a hot topic in the field of image processing, aims at obtaining fused images keeping the advantages of source images.

Infrared And Visible Image Fusion

Car Pose in Context: Accurate Pose Estimation with Ground Plane Constraints

no code implementations9 Dec 2019 Pengfei Li, Weichao Qiu, Michael Peven, Gregory D. Hager, Alan L. Yuille

Scene context is a powerful constraint on the geometry of objects within the scene in cases, such as surveillance, where the camera geometry is unknown and image quality may be poor.

Car Pose Estimation

Improving Relation Extraction with Knowledge-attention

no code implementations IJCNLP 2019 Pengfei Li, Kezhi Mao, Xuefeng Yang, Qi Li

While attention mechanisms have been proven to be effective in many NLP tasks, majority of them are data-driven.

Relation Extraction

Gradient Q$(σ, λ)$: A Unified Algorithm with Function Approximation for Reinforcement Learning

no code implementations6 Sep 2019 Long Yang, Yu Zhang, Qian Zheng, Pengfei Li, Gang Pan

To address above problem, we propose a GQ$(\sigma,\lambda)$ that extends tabular Q$(\sigma,\lambda)$ with linear function approximation.

Q-Learning reinforcement-learning

Expected Sarsa($λ$) with Control Variate for Variance Reduction

no code implementations25 Jun 2019 Long Yang, Yu Zhang, Jun Wen, Qian Zheng, Pengfei Li, Gang Pan

In this paper, for reducing the variance, we introduce control variate technique to $\mathtt{Expected}$ $\mathtt{Sarsa}$($\lambda$) and propose a tabular $\mathtt{ES}$($\lambda$)-$\mathtt{CV}$ algorithm.

A Scalable Learned Index Scheme in Storage Systems

no code implementations8 May 2019 Pengfei Li, Yu Hua, Pengfei Zuo, Jingnan Jia

Index structures are important for efficient data access, which have been widely used to improve the performance in many in-memory systems.

Qualitative Measurements of Policy Discrepancy for Return-Based Deep Q-Network

no code implementations14 Jun 2018 Wenjia Meng, Qian Zheng, Long Yang, Pengfei Li, Gang Pan

In this paper, we propose a general framework to combine DQN and most of the return-based reinforcement learning algorithms, named R-DQN.

OpenAI Gym reinforcement-learning

Cannot find the paper you are looking for? You can Submit a new open access paper.