Search Results for author: Peng Wei

Found 36 papers, 4 papers with code

Relative-Translation Invariant Wasserstein Distance

no code implementations4 Sep 2024 Binshuai Wang, Qiwei Di, Ming Yin, Mengdi Wang, Quanquan Gu, Peng Wei

We introduce a new family of distances, relative-translation invariant Wasserstein distances ($RW_p$), for measuring the similarity of two probability distributions under distribution shift.

Computational Efficiency Translation

RuleAlign: Making Large Language Models Better Physicians with Diagnostic Rule Alignment

no code implementations22 Aug 2024 Xiaohan Wang, Xiaoyan Yang, Yuqi Zhu, Yue Shen, Jian Wang, Peng Wei, Lei Liang, Jinjie Gu, Huajun Chen, Ningyu Zhang

Large Language Models (LLMs) like GPT-4, MedPaLM-2, and Med-Gemini achieve performance competitively with human experts across various medical benchmarks.

Intra-symbol Differential Amplitude Shift Keying-aided Blind Detector for Ambient Backscatter Communication Systems

no code implementations16 Aug 2024 Shuaijun Ma, Peng Wei, Sa Xiao, Jianquan Wang, Wanbin Tang, Wei Xiang

Simulation results demonstrate that the proposed blind detector exhibits a significant enhancement in symbol detection performance compared to its traditional counterparts.

DHGS: Decoupled Hybrid Gaussian Splatting for Driving Scene

no code implementations23 Jul 2024 Xi Shi, Lingli Chen, Peng Wei, Xi Wu, Tian Jiang, Yonggang Luo, Lecheng Xie

This paper introduces a novel neural rendering method termed Decoupled Hybrid Gaussian Splatting (DHGS), targeting at promoting the rendering quality of novel view synthesis for static driving scenes.

Neural Rendering Novel View Synthesis

A Survey on Medical Large Language Models: Technology, Application, Trustworthiness, and Future Directions

no code implementations6 Jun 2024 Lei Liu, Xiaoyan Yang, Junchi Lei, Xiaoyang Liu, Yue Shen, Zhiqiang Zhang, Peng Wei, Jinjie Gu, Zhixuan Chu, Zhan Qin, Kui Ren

This survey provides a comprehensive overview of Medical Large Language Models (Med-LLMs), outlining their evolution from general to the medical-specific domain (i. e, Technology and Application), as well as their transformative impact on healthcare (e. g., Trustworthiness and Safety).

Fairness

Reinforcement Learning with Latent State Inference for Autonomous On-ramp Merging under Observation Delay

no code implementations18 Mar 2024 Amin Tabrizian, Zhitong Huang, Peng Wei

L3IS shows a 99. 90% success rate in a challenging on-ramp merging case generated from the real US Highway 101 data.

Edge Information Hub: Orchestrating Satellites, UAVs, MEC, Sensing and Communications for 6G Closed-Loop Controls

no code implementations11 Mar 2024 Chengleyang Lei, Wei Feng, Peng Wei, Yunfei Chen, Ning Ge, Shiwen Mao

Specifically, the linear quadratic regulator (LQR) control cost is used to measure the closed-loop utility, and a sum LQR cost minimization problem is formulated to jointly optimize the splitting of sensor data and allocation of communication and computing resources.

Edge-computing

Two-stage space construction for real-time modeling of distributed parameter systems under sparse sensing

no code implementations28 Oct 2023 Peng Wei

The high-dimensional space construction method is employed to derive continuous spatial basis functions (SBFs).

Fusion-Driven Tree Reconstruction and Fruit Localization: Advancing Precision in Agriculture

no code implementations23 Oct 2023 Kaiming Fu, Peng Wei, Juan Villacres, Zhaodan Kong, Stavros G. Vougioukas, Brian N. Bailey

Fruit distribution is pivotal in shaping the future of both agriculture and agricultural robotics, paving the way for a streamlined supply chain.

Multiscale Fusion for Abnormality Detection and Localization of Distributed Parameter Systems

no code implementations10 Oct 2023 Peng Wei, Han-Xiong Li

Numerous industrial thermal processes and fluid processes can be described by distributed parameter systems (DPSs), wherein many process parameters and variables vary in space and time.

Anomaly Detection Fault Detection

Integrated Conflict Management for UAM with Strategic Demand Capacity Balancing and Learning-based Tactical Deconfliction

no code implementations17 May 2023 Shulu Chen, Antony Evans, Marc Brittain, Peng Wei

By using DCB to precondition traffic to proper density levels, we show that reinforcement learning can achieve much better performance for tactical safety separation.

Management reinforcement-learning +1

MAC-PO: Multi-Agent Experience Replay via Collective Priority Optimization

1 code implementation21 Feb 2023 Yongsheng Mei, Hanhan Zhou, Tian Lan, Guru Venkataramani, Peng Wei

To this end, we propose MAC-PO, which formulates optimal prioritized experience replay for multi-agent problems as a regret minimization over the sampling weights of transitions.

Decision Making Multi-agent Reinforcement Learning +4

A Verification Framework for Certifying Learning-Based Safety-Critical Aviation Systems

no code implementations9 May 2022 Ali Baheri, Hao Ren, Benjamin Johnson, Pouria Razzaghi, Peng Wei

We present a safety verification framework for design-time and run-time assurance of learning-based components in aviation systems.

Decision Making

Reinforcement Learning-Empowered Mobile Edge Computing for 6G Edge Intelligence

no code implementations27 Jan 2022 Peng Wei, Kun Guo, Ye Li, Jue Wang, Wei Feng, Shi Jin, Ning Ge, Ying-Chang Liang

Mobile edge computing (MEC) is considered a novel paradigm for computation-intensive and delay-sensitive tasks in fifth generation (5G) networks and beyond.

Edge-computing reinforcement-learning +2

Obstacle Avoidance for UAS in Continuous Action Space Using Deep Reinforcement Learning

no code implementations13 Nov 2021 Jueming Hu, Xuxi Yang, Weichang Wang, Peng Wei, Lei Ying, Yongming Liu

Obstacle avoidance for small unmanned aircraft is vital for the safety of future urban air mobility (UAM) and Unmanned Aircraft System (UAS) Traffic Management (UTM).

continuous-control Continuous Control +4

Power System Transient Modeling and Simulation using Integrated Circuit

no code implementations6 Jun 2021 Xiang Zhang, Renchang Dai, Peng Wei, Yijing Liu, Guangyi Liu, Zhiwei Wang

Transient stability analysis (TSA) plays an important role in power system analysis to investigate the stability of power system.

Numerical Integration

Safety Enhancement for Deep Reinforcement Learning in Autonomous Separation Assurance

no code implementations5 May 2021 Wei Guo, Marc Brittain, Peng Wei

We demonstrate the effectiveness of the two sub-modules in an open-source air traffic simulator with challenging environment settings.

Data Augmentation Deep Reinforcement Learning +2

Scalable FastMDP for Pre-departure Airspace Reservation and Strategic De-conflict

no code implementations8 Aug 2020 Joshua R. Bertram, Peng Wei, Joseph Zambreno

Our results show that on commodity GPU hardware we can perform flight plan scheduling against 2000-3000 known flight plans and with server-class hardware the performance can be higher.

Scheduling

Continuous Control for Searching and Planning with a Learned Model

no code implementations12 Jun 2020 Xuxi Yang, Werner Duvaud, Peng Wei

Decision-making agents with planning capabilities have achieved huge success in the challenging domain like Chess, Shogi, and Go.

continuous-control Continuous Control +4

A Deep Multi-Agent Reinforcement Learning Approach to Autonomous Separation Assurance

no code implementations17 Mar 2020 Marc Brittain, Xuxi Yang, Peng Wei

A novel deep multi-agent reinforcement learning framework is proposed to identify and resolve conflicts among a variable number of aircraft in a high-density, stochastic, and dynamic sector.

Multi-agent Reinforcement Learning reinforcement-learning +2

An Efficient Algorithm for Multiple-Pursuer-Multiple-Evader Pursuit/Evasion Game

no code implementations9 Sep 2019 Joshua R. Bertram, Peng Wei

We present a method for pursuit/evasion that is highly efficient and and scales to large teams of aircraft.

Prioritized Sequence Experience Replay

no code implementations25 May 2019 Marc Brittain, Josh Bertram, Xuxi Yang, Peng Wei

Experience replay is widely used in deep reinforcement learning algorithms and allows agents to remember and learn from experiences from the past.

Deep Reinforcement Learning Q-Learning +2

Autonomous Air Traffic Controller: A Deep Multi-Agent Reinforcement Learning Approach

no code implementations2 May 2019 Marc Brittain, Peng Wei

Air traffic control is a real-time safety-critical decision making process in highly dynamic and stochastic environments.

Decision Making Multi-agent Reinforcement Learning +3

Autonomous Airline Revenue Management: A Deep Reinforcement Learning Approach to Seat Inventory Control and Overbooking

no code implementations18 Feb 2019 Syed Arbab Mohd Shihab, Caleb Logemann, Deepak-George Thomas, Peng Wei

This paper focuses on the latter problem - the seat inventory control problem - casting it as a Markov Decision Process to be able to find the optimal policy.

Deep Reinforcement Learning Management +1

A PCB Dataset for Defects Detection and Classification

5 code implementations24 Jan 2019 Weibo Huang, Peng Wei

To coupe with the difficulties in the process of inspection and classification of defects in Printed Circuit Board (PCB), other researchers have proposed many methods.

Classification General Classification

Basis Signal Optimization for N-Continuous OFDM

no code implementations28 Dec 2018 Peng Wei, Yue Xiao, Wei Xiang

A novel basis signal optimization method is proposed for reducing the interference in the N-continuous orthogonal frequency division multiplexing (NC-OFDM) system.

AutoAccel: Automated Accelerator Generation and Optimization with Composable, Parallel and Pipeline Architecture

no code implementations30 Jul 2018 Jason Cong, Peng Wei, Cody Hao Yu, Peng Zhang

Such a well-defined template is able to support efficient accelerator designs for a broad class of computation kernels, and more importantly, drastically reduce the design space.

Distributed, Parallel, and Cluster Computing Hardware Architecture

Explainable Deterministic MDPs

no code implementations9 Jun 2018 Josh Bertram, Peng Wei

We present a method for a certain class of Markov Decision Processes (MDPs) that can relate the optimal policy back to one or more reward sources in the environment.

Hierarchical Reinforcement Learning with Deep Nested Agents

no code implementations18 May 2018 Marc Brittain, Peng Wei

Deep hierarchical reinforcement learning has gained a lot of attention in recent years due to its ability to produce state-of-the-art results in challenging environments where non-hierarchical frameworks fail to learn useful policies.

Hierarchical Reinforcement Learning Minecraft +3

Memoryless Exact Solutions for Deterministic MDPs with Sparse Rewards

no code implementations17 May 2018 Joshua R. Bertram, Peng Wei

The algorithm to solve the MDP does not depend on the size of the state space for either time or memory complexity, and the ability to follow the optimal policy is linear in time and space with the path length of following the optimal policy from the initial state.

Fast Online Exact Solutions for Deterministic MDPs with Sparse Rewards

no code implementations8 May 2018 Joshua R. Bertram, Xuxi Yang, Peng Wei

Markov Decision Processes (MDPs) are a mathematical framework for modeling sequential decision making under uncertainty.

Decision Making Decision Making Under Uncertainty +1

Cannot find the paper you are looking for? You can Submit a new open access paper.