no code implementations • 24 Oct 2024 • Yuhua Liao, Zetian Wang, Peng Wei, Qiangqiang Nie, Zhenhua Zhang
Deep learning and pre-trained models have shown great success in time series forecasting.
no code implementations • 4 Sep 2024 • Binshuai Wang, Qiwei Di, Ming Yin, Mengdi Wang, Quanquan Gu, Peng Wei
We introduce a new family of distances, relative-translation invariant Wasserstein distances ($RW_p$), for measuring the similarity of two probability distributions under distribution shift.
no code implementations • 22 Aug 2024 • Xiaohan Wang, Xiaoyan Yang, Yuqi Zhu, Yue Shen, Jian Wang, Peng Wei, Lei Liang, Jinjie Gu, Huajun Chen, Ningyu Zhang
Large Language Models (LLMs) like GPT-4, MedPaLM-2, and Med-Gemini achieve performance competitively with human experts across various medical benchmarks.
no code implementations • 16 Aug 2024 • Shuaijun Ma, Peng Wei, Sa Xiao, Jianquan Wang, Wanbin Tang, Wei Xiang
Simulation results demonstrate that the proposed blind detector exhibits a significant enhancement in symbol detection performance compared to its traditional counterparts.
no code implementations • 23 Jul 2024 • Xi Shi, Lingli Chen, Peng Wei, Xi Wu, Tian Jiang, Yonggang Luo, Lecheng Xie
This paper introduces a novel neural rendering method termed Decoupled Hybrid Gaussian Splatting (DHGS), targeting at promoting the rendering quality of novel view synthesis for static driving scenes.
1 code implementation • 20 Jun 2024 • Junjie Wang, Mingyang Chen, Binbin Hu, Dan Yang, Ziqi Liu, Yue Shen, Peng Wei, Zhiqiang Zhang, Jinjie Gu, Jun Zhou, Jeff Z. Pan, Wen Zhang, Huajun Chen
LLMs fine-tuned with this data have improved planning capabilities, better equipping them to handle complex QA tasks that involve retrieval.
no code implementations • 6 Jun 2024 • Lei Liu, Xiaoyan Yang, Junchi Lei, Xiaoyang Liu, Yue Shen, Zhiqiang Zhang, Peng Wei, Jinjie Gu, Zhixuan Chu, Zhan Qin, Kui Ren
This survey provides a comprehensive overview of Medical Large Language Models (Med-LLMs), outlining their evolution from general to the medical-specific domain (i. e, Technology and Application), as well as their transformative impact on healthcare (e. g., Trustworthiness and Safety).
1 code implementation • 7 May 2024 • Kailash Gogineni, Sai Santosh Dayapule, Juan Gómez-Luna, Karthikeya Gogineni, Peng Wei, Tian Lan, Mohammad Sadrosadati, Onur Mutlu, Guru Venkataramani
Reinforcement Learning (RL) trains agents to learn optimal behavior by maximizing reward signals from experience datasets.
no code implementations • 18 Mar 2024 • Amin Tabrizian, Zhitong Huang, Peng Wei
L3IS shows a 99. 90% success rate in a challenging on-ramp merging case generated from the real US Highway 101 data.
no code implementations • 11 Mar 2024 • Chengleyang Lei, Wei Feng, Peng Wei, Yunfei Chen, Ning Ge, Shiwen Mao
Specifically, the linear quadratic regulator (LQR) control cost is used to measure the closed-loop utility, and a sum LQR cost minimization problem is formulated to jointly optimize the splitting of sensor data and allocation of communication and computing resources.
no code implementations • 13 Jan 2024 • Kaiqun Wu, Xiaoling Jiang, Rui Yu, Yonggang Luo, Tian Jiang, Xi Wu, Peng Wei
To capture the subtle differences, a fine-grained network is adopted to acquire multi-scale features.
no code implementations • 28 Oct 2023 • Peng Wei
The high-dimensional space construction method is employed to derive continuous spatial basis functions (SBFs).
no code implementations • 23 Oct 2023 • Kaiming Fu, Peng Wei, Juan Villacres, Zhaodan Kong, Stavros G. Vougioukas, Brian N. Bailey
Fruit distribution is pivotal in shaping the future of both agriculture and agricultural robotics, paving the way for a streamlined supply chain.
no code implementations • 10 Oct 2023 • Peng Wei, Han-Xiong Li
Numerous industrial thermal processes and fluid processes can be described by distributed parameter systems (DPSs), wherein many process parameters and variables vary in space and time.
no code implementations • 17 May 2023 • Shulu Chen, Antony Evans, Marc Brittain, Peng Wei
By using DCB to precondition traffic to proper density levels, we show that reinforcement learning can achieve much better performance for tactical safety separation.
1 code implementation • 21 Feb 2023 • Yongsheng Mei, Hanhan Zhou, Tian Lan, Guru Venkataramani, Peng Wei
To this end, we propose MAC-PO, which formulates optimal prioritized experience replay for multi-agent problems as a regret minimization over the sampling weights of transitions.
no code implementations • 3 Nov 2022 • Pouria Razzaghi, Amin Tabrizian, Wei Guo, Shulu Chen, Abenezer Taye, Ellis Thompson, Alexis Bregeon, Ali Baheri, Peng Wei
Then we survey the landscape of existing RL-based applications in aviation.
no code implementations • 9 May 2022 • Ali Baheri, Hao Ren, Benjamin Johnson, Pouria Razzaghi, Peng Wei
We present a safety verification framework for design-time and run-time assurance of learning-based components in aviation systems.
no code implementations • 27 Jan 2022 • Peng Wei, Kun Guo, Ye Li, Jue Wang, Wei Feng, Shi Jin, Ning Ge, Ying-Chang Liang
Mobile edge computing (MEC) is considered a novel paradigm for computation-intensive and delay-sensitive tasks in fifth generation (5G) networks and beyond.
no code implementations • 13 Nov 2021 • Jueming Hu, Xuxi Yang, Weichang Wang, Peng Wei, Lei Ying, Yongming Liu
Obstacle avoidance for small unmanned aircraft is vital for the safety of future urban air mobility (UAM) and Unmanned Aircraft System (UAS) Traffic Management (UTM).
no code implementations • 6 Jun 2021 • Xiang Zhang, Renchang Dai, Peng Wei, Yijing Liu, Guangyi Liu, Zhiwei Wang
Transient stability analysis (TSA) plays an important role in power system analysis to investigate the stability of power system.
no code implementations • 5 May 2021 • Wei Guo, Marc Brittain, Peng Wei
We demonstrate the effectiveness of the two sub-modules in an open-source air traffic simulator with challenging environment settings.
no code implementations • 8 Aug 2020 • Joshua R. Bertram, Peng Wei, Joseph Zambreno
Our results show that on commodity GPU hardware we can perform flight plan scheduling against 2000-3000 known flight plans and with server-class hardware the performance can be higher.
no code implementations • 12 Jun 2020 • Xuxi Yang, Werner Duvaud, Peng Wei
Decision-making agents with planning capabilities have achieved huge success in the challenging domain like Chess, Shogi, and Go.
no code implementations • 17 Mar 2020 • Marc Brittain, Xuxi Yang, Peng Wei
A novel deep multi-agent reinforcement learning framework is proposed to identify and resolve conflicts among a variable number of aircraft in a high-density, stochastic, and dynamic sector.
Multi-agent Reinforcement Learning reinforcement-learning +2
no code implementations • 9 Sep 2019 • Joshua R. Bertram, Peng Wei
We present a method for pursuit/evasion that is highly efficient and and scales to large teams of aircraft.
no code implementations • 25 May 2019 • Marc Brittain, Josh Bertram, Xuxi Yang, Peng Wei
Experience replay is widely used in deep reinforcement learning algorithms and allows agents to remember and learn from experiences from the past.
no code implementations • 2 May 2019 • Marc Brittain, Peng Wei
Air traffic control is a real-time safety-critical decision making process in highly dynamic and stochastic environments.
no code implementations • 18 Feb 2019 • Syed Arbab Mohd Shihab, Caleb Logemann, Deepak-George Thomas, Peng Wei
This paper focuses on the latter problem - the seat inventory control problem - casting it as a Markov Decision Process to be able to find the optimal policy.
5 code implementations • 24 Jan 2019 • Weibo Huang, Peng Wei
To coupe with the difficulties in the process of inspection and classification of defects in Printed Circuit Board (PCB), other researchers have proposed many methods.
no code implementations • 28 Dec 2018 • Peng Wei, Yue Xiao, Wei Xiang
A novel basis signal optimization method is proposed for reducing the interference in the N-continuous orthogonal frequency division multiplexing (NC-OFDM) system.
no code implementations • 30 Jul 2018 • Jason Cong, Peng Wei, Cody Hao Yu, Peng Zhang
Such a well-defined template is able to support efficient accelerator designs for a broad class of computation kernels, and more importantly, drastically reduce the design space.
Distributed, Parallel, and Cluster Computing Hardware Architecture
no code implementations • 9 Jun 2018 • Josh Bertram, Peng Wei
We present a method for a certain class of Markov Decision Processes (MDPs) that can relate the optimal policy back to one or more reward sources in the environment.
no code implementations • 18 May 2018 • Marc Brittain, Peng Wei
Deep hierarchical reinforcement learning has gained a lot of attention in recent years due to its ability to produce state-of-the-art results in challenging environments where non-hierarchical frameworks fail to learn useful policies.
no code implementations • 17 May 2018 • Joshua R. Bertram, Peng Wei
The algorithm to solve the MDP does not depend on the size of the state space for either time or memory complexity, and the ability to follow the optimal policy is linear in time and space with the path length of following the optimal policy from the initial state.
no code implementations • 8 May 2018 • Joshua R. Bertram, Xuxi Yang, Peng Wei
Markov Decision Processes (MDPs) are a mathematical framework for modeling sequential decision making under uncertainty.