Search Results for author: Yunuo Zhang

Found 5 papers, 3 papers with code

Observation Adaptation via Annealed Importance Resampling for Partially Observable Markov Decision Processes

no code implementations25 Mar 2025 Yunuo Zhang, Baiting Luo, Ayan Mukhopadhyay, Abhishek Dubey

Since directly sampling from the ideal state distribution given the latest observation and previous state is infeasible, particle filters approximate the posterior belief distribution by propagating states and adjusting weights through prediction and resampling steps.

Decision Making Sequential Decision Making

NS-Gym: Open-Source Simulation Environments and Benchmarks for Non-Stationary Markov Decision Processes

1 code implementation16 Jan 2025 Nathaniel S. Keplinger, Baiting Luo, Iliyas Bektas, Yunuo Zhang, Kyle Hollins Wray, Aron Laszka, Abhishek Dubey, Ayan Mukhopadhyay

This toolkit is the first effort to develop a set of standardized interfaces and benchmark problems to enable consistent and reproducible evaluation of algorithms under non-stationary conditions.

Decision Making

Shrinking POMCP: A Framework for Real-Time UAV Search and Rescue

no code implementations20 Nov 2024 Yunuo Zhang, Baiting Luo, Ayan Mukhopadhyay, Daniel Stojcsics, Daniel Elenius, Anirban Roy, Susmit Jha, Miklos Maroti, Xenofon Koutsoukos, Gabor Karsai, Abhishek Dubey

We present a comprehensive approach to optimize UAV-based search and rescue operations in neighborhood areas, utilizing both a 3D AirSim-ROS2 simulator and a 2D simulator.

Decision Making in Non-Stationary Environments with Policy-Augmented Search

1 code implementation6 Jan 2024 Ava Pettet, Yunuo Zhang, Baiting Luo, Kyle Wray, Hendrik Baier, Aron Laszka, Abhishek Dubey, Ayan Mukhopadhyay

In this paper, we introduce \textit{Policy-Augmented Monte Carlo tree search} (PA-MCTS), which combines action-value estimates from an out-of-date policy with an online search using an up-to-date model of the environment.

Decision Making Decision Making Under Uncertainty +3

Act as You Learn: Adaptive Decision-Making in Non-Stationary Markov Decision Processes

1 code implementation3 Jan 2024 Baiting Luo, Yunuo Zhang, Abhishek Dubey, Ayan Mukhopadhyay

However, existing approaches for decision-making in NSMDPs have two major shortcomings: first, they assume that the updated environmental dynamics at the current time are known (although future dynamics can change); and second, planning is largely pessimistic, i. e., the agent acts ``safely'' to account for the non-stationary evolution of the environment.

Decision Making Sequential Decision Making

Cannot find the paper you are looking for? You can Submit a new open access paper.