Search Results for author: Hanhan Zhou

Found 9 papers, 4 papers with code

Collaborative AI Teaming in Unknown Environments via Active Goal Deduction

no code implementations • 22 Mar 2024 • Zuyuan Zhang, Hanhan Zhou, Mahdi Imani, Taeyoung Lee, Tian Lan

With the advancements of artificial intelligence (AI), we're seeing more scenarios that require AI to work closely with other agents, whose goals and strategies might not be known beforehand.

Starcraft, Starcraft II

Real-time Network Intrusion Detection via Decision Transformers

no code implementations • 12 Dec 2023 • Jingdi Chen, Hanhan Zhou, Yongsheng Mei, Gina Adam, Nathaniel D. Bastian, Tian Lan

Many cybersecurity problems that require real-time decision-making based on temporal observations can be abstracted as sequence modeling problems, e.g., network intrusion detection from a sequence of arriving packets.

Decision Making, Network Intrusion Detection, +1

Statistically Efficient Variance Reduction with Double Policy Estimation for Off-Policy Evaluation in Sequence-Modeled Reinforcement Learning

no code implementations • 28 Aug 2023 • Hanhan Zhou, Tian Lan, Vaneet Aggarwal

Offline reinforcement learning aims to utilize datasets of previously gathered environment-action interaction records to learn a policy without access to the real environment.

D4RL, Off-policy evaluation, +2

MAC-PO: Multi-Agent Experience Replay via Collective Priority Optimization

1 code implementation • 21 Feb 2023 • Yongsheng Mei, Hanhan Zhou, Tian Lan, Guru Venkataramani, Peng Wei

To this end, we propose MAC-PO, which formulates optimal prioritized experience replay for multi-agent problems as a regret minimization over the sampling weights of transitions.

Decision Making, Multi-agent Reinforcement Learning, +3

ReMIX: Regret Minimization for Monotonic Value Function Factorization in Multiagent Reinforcement Learning

no code implementations • 11 Feb 2023 • Yongsheng Mei, Hanhan Zhou, Tian Lan

Such an optimization problem can be relaxed and solved using the Lagrangian multiplier method to obtain the closed-form optimal projection weights.

Decision Making, reinforcement-learning, +2

PAC: Assisted Value Factorisation with Counterfactual Predictions in Multi-Agent Reinforcement Learning

1 code implementation • 22 Jun 2022 • Hanhan Zhou, Tian Lan, Vaneet Aggarwal

Multi-agent reinforcement learning (MARL) has witnessed significant progress with the development of value function factorization methods.

counterfactual, Multi-agent Reinforcement Learning, +5

On the Convergence of Heterogeneous Federated Learning with Arbitrary Adaptive Online Model Pruning

1 code implementation • 27 Jan 2022 • Hanhan Zhou, Tian Lan, Guru Venkataramani, Wenbo Ding

In this paper, we present a unifying framework for heterogeneous FL algorithms with arbitrary adaptive online model pruning and provide a general convergence analysis.

Federated Learning, Open-Ended Question Answering

Value Functions Factorization with Latent State Information Sharing in Decentralized Multi-Agent Policy Gradients

1 code implementation • 4 Jan 2022 • Hanhan Zhou, Tian Lan, Vaneet Aggarwal

To this end, we present LSF-SAC, a novel framework that features a variational inference-based information-sharing mechanism as extra state information to assist individual agents in the value function factorization.

Starcraft, Starcraft II, +1

PT-VTON: an Image-Based Virtual Try-On Network with Progressive Pose Attention Transfer

no code implementations • 23 Nov 2021 • Hanhan Zhou, Tian Lan, Guru Venkataramani

The virtual try-on system has gained great attention due to its potential to give customers a realistic, personalized product presentation in virtualized settings.

Pose Transfer, Virtual Try-on
