Search Results for author: Mingxiao Feng

Found 6 papers, 4 papers with code

MA2CL:Masked Attentive Contrastive Learning for Multi-Agent Reinforcement Learning

1 code implementation • 3 Jun 2023 • Haolin Song, Mingxiao Feng, Wengang Zhou, Houqiang Li

Recent approaches have utilized self-supervised auxiliary tasks as representation learning to improve the performance and sample efficiency of vision-based reinforcement learning algorithms in single-agent settings.

Contrastive Learning Multi-agent Reinforcement Learning +2

Paper
Code

H-TSP: Hierarchically Solving the Large-Scale Travelling Salesman Problem

1 code implementation • 19 Apr 2023 • Xuanhao Pan, Yan Jin, Yuandong Ding, Mingxiao Feng, Li Zhao, Lei Song, Jiang Bian

We propose an end-to-end learning framework based on hierarchical reinforcement learning, called H-TSP, for addressing the large-scale Travelling Salesman Problem (TSP).

Hierarchical Reinforcement Learning reinforcement-learning

Paper
Code

Multi-Agent Reinforcement Learning with Shared Resources for Inventory Management

no code implementations • 15 Dec 2022 • Yuandong Ding, Mingxiao Feng, Guozi Liu, Wei Jiang, Chuheng Zhang, Li Zhao, Lei Song, Houqiang Li, Yan Jin, Jiang Bian

In this paper, we consider the inventory management (IM) problem where we need to make replenishment decisions for a large number of stock keeping units (SKUs) to balance their supply and demand.

Management Multi-agent Reinforcement Learning +2

Paper
Add Code

Stabilizing Voltage in Power Distribution Networks via Multi-Agent Reinforcement Learning with Transformer

1 code implementation • 8 Jun 2022 • Minrui Wang, Mingxiao Feng, Wengang Zhou, Houqiang Li

Utilizing MARL algorithms to coordinate multiple control units in the grid, which is able to handle rapid changes of power systems, has been widely studied in active voltage control task recently.

Multi-agent Reinforcement Learning reinforcement-learning +2

Paper
Code

Multi-Agent Reinforcement Learning with Shared Resource in Inventory Management

no code implementations • 29 Sep 2021 • Mingxiao Feng, Guozi Liu, Li Zhao, Lei Song, Jiang Bian, Tao Qin, Wengang Zhou, Houqiang Li, Tie-Yan Liu

We consider inventory management (IM) problem for a single store with a large number of SKUs (stock keeping units) in this paper, where we need to make replenishment decisions for each SKU to balance its supply and demand.

Management Multi-agent Reinforcement Learning +2

Paper
Add Code

PlayVirtual: Augmenting Cycle-Consistent Virtual Trajectories for Reinforcement Learning

2 code implementations • NeurIPS 2021 • Tao Yu, Cuiling Lan, Wenjun Zeng, Mingxiao Feng, Zhizheng Zhang, Zhibo Chen

In this work, we propose a novel method, dubbed PlayVirtual, which augments cycle-consistent virtual trajectories to enhance the data efficiency for RL feature representation learning.

Ranked #1 on Continuous Control (100k environment steps) on DeepMind Finger Spin (Images)

Continuous Control (100k environment steps) Continuous Control (500k environment steps) +3

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.