Search Results for author: Lei Song

Found 14 papers, 10 papers with code

Higher Replay Ratio Empowers Sample-Efficient Multi-Agent Reinforcement Learning

no code implementations • 15 Apr 2024 • Linjie Xu, Zichuan Liu, Alexander Dockhorn, Diego Perez-Liebana, Jinyu Wang, Lei Song, Jiang Bian

One of the notorious issues for Reinforcement Learning (RL) is poor sample efficiency.

reinforcement-learning Reinforcement Learning (RL) +1

Paper
Add Code

Reinforced In-Context Black-Box Optimization

1 code implementation • 27 Feb 2024 • Lei Song, Chenxiao Gao, Ke Xue, Chenyang Wu, Dong Li, Jianye Hao, Zongzhang Zhang, Chao Qian

In this paper, we propose RIBBO, a method to reinforce-learn a BBO algorithm from offline data in an end-to-end fashion.

In-Context Learning Meta-Learning

Paper
Code

Stochastic Bayesian Optimization with Unknown Continuous Context Distribution via Kernel Density Estimation

1 code implementation • 16 Dec 2023 • Xiaobin Huang, Lei Song, Ke Xue, Chao Qian

Considering that the estimated PDF may have high estimation error when the true distribution is complicated, we further propose the second algorithm that optimizes the distributionally robust objective.

Bayesian Optimization Density Estimation

Paper
Code

Pre-Trained Large Language Models for Industrial Control

no code implementations • 6 Aug 2023 • Lei Song, Chuheng Zhang, Li Zhao, Jiang Bian

2)~How well can GPT-4 generalize to different scenarios for HVAC control?

Paper
Add Code

A Versatile Multi-Agent Reinforcement Learning Benchmark for Inventory Management

1 code implementation • 13 Jun 2023 • Xianliang Yang, Zhihao Liu, Wei Jiang, Chuheng Zhang, Li Zhao, Lei Song, Jiang Bian

Multi-agent reinforcement learning (MARL) models multiple agents that interact and learn within a shared environment.

Autonomous Driving Management +2

Paper
Code

Mildly Constrained Evaluation Policy for Offline Reinforcement Learning

1 code implementation • 6 Jun 2023 • Linjie Xu, Zhengyao Jiang, Jinyu Wang, Lei Song, Jiang Bian

Offline reinforcement learning (RL) methodologies enforce constraints on the policy to adhere closely to the behavior policy, thereby stabilizing value learning and mitigating the selection of out-of-distribution (OOD) actions during test time.

Offline RL reinforcement-learning +1

Paper
Code

RFR-WWANet: Weighted Window Attention-Based Recovery Feature Resolution Network for Unsupervised Image Registration

1 code implementation • 7 May 2023 • Mingrui Ma, Tao Wang, Lei Song, Weijie Wang, Guixia Liu

Furthermore, shifted window partitioning operations are inflexible, indicating that they cannot perceive the semantic information over uncertain distances and automatically bridge the global connections between windows.

Computational Efficiency Long-range modeling +1

Paper
Code

H-TSP: Hierarchically Solving the Large-Scale Travelling Salesman Problem

1 code implementation • 19 Apr 2023 • Xuanhao Pan, Yan Jin, Yuandong Ding, Mingxiao Feng, Li Zhao, Lei Song, Jiang Bian

We propose an end-to-end learning framework based on hierarchical reinforcement learning, called H-TSP, for addressing the large-scale Travelling Salesman Problem (TSP).

Hierarchical Reinforcement Learning reinforcement-learning

Paper
Code

Pointerformer: Deep Reinforced Multi-Pointer Transformer for the Traveling Salesman Problem

1 code implementation • 19 Apr 2023 • Yan Jin, Yuandong Ding, Xuanhao Pan, Kun He, Li Zhao, Tao Qin, Lei Song, Jiang Bian

Traveling Salesman Problem (TSP), as a classic routing optimization problem originally arising in the domain of transportation and logistics, has become a critical task in broader domains, such as manufacturing and biology.

Traveling Salesman Problem

Paper
Code

Multi-Agent Reinforcement Learning with Shared Resources for Inventory Management

no code implementations • 15 Dec 2022 • Yuandong Ding, Mingxiao Feng, Guozi Liu, Wei Jiang, Chuheng Zhang, Li Zhao, Lei Song, Houqiang Li, Yan Jin, Jiang Bian

In this paper, we consider the inventory management (IM) problem where we need to make replenishment decisions for a large number of stock keeping units (SKUs) to balance their supply and demand.

Management Multi-agent Reinforcement Learning +2

Paper
Add Code

TD3 with Reverse KL Regularizer for Offline Reinforcement Learning from Mixed Datasets

1 code implementation • 5 Dec 2022 • Yuanying Cai, Chuheng Zhang, Li Zhao, Wei Shen, Xuyun Zhang, Lei Song, Jiang Bian, Tao Qin, TieYan Liu

There are two challenges for this setting: 1) The optimal trade-off between optimizing the RL signal and the behavior cloning (BC) signal changes on different states due to the variation of the action coverage induced by different behavior policies.

D4RL Offline RL +2

Paper
Code

Monte Carlo Tree Search based Variable Selection for High Dimensional Bayesian Optimization

1 code implementation • 4 Oct 2022 • Lei Song, Ke Xue, Xiaobin Huang, Chao Qian

Bayesian optimization (BO) is a class of popular methods for expensive black-box optimization, and has been widely applied to many scenarios.

Bayesian Optimization Variable Selection

Paper
Code

Symmetric Transformer-based Network for Unsupervised Image Registration

1 code implementation • 28 Apr 2022 • Mingrui Ma, Lei Song, Yuanbo Xu, Guixia Liu

Medical image registration is a fundamental and critical task in medical image analysis.

Medical Image Registration Unsupervised Image Registration

Paper
Code

Multi-Agent Reinforcement Learning with Shared Resource in Inventory Management

no code implementations • 29 Sep 2021 • Mingxiao Feng, Guozi Liu, Li Zhao, Lei Song, Jiang Bian, Tao Qin, Wengang Zhou, Houqiang Li, Tie-Yan Liu

We consider inventory management (IM) problem for a single store with a large number of SKUs (stock keeping units) in this paper, where we need to make replenishment decisions for each SKU to balance its supply and demand.

Management Multi-agent Reinforcement Learning +2

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.