Search Results for author: Minne Li

Found 11 papers, 5 papers with code

Mean Field Multi-Agent Reinforcement Learning

3 code implementations • ICML 2018 • Yaodong Yang, Rui Luo, Minne Li, Ming Zhou, Wei-Nan Zhang, Jun Wang

Existing multi-agent reinforcement learning methods are limited typically to a small number of agents.

Multi-agent Reinforcement Learning Q-Learning +2

363

Paper
Code

Bi-level Actor-Critic for Multi-agent Coordination

1 code implementation • 8 Sep 2019 • Haifeng Zhang, Weizhe Chen, Zeren Huang, Minne Li, Yaodong Yang, Wei-Nan Zhang, Jun Wang

Coordination is one of the essential problems in multi-agent systems.

Multiagent Systems

Paper
Code

Multi-Agent Trust Region Learning

1 code implementation • 1 Jan 2021 • Ying Wen, Hui Chen, Yaodong Yang, Zheng Tian, Minne Li, Xu Chen, Jun Wang

We derive the lower bound of agents' payoff improvements for MATRL methods, and also prove the convergence of our method on the meta-game fixed points.

Atari Games Multi-agent Reinforcement Learning +3

Paper
Code

A Game-Theoretic Approach to Multi-Agent Trust Region Optimization

1 code implementation • 12 Jun 2021 • Ying Wen, Hui Chen, Yaodong Yang, Zheng Tian, Minne Li, Xu Chen, Jun Wang

Trust region methods are widely applied in single-agent reinforcement learning problems due to their monotonic performance-improvement guarantee at every iteration.

Atari Games Multi-agent Reinforcement Learning +2

Paper
Code

Multi-View Reinforcement Learning

1 code implementation • NeurIPS 2019 • Minne Li, Lisheng Wu, Haitham Bou Ammar, Jun Wang

This paper is concerned with multi-view reinforcement learning (MVRL), which allows for decision making when agents share common dynamics but adhere to different observation models.

Decision Making reinforcement-learning +1

Paper
Code

S-OHEM: Stratified Online Hard Example Mining for Object Detection

no code implementations • 5 May 2017 • Minne Li, Zhaoning Zhang, Hao Yu, Xinyuan Chen, Dongsheng Li

S-OHEM exploits OHEM with stratified sampling, a widely-adopted sampling technique, to choose the training examples according to this influence during hard example mining, and thus enhance the performance of object detectors.

object-detection Object Detection

Paper
Add Code

Learning Shared Dynamics with Meta-World Models

no code implementations • 5 Nov 2018 • Lisheng Wu, Minne Li, Jun Wang

Humans have consciousness as the ability to perceive events and objects: a mental model of the world developed from the most impoverished of visual stimuli, enabling humans to make rapid decisions and take actions.

Atari Games Multi-Task Learning

Paper
Add Code

Joint Perception and Control as Inference with an Object-based Implementation

no code implementations • 4 Mar 2019 • Minne Li, Zheng Tian, Pranav Nashikkar, Ian Davies, Ying Wen, Jun Wang

Existing model-based reinforcement learning methods often study perception modeling and decision making separately.

Bayesian Inference Decision Making +2

Paper
Add Code

CoRide: Joint Order Dispatching and Fleet Management for Multi-Scale Ride-Hailing Platforms

no code implementations • 27 May 2019 • Jiarui Jin, Ming Zhou, Wei-Nan Zhang, Minne Li, Zilong Guo, Zhiwei Qin, Yan Jiao, Xiaocheng Tang, Chenxi Wang, Jun Wang, Guobin Wu, Jieping Ye

How to optimally dispatch orders to vehicles and how to trade off between immediate and future returns are fundamental questions for a typical ride-hailing platform.

Multiagent Systems

Paper
Add Code

Compositional ADAM: An Adaptive Compositional Solver

no code implementations • 10 Feb 2020 • Rasul Tutunov, Minne Li, Alexander I. Cowen-Rivers, Jun Wang, Haitham Bou-Ammar

In this paper, we present C-ADAM, the first adaptive solver for compositional problems involving a non-linear functional nesting of expected values.

Meta-Learning

Paper
Add Code

Causal World Models by Unsupervised Deconfounding of Physical Dynamics

no code implementations • 28 Dec 2020 • Minne Li, Mengyue Yang, Furui Liu, Xu Chen, Zhitang Chen, Jun Wang

The capability of imagining internally with a mental model of the world is vitally important for human cognition.

counterfactual

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.