Search Results for author: Chengdong Ma

Found 11 papers, 1 papers with code

EconGym: A Scalable AI Testbed with Diverse Economic Tasks

no code implementations13 Jun 2025 Qirui Mi, Qipeng Yang, Zijun Fan, Wentian Fan, Heyang Ma, Chengdong Ma, Siyu Xia, Bo An, Jun Wang, Haifeng Zhang

Artificial intelligence (AI) has become a powerful tool for economic research, enabling large-scale simulation and policy optimization.

Benchmarking

A Survey on Self-play Methods in Reinforcement Learning

no code implementations2 Aug 2024 Ruize Zhang, Zelai Xu, Chengdong Ma, Chao Yu, Wei-Wei Tu, Wenhao Tang, Shiyu Huang, Deheng Ye, Wenbo Ding, Yaodong Yang, Yu Wang

Self-play, characterized by agents' interactions with copies or past versions of themselves, has recently gained prominence in reinforcement learning (RL).

Multi-agent Reinforcement Learning reinforcement-learning +3

Fusion-PSRO: Nash Policy Fusion for Policy Space Response Oracles

no code implementations31 May 2024 Jiesong Lian, Yucong Huang, Chengdong Ma, Mingzhi Wang, Ying Wen, Long Hu, Yixue Hao

For solving zero-sum games involving non-transitivity, a useful approach is to maintain a policy population to approximate the Nash Equilibrium (NE).

Multi-agent Reinforcement Learning

Learning Macroeconomic Policies through Dynamic Stackelberg Mean-Field Games

no code implementations14 Mar 2024 Qirui Mi, Zhiyu Zhao, Chengdong Ma, Siyu Xia, Yan Song, Mengyue Yang, Jun Wang, Haifeng Zhang

Macroeconomic outcomes emerge from individuals' decisions, making it essential to model how agents interact with macro policy via consumption, investment, and labor choices.

Decision Making

Roadmap on Incentive Compatibility for AI Alignment and Governance in Sociotechnical Systems

no code implementations20 Feb 2024 Zhaowei Zhang, Fengshuo Bai, Mingzhi Wang, Haoyang Ye, Chengdong Ma, Yaodong Yang

The burgeoning integration of artificial intelligence (AI) into human society brings forth significant implications for societal governance and safety.

Panacea: Pareto Alignment via Preference Adaptation for LLMs

no code implementations3 Feb 2024 Yifan Zhong, Chengdong Ma, Xiaoyuan Zhang, Ziran Yang, Haojun Chen, Qingfu Zhang, Siyuan Qi, Yaodong Yang

Panacea trains a single model capable of adapting online and Pareto-optimally to diverse sets of preferences without the need for further tuning.

Language Modelling Large Language Model

Evolving Diverse Red-team Language Models in Multi-round Multi-agent Games

no code implementations30 Sep 2023 Chengdong Ma, Ziran Yang, Hai Ci, Jun Gao, Minquan Gao, Xuehai Pan, Yaodong Yang

Furthermore, we develop a Gamified Red Team Solver (GRTS) with diversity measures to mitigate mode collapse and theoretically guarantee the convergence of approximate Nash equilibrium which results in better strategies for both teams.

Diversity Language Modelling +2

Scalable Model-based Policy Optimization for Decentralized Networked Systems

no code implementations13 Jul 2022 Yali Du, Chengdong Ma, Yuchen Liu, Runji Lin, Hao Dong, Jun Wang, Yaodong Yang

Reinforcement learning algorithms require a large amount of samples; this often limits their real-world applications on even simple tasks.

Traffic Signal Control

Game-Theoretic Multiagent Reinforcement Learning

1 code implementation1 Nov 2020 Yaodong Yang, Chengdong Ma, Zihan Ding, Stephen Mcaleer, Chi Jin, Jun Wang

In this work, we provide a monograph on MARL that covers both the fundamentals and the latest developments in the research frontier.

Multi-agent Reinforcement Learning reinforcement-learning +2

Cannot find the paper you are looking for? You can Submit a new open access paper.