no code implementations • 3 Mar 2025 • Ziyan Wang, Zhicheng Zhang, Fei Fang, Yali Du
Designing effective reward functions in multi-agent reinforcement learning (MARL) is a significant challenge, often leading to suboptimal or misaligned behaviors in complex, coordinated environments.
Multi-agent Reinforcement Learning
reinforcement-learning
+1
no code implementations • 24 Feb 2025 • Jibang Wu, Chenghao Yang, Simon Mahns, Chaoqi Wang, Hao Zhu, Fei Fang, Haifeng Xu
This paper develops an agentic framework that employs large language models (LLMs) to automate the generation of persuasive and grounded marketing content, using real estate listing descriptions as our focal application domain.
no code implementations • 25 Oct 2024 • Danqing Wang, Zhuorui Ye, Fei Fang, Lei LI
However, the lack of effective cooperation between LLM agents hinders their performance, especially for multi-step reasoning tasks.
no code implementations • 17 Oct 2024 • Mian Zhang, Xianjun Yang, Xinlu Zhang, Travis Labrum, Jamie C. Chiu, Shaun M. Eack, Fei Fang, William Yang Wang, Zhiyu Zoey Chen
There is a significant gap between patient needs and available mental health support today.
1 code implementation • 2 Oct 2024 • Danqing Wang, Jianxin Ma, Fei Fang, Lei LI
Despite significant advancements in the reasoning capabilities of Large Language Models (LLMs), the lack of diverse reasoning solutions often makes them trapped in a limited solution search area.
no code implementations • 30 Aug 2024 • Tyler Malloy, Maria José Ferreira, Fei Fang, Cleotilde Gonzalez
Cosine similarity between two documents can be computed using token embeddings formed by Large Language Models (LLMs) such as GPT-4, and used to categorize those documents across a range of uses.
no code implementations • 22 Jul 2024 • Zhuorui Ye, Stephanie Milani, Geoffrey J. Gordon, Fei Fang
To overcome this limitation, we introduce a novel training scheme that enables RL algorithms to efficiently learn a concept-based policy by only querying humans to label a small set of data, or in the extreme case, without any human labels.
no code implementations • 6 Jun 2024 • Jingwu Tang, Gokul Swamy, Fei Fang, Zhiwei Steven Wu
We study a multi-agent imitation learning (MAIL) problem where we take the perspective of a learner attempting to coordinate a group of agents based on demonstrations of an expert doing so.
no code implementations • 2 Jun 2024 • Naveen Raman, Zheyuan Ryan Shi, Fei Fang
Restless multi-armed bandits (RMAB) extend multi-armed bandits so pulling an arm impacts future states.
no code implementations • 30 May 2024 • Ziyan Wang, Meng Fang, Tristan Tomilin, Fei Fang, Yali Du
These embeddings are then integrated into the multi-agent policy learning process, enabling agents to learn policies that minimize constraint violations while optimizing rewards.
1 code implementation • 30 May 2024 • Ruiyi Wang, Stephanie Milani, Jamie C. Chiu, Jiayin Zhi, Shaun M. Eack, Travis Labrum, Samuel M. Murphy, Nev Jones, Kate Hardy, Hong Shen, Fei Fang, Zhiyu Zoey Chen
We propose an interactive training scheme, PATIENT-{\Psi}-TRAINER, for mental health trainees to practice a key skill in CBT -- formulating the cognitive model of the patient -- through role-playing a therapy session with PATIENT-{\Psi}.
no code implementations • 1 May 2024 • Zhicheng Zhang, Yancheng Liang, Yi Wu, Fei Fang
It learns to explore by first identifying the agents' high-rewarding joint state-action subspace from training tasks and then learning a set of diverse exploration policies to "cover" the subspace.
no code implementations • 19 Feb 2024 • Sameer Jain, Sedrick Scott Keh, Shova Chettri, Karun Dewan, Pablo Izquierdo, Johanna Prussman, Pooja Shreshtha, Cesar Suarez, Zheyuan Ryan Shi, Lei LI, Fei Fang
Environmental conservation organizations routinely monitor news content on conservation in protected areas to maintain situational awareness of developments that can have an environmental impact.
1 code implementation • 12 Feb 2024 • Steven Jecmen, Nihar B. Shah, Fei Fang, Leman Akoglu
A major threat to the peer-review systems of computer science conferences is the existence of "collusion rings" between reviewers.
no code implementations • 28 Nov 2023 • Zimeng Song, Chun Kai Ling, Fei Fang
We show that unlike prior work on multi-defender security games, the introduction of schedules can cause non-existence of equilibrium even under rather restricted environments.
no code implementations • 14 Nov 2023 • Rex Chen, Kathleen M. Carley, Fei Fang, Norman Sadeh
Traffic simulators are used to generate data for learning in intelligent transportation systems (ITSs).
1 code implementation • 6 Nov 2023 • Mateo Dulce Rubio, Siqi Zeng, Qi Wang, Didier Alvarado, Francisco Moreno, Hoda Heidari, Fei Fang
Landmines remain a threat to war-affected communities for years after conflicts have ended, partly due to the laborious nature of demining tasks.
no code implementations • 29 Oct 2023 • Zelai Xu, Chao Yu, Fei Fang, Yu Wang, Yi Wu
To mitigate the intrinsic bias in language actions, our agents use an LLM to perform deductive reasoning and generate a diverse set of action candidates.
no code implementations • 7 Oct 2023 • Jiayu Chen, Zelai Xu, Yunfei Li, Chao Yu, Jiaming Song, Huazhong Yang, Fei Fang, Yu Wang, Yi Wu
In this work, we present a novel subgame curriculum learning framework for zero-sum games.
no code implementations • 5 Aug 2023 • Qiaosong Qi, Le Zhuo, Aixi Zhang, Yue Liao, Fei Fang, Si Liu, Shuicheng Yan
To address these limitations, we present a novel cascaded motion diffusion model, DiffDance, designed for high-resolution, long-form dance generation.
1 code implementation • 30 Apr 2023 • Sedrick Scott Keh, Zheyuan Ryan Shi, David J. Patterson, Nirmal Bhagabati, Karun Dewan, Areendran Gopala, Pablo Izquierdo, Debojyoti Mallick, Ambika Sharma, Pooja Shrestha, Fei Fang
We introduce NewsPanda, a toolkit which automatically detects and analyzes online articles related to environmental conservation and infrastructure construction.
no code implementations • 12 Apr 2023 • Aravind Venugopal, Stephanie Milani, Fei Fang, Balaraman Ravindran
Unlike existing models, MABL is capable of encoding essential global information into the latent states during training while guaranteeing the decentralized execution of learned policies.
no code implementations • 2 Mar 2023 • Stephanie Milani, Arthur Juliani, Ida Momennejad, Raluca Georgescu, Jaroslaw Rzpecki, Alison Shaw, Gavin Costello, Fei Fang, Sam Devlin, Katja Hofmann
We aim to understand how people assess human likeness in navigation produced by people and artificially intelligent (AI) agents in a video game.
no code implementations • 28 Feb 2023 • Guoqiang Sun, Yibin Shen, Sijin Zhou, Xiang Chen, Hongyan Liu, Chunming Wu, Chenyi Lei, Xianhui Wei, Fei Fang
In this paper, we propose a cross-domain recommendation method: Self-supervised Interest Transfer Network (SITN), which can effectively transfer invariant knowledge between domains via prototypical contrastive learning.
no code implementations • 29 Dec 2022 • Chun Kai Ling, Fei Fang
Correlated Equilibrium is a solution concept that is more general than Nash Equilibrium (NE) and can lead to outcomes with better social welfare.
1 code implementation • 29 Dec 2022 • Chun Kai Ling, J. Zico Kolter, Fei Fang
Function approximation (FA) has been a critical component in solving large zero-sum games.
no code implementations • 7 Dec 2022 • Alexandre Belloni, Fei Fang, Alexander Volfovsky
In contrast to previous work, we approximate the relevant network interference patterns that lead to good estimates of the interference.
1 code implementation • ICCV 2023 • Le Zhuo, Zhaokai Wang, Baisen Wang, Yue Liao, Chenxi Bao, Stanley Peng, Songhao Han, Aixi Zhang, Fei Fang, Si Liu
We believe our dataset, benchmark model, and evaluation metric will boost the development of video background music generation.
1 code implementation • 15 Nov 2022 • Yue Guo, Joseph Campbell, Simon Stepputtis, Ruiyu Li, Dana Hughes, Fei Fang, Katia Sycara
This allows the student to self-reflect on what it has learned, enabling advice generalization and leading to improved sample efficiency and learning performance - even in environments where the teacher is sub-optimal.
Multi-agent Reinforcement Learning
reinforcement-learning
+3
1 code implementation • 18 Oct 2022 • Peide Huang, Mengdi Xu, Jiacheng Zhu, Laixi Shi, Fei Fang, Ding Zhao
Curriculum Reinforcement Learning (CRL) aims to create a sequence of tasks, starting from easy ones and gradually learning towards difficult tasks.
no code implementations • 24 Aug 2022 • Yuanliang Zhang, XiaoFeng Wang, Jinxin Hu, Ke Gao, Chenyi Lei, Fei Fang
we summarize three practical challenges which are not well solved for multi-scenario modeling: (1) Lacking of fine-grained and decoupled information transfer controls among multiple scenarios.
no code implementations • 11 Aug 2022 • Lixin Liu, Yanling Wang, Tianming Wang, Dong Guan, Jiawei Wu, Jingxu Chen, Rong Xiao, Wenxiang Zhu, Fei Fang
Therefore, it is crucial to perform cross-domain CTR prediction to transfer knowledge from large domains to small domains to alleviate the data sparsity issue.
no code implementations • 22 Jul 2022 • Steven Jecmen, Nihar B. Shah, Fei Fang, Vincent Conitzer
Many conferences rely on paper bidding as a key component of their reviewer assignment procedure.
1 code implementation • 24 Jun 2022 • Steven Jecmen, Minji Yoon, Vincent Conitzer, Nihar B. Shah, Fei Fang
The performance of these detection algorithms can be taken as a baseline for future research on detecting malicious bidding.
no code implementations • 23 Jun 2022 • Rex Chen, Fei Fang, Norman Sadeh
Traffic signal control (TSC) is a high-stakes domain that is growing in importance as traffic volume grows globally.
no code implementations • 25 May 2022 • Stephanie Milani, Zhicheng Zhang, Nicholay Topin, Zheyuan Ryan Shi, Charles Kamhoua, Evangelos E. Papalexakis, Fei Fang
The first algorithm, IVIPER, extends VIPER, a recent method for single-agent interpretable RL, to the multi-agent setting.
Multi-agent Reinforcement Learning
reinforcement-learning
+2
1 code implementation • 18 May 2022 • Fei Fang, Kunal Sinha, Noah D. Goodman, Christopher Potts, Elisa Kreiss
It seems likely that these patterns are shaped by the environment a speaker is exposed to in complex ways.
1 code implementation • 11 May 2022 • Lily Xu, Arpita Biswas, Fei Fang, Milind Tambe
Preventing poaching through ranger patrols protects endangered wildlife, directly contributing to the UN Sustainable Development Goal 15 of life on land.
1 code implementation • 30 Mar 2022 • Guan Yang, Minghuan Liu, Weijun Hong, Weinan Zhang, Fei Fang, Guangjun Zeng, Yue Lin
To this end, we characterize card and game features for DouDizhu to represent the perfect and imperfect information.
no code implementations • 19 Feb 2022 • Peide Huang, Mengdi Xu, Fei Fang, Ding Zhao
In this paper, we introduce a novel hierarchical formulation of robust RL - a general-sum Stackelberg game model called RRL-Stack - to formalize the sequential nature and provide extra flexibility for robust training.
no code implementations • 17 Feb 2022 • Stephanie Milani, Nicholay Topin, Manuela Veloso, Fei Fang
In this survey, we propose a novel taxonomy for organizing the XRL literature that prioritizes the RL setting.
no code implementations • 4 Oct 2021 • Hoon Oh, Yanhan Tang, Zong Zhang, Alexandre Jacquillat, Fei Fang
Unlike commercial ridesharing, non-commercial peer-to-peer (P2P) ridesharing has been subject to limited research -- although it can promote viable solutions in non-urban communities.
1 code implementation • 21 Aug 2021 • Weizhe Chen, Zihan Zhou, Yi Wu, Fei Fang
One practical requirement in solving dynamic games is to ensure that the players play well from any decision point onward.
1 code implementation • 13 Aug 2021 • Steven Jecmen, Hanrui Zhang, Ryan Liu, Fei Fang, Vincent Conitzer, Nihar B. Shah
Many scientific conferences employ a two-phase paper review process, where some papers are assigned additional reviewers after the initial reviews are submitted.
1 code implementation • 15 Jun 2021 • Lily Xu, Andrew Perrault, Fei Fang, Haipeng Chen, Milind Tambe
We formulate the problem as a game between the defender and nature who controls the parameter values of the adversarial behavior and design an algorithm MIRROR to find a robust policy.
1 code implementation • 16 Apr 2021 • Elisa Kreiss, Fei Fang, Noah D. Goodman, Christopher Potts
Current deep learning models often achieve excellent results on benchmark image-to-text datasets but fail to generate texts that are useful in practice.
2 code implementations • ICLR 2021 • Zhenggang Tang, Chao Yu, Boyuan Chen, Huazhe Xu, Xiaolong Wang, Fei Fang, Simon Du, Yu Wang, Yi Wu
We propose a simple, general and effective technique, Reward Randomization for discovering diverse strategic policies in complex multi-agent games.
no code implementations • 25 Feb 2021 • Nicholay Topin, Stephanie Milani, Fei Fang, Manuela Veloso
Because of this decision tree equivalence, any function approximator can be used during training, including a neural network, while yielding a decision tree policy for the base MDP.
1 code implementation • NeurIPS 2020 • Chun Kai Ling, Fei Fang, J. Zico Kolter
A central problem in machine learning and statistics is to model joint densities of random variables from data.
2 code implementations • 14 Sep 2020 • Lily Xu, Elizabeth Bondi, Fei Fang, Andrew Perrault, Kai Wang, Milind Tambe
Conservation efforts in green security domains to protect wildlife and forests are constrained by the limited availability of defenders (i. e., patrollers), who must patrol vast areas to protect from attackers (e. g., poachers or illegal loggers).
1 code implementation • 26 Aug 2020 • Zheyuan Ryan Shi, Zhiwei Steven Wu, Rayid Ghani, Fei Fang
In this paper, we introduce bandit data-driven optimization, the first iterative prediction-prescription framework to address these pain points.
no code implementations • 5 Aug 2020 • Jingxing Jiang, Zhubin Wang, Fei Fang, Binqiang Zhao
Critical as is to improve the online shopping experience for customers and merchants, how to find a proper approach for user intent prediction are paid great attention in both industry and academia.
2 code implementations • NeurIPS 2020 • Steven Jecmen, Hanrui Zhang, Ryan Liu, Nihar B. Shah, Vincent Conitzer, Fei Fang
We further consider the problem of restricting the joint probability that certain suspect pairs of reviewers are assigned to certain papers, and show that this problem is NP-hard for arbitrary constraints on these joint probabilities but efficiently solvable for a practical special case.
1 code implementation • ICLR 2020 • Qian Long, Zihan Zhou, Abhibav Gupta, Fei Fang, Yi Wu, Xiaolong Wang
In multi-agent games, the complexity of the environment can grow exponentially as the number of agents increases, so it is particularly challenging to learn good policies when the agent population is large.
Multi-agent Reinforcement Learning
reinforcement-learning
+2
no code implementations • 7 Jan 2020 • Zheyuan Ryan Shi, Claire Wang, Fei Fang
Artificial intelligence for social good (AI4SG) is a research theme that aims to use and advance artificial intelligence to address societal issues and improve the well-being of the world.
no code implementations • 16 Dec 2019 • Andrew Perrault, Fei Fang, Arunesh Sinha, Milind Tambe
With the maturing of AI and multiagent systems research, we have a tremendous opportunity to direct these advances towards addressing complex societal problems.
no code implementations • NeurIPS 2019 • Gabriele Farina, Chun Kai Ling, Fei Fang, Tuomas Sandholm
We show that a regret minimizer can be designed for a scaled extension of any two convex sets, and that from the decomposition we then obtain a global regret minimizer.
no code implementations • 10 Sep 2019 • Liheng Chen, Hongyi Guo, Yali Du, Fei Fang, Haifeng Zhang, Yaoming Zhu, Ming Zhou, Wei-Nan Zhang, Qing Wang, Yong Yu
Although existing works formulate this problem into a centralized learning with decentralized execution framework, which avoids the non-stationary problem in training, their decentralized execution paradigm limits the agents' capability to coordinate.
Multi-agent Reinforcement Learning
reinforcement-learning
+2
no code implementations • 20 Jul 2019 • Taoan Huang, Bohui Fang, Xiaohui Bei, Fei Fang
Transportation service providers that dispatch drivers and vehicles to riders start to support both on-demand ride requests posted in real time and rides scheduled in advance, leading to new challenges which, to the best of our knowledge, have not been addressed by existing works.
no code implementations • 13 May 2019 • Zheyuan Ryan Shi, Ariel D. Procaccia, Kevin S. Chan, Sridhar Venkatesan, Noam Ben-Asher, Nandi O. Leslie, Charles Kamhoua, Fei Fang
In order to formally reason about deception, we introduce the feature deception problem (FDP), a domain-independent model and present a learning and planning framework for finding the optimal deception strategy, taking into account the adversary's preferences which are initially unknown to the defender.
no code implementations • 11 Mar 2019 • Chun Kai Ling, Fei Fang, J. Zico Kolter
With the recent advances in solving large, zero-sum extensive form games, there is a growing interest in the inverse problem of inferring underlying game parameters given only access to agent actions.
no code implementations • 3 Jan 2019 • Zheyuan Ryan Shi, Aaron Schlenker, Brian Hay, Daniel Bittleston, Siyu Gao, Emily Peterson, John Trezza, Fei Fang
Cyber adversaries have increasingly leveraged social engineering attacks to breach large organizations and threaten the well-being of today's online users.
no code implementations • 6 Nov 2018 • Yufei Wang, Zheyuan Ryan Shi, Lantao Yu, Yi Wu, Rohit Singh, Lucas Joppa, Fei Fang
Green Security Games (GSGs) have been proposed and applied to optimize patrols conducted by law enforcement agencies in green security domains such as combating poaching, illegal logging and overfishing.
1 code implementation • 10 Jun 2018 • Aaron M. Roth, Umang Bhatt, Tamara Amin, Afsaneh Doryab, Fei Fang, Manuela Veloso
In this pilot study, we investigate (1) in what way a robot can express a certain mood to influence a human's decision making behavioral model; (2) how and to what extent the human will be influenced in a game theoretic setting.
1 code implementation • 7 May 2018 • Chun Kai Ling, Fei Fang, J. Zico Kolter
Although recent work in AI has made great progress in solving large, zero-sum, extensive-form games, the underlying assumption in most past work is that the parameters of the game itself are known to the agents.
no code implementations • 5 May 2018 • Zheyuan Ryan Shi, Ziye Tang, Long Tran-Thanh, Rohit Singh, Fei Fang
We study Stackelberg Security Games where the defender, in addition to allocating defensive resources to protect targets from the attacker, can strategically manipulate the attacker's payoff under budget constraints in weighted L^p-norm form regarding the amount of change.