no code implementations • 18 Apr 2024 • Jiaqi Li, Xiaobo Wang, Wentao Ding, ZiHao Wang, Yipeng Kang, Zixia Jia, Zilong Zheng
We introduce an innovative RAG-based framework with an ever-improving memory.
no code implementations • 26 Oct 2022 • Yipeng Kang, Tonghan Wang, Xiaoran Wu, Qianlan Yang, Chongjie Zhang
Value decomposition multi-agent reinforcement learning methods learn the global value function as a mixing of each agent's individual utility functions.
no code implementations • NeurIPS 2020 • Yipeng Kang, Tonghan Wang, Gerard de Melo
Emergentism and pragmatics are two research fields that study the dynamics of linguistic communication along substantially different timescales and intelligence levels.
Multi-agent Reinforcement Learning, Reinforcement Learning (RL)