Search Results for author: Dapeng Li

Found 21 papers, 0 papers with code

KnowledgeNavigator: Leveraging Large Language Models for Enhanced Reasoning over Knowledge Graph

no code implementations26 Dec 2023 Tiezheng Guo, Qingwen Yang, Chen Wang, Yanyi Liu, Pan Li, Jiawei Tang, Dapeng Li, Yingyou Wen

Large language model (LLM) has achieved outstanding performance on various downstream tasks with its powerful natural language understanding and zero-shot capability, but LLM still suffers from knowledge limitation.

Hallucination Language Modelling +3

Adaptive parameter sharing for multi-agent reinforcement learning

no code implementations14 Dec 2023 Dapeng Li, Na Lou, Bin Zhang, Zhiwei Xu, Guoliang Fan

Parameter sharing, as an important technique in multi-agent systems, can effectively solve the scalability issue in large-scale agent problems.

Multi-agent Reinforcement Learning reinforcement-learning

Controlling Large Language Model-based Agents for Large-Scale Decision-Making: An Actor-Critic Approach

no code implementations23 Nov 2023 Bin Zhang, Hangyu Mao, Jingqing Ruan, Ying Wen, Yang Li, Shao Zhang, Zhiwei Xu, Dapeng Li, Ziyue Li, Rui Zhao, Lijuan Li, Guoliang Fan

The remarkable progress in Large Language Models (LLMs) opens up new avenues for addressing planning and decision-making problems in Multi-Agent Systems (MAS).

Decision Making Hallucination +3

Stackelberg Decision Transformer for Asynchronous Action Coordination in Multi-Agent Systems

no code implementations13 May 2023 Bin Zhang, Hangyu Mao, Lijuan Li, Zhiwei Xu, Dapeng Li, Rui Zhao, Guoliang Fan

Our research contributes to the development of an effective and adaptable asynchronous action coordination method that can be widely applied to various task types and environmental configurations in MAS.

Decision Making Multi-agent Reinforcement Learning

From Explicit Communication to Tacit Cooperation:A Novel Paradigm for Cooperative MARL

no code implementations28 Apr 2023 Dapeng Li, Zhiwei Xu, Bin Zhang, Guoliang Fan

Centralized training with decentralized execution (CTDE) is a widely-used learning paradigm that has achieved significant success in complex tasks.

SEA: A Spatially Explicit Architecture for Multi-Agent Reinforcement Learning

no code implementations25 Apr 2023 Dapeng Li, Zhiwei Xu, Bin Zhang, Guoliang Fan

In addition, our structure can be applied to various existing mainstream reinforcement learning algorithms with minor modifications and can deal with the problem with a variable number of agents.

Multi-agent Reinforcement Learning reinforcement-learning

Inducing Stackelberg Equilibrium through Spatio-Temporal Sequential Decision-Making in Multi-Agent Reinforcement Learning

no code implementations20 Apr 2023 Bin Zhang, Lijuan Li, Zhiwei Xu, Dapeng Li, Guoliang Fan

In multi-agent reinforcement learning (MARL), self-interested agents attempt to establish equilibrium and achieve coordination depending on game structure.

Decision Making Multi-agent Reinforcement Learning

An Information-Theoretic Analysis of Discrete-Time Control and Filtering Limitations by the I-MMSE Relationships

no code implementations18 Apr 2023 Neng Wan, Dapeng Li, Naira Hovakimyan, Petros G. Voulgaris

Fundamental limitations or performance trade-offs/limits are important properties and constraints of control and filtering systems.

Style Miner: Find Significant and Stable Explanatory Factors in Time Series with Constrained Reinforcement Learning

no code implementations21 Mar 2023 Dapeng Li, Feiyang Pan, Jia He, Zhiwei Xu, Dandan Tu, Guoliang Fan

In high-dimensional time-series analysis, it is essential to have a set of key factors (namely, the style factors) that explain the change of the observed variable.

Time Series Time Series Analysis

Consensus Learning for Cooperative Multi-Agent Reinforcement Learning

no code implementations6 Jun 2022 Zhiwei Xu, Bin Zhang, Dapeng Li, Zeren Zhang, Guangchong Zhou, Hao Chen, Guoliang Fan

Almost all multi-agent reinforcement learning algorithms without communication follow the principle of centralized training with decentralized execution.

Contrastive Learning Multi-agent Reinforcement Learning +2

Mingling Foresight with Imagination: Model-Based Cooperative Multi-Agent Reinforcement Learning

no code implementations20 Apr 2022 Zhiwei Xu, Dapeng Li, Bin Zhang, Yuan Zhan, Yunpeng Bai, Guoliang Fan

Recently, model-based agents have achieved better performance than model-free ones using the same computational budget and training time in single-agent environments.

Multi-agent Reinforcement Learning reinforcement-learning +1

Simplified Analysis on Filtering Sensitivity Trade-offs in Continuous- and Discrete-Time Systems

no code implementations8 Apr 2022 Neng Wan, Dapeng Li, Lin Song, Naira Hovakimyan

A simplified analysis is performed on the Bode-type filtering sensitivity trade-off integrals, which capture the sensitivity characteristics of the estimate and estimation error with respect to the process input and estimated signal in continuous- and discrete-time linear time-invariant filtering systems.

Cooperative Stochastic Multi-agent Multi-armed Bandits Robust to Adversarial Corruptions

no code implementations8 Jun 2021 Junyan Liu, Shuai Li, Dapeng Li

Our algorithm not only achieves near-optimal regret in the stochastic setting, but also obtains a regret with an additive term of corruption in the corrupted setting, while maintaining efficient communication.

Multi-Armed Bandits Open-Ended Question Answering

SIDE: State Inference for Partially Observable Cooperative Multi-Agent Reinforcement Learning

no code implementations13 May 2021 Zhiwei Xu, Yunpeng Bai, Dapeng Li, Bin Zhang, Guoliang Fan

As one of the solutions to the decentralized partially observable Markov decision process (Dec-POMDP) problems, the value decomposition method has achieved significant results recently.

Multi-agent Reinforcement Learning reinforcement-learning +3

Squeezed-field path-integral description of BCS superconductors

no code implementations24 Feb 2021 Kazuma Nagao, Dapeng Li, Ludwig Mathey

We develop a squeezed-field path-integral representation for BCS superconductors utilizing a generalized completeness relation of squeezed-fermionic coherent states.

Superconductivity

f-Divergence Variational Inference

no code implementations NeurIPS 2020 Neng Wan, Dapeng Li, Naira Hovakimyan

This paper introduces the $f$-divergence variational inference ($f$-VI) that generalizes variational inference to all $f$-divergences.

Stochastic Optimization Variational Inference

OSU Multimodal Machine Translation System Report

no code implementations WS 2017 Mingbo Ma, Dapeng Li, Kai Zhao, Liang Huang

This paper describes Oregon State University's submissions to the shared WMT'17 task "multimodal translation task I".

Image Captioning Multimodal Machine Translation +2

Cannot find the paper you are looking for? You can Submit a new open access paper.