no code implementations • 12 Mar 2024 • Yunpeng Qing, Shunyu Liu, Jingyuan Cong, KaiXuan Chen, Yihe Zhou, Mingli Song
Offline Reinforcement Learning (RL) endeavors to leverage offline datasets to craft effective agent policy without online interaction, which imposes proper conservative constraints with the support of behavior policies to tackle the Out-Of-Distribution (OOD) problem.
no code implementations • 5 Jan 2024 • KaiXuan Chen, Wei Luo, Shunyu Liu, Yaoquan Wei, Yihe Zhou, Yunpeng Qing, Quan Zhang, Jie Song, Mingli Song
In this paper, we present a novel transformer architecture tailored for learning robust power system state representations, which strives to optimize power dispatch for the power flow adjustment across different transmission sections.
no code implementations • 28 Nov 2023 • Yaoquan Wei, Shunyu Liu, Jie Song, Tongya Zheng, KaiXuan Chen, Yong Wang, Mingli Song
Instead, we employ a proxy model to extract state features that are both discriminative (adaptive to the agent) and generally applicable (robust to agent noise).
1 code implementation • 8 Sep 2023 • Arian Prabowo, KaiXuan Chen, Hao Xue, Subbu Sethuvenkatraman, Flora D. Salim
In traditional deep learning algorithms, one of the key assumptions is that the data distribution remains constant during both training and deployment.
1 code implementation • 5 Aug 2023 • Yuwen Wang, Shunyu Liu, KaiXuan Chen, Tongtian Zhu, Ji Qiao, Mengjie Shi, Yuanyu Wan, Mingli Song
Graph Lottery Ticket (GLT), a combination of core subgraph and sparse subnetwork, has been proposed to mitigate the computational cost of deep Graph Neural Networks (GNNs) on large input graphs while preserving original performance.
no code implementations • 15 Jun 2023 • Yu Wang, Tongya Zheng, Shunyu Liu, KaiXuan Chen, Zunlei Feng, Yunzhi Hao, Mingli Song
The human mobility simulation task aims to generate human mobility trajectories given a small set of trajectory data, which have aroused much concern due to the scarcity and sparsity of human mobility data.
1 code implementation • 10 Jun 2023 • Arian Prabowo, KaiXuan Chen, Hao Xue, Subbu Sethuvenkatraman, Flora D. Salim
One of the primary reasons for this is the shift in distribution of occupancy patterns, with many people working or learning from home.
1 code implementation • 5 Jun 2023 • Tongtian Zhu, Fengxiang He, KaiXuan Chen, Mingli Song, DaCheng Tao
Decentralized stochastic gradient descent (D-SGD) allows collaborative learning on massive devices simultaneously without the control of a central server.
no code implementations • 3 Jun 2023 • Wenda Li, KaiXuan Chen, Shunyu Liu, Wenjie Huang, Haofei Zhang, Yingjie Tian, Yun Su, Mingli Song
In this paper, we strive to develop an interpretable GNNs' inference paradigm, termed MSInterpreter, which can serve as a plug-and-play scheme readily applicable to various GNNs' baselines.
1 code implementation • 31 May 2023 • KaiXuan Chen, Shunyu Liu, Tongtian Zhu, Tongya Zheng, Haofei Zhang, Zunlei Feng, Jingwen Ye, Mingli Song
Graph Neural Networks (GNNs) have emerged as a powerful category of learning architecture for handling graph-structured data.
1 code implementation • 27 May 2023 • Yihe Zhou, Shunyu Liu, Yunpeng Qing, KaiXuan Chen, Tongya Zheng, Yanhao Huang, Jie Song, Mingli Song
Despite the encouraging results achieved, CTDE makes an independence assumption on agent policies, which limits agents to adopt global cooperative information from each other during centralized training.
Multi-agent Reinforcement Learning reinforcement-learning +2
1 code implementation • 12 Mar 2023 • Haofei Zhang, Mengqi Xue, Xiaokang Liu, KaiXuan Chen, Jie Song, Mingli Song
In this paper, we study a novel inference paradigm, termed as schema inference, that learns to deductively infer the explainable predictions by rebuilding the prior deep neural network (DNN) forwarding scheme, guided by the prevalent philosophical cognitive concept of schema.
no code implementations • 20 Dec 2022 • Yunyao Cheng, Chenjuan Guo, KaiXuan Chen, Kai Zhao, Bin Yang, Jiandong Xie, Christian S. Jensen, Feiteng Huang, Kai Zheng
To capture the temporal and multivariate correlations among subsequences, we design a pattern discovery model, that constructs correlations via diverse pattern functions.
1 code implementation • 23 Nov 2022 • Shunyu Liu, Yihe Zhou, Jie Song, Tongya Zheng, KaiXuan Chen, Tongtian Zhu, Zunlei Feng, Mingli Song
Value Decomposition (VD) aims to deduce the contributions of agents for decentralized policies in the presence of only global rewards, and has recently emerged as a powerful credit assignment paradigm for tackling cooperative Multi-Agent Reinforcement Learning (MARL) problems.
no code implementations • 22 Aug 2022 • Dalin Zhang, KaiXuan Chen, Yan Zhao, Bin Yang, Lina Yao, Christian S. Jensen
A key challenge is that while the application of deep models often incurs substantial memory and computational costs, edge devices typically offer only very limited storage and computational capabilities that may vary substantially across devices.
1 code implementation • 8 Jul 2022 • Shunyu Liu, Jie Song, Yihe Zhou, Na Yu, KaiXuan Chen, Zunlei Feng, Mingli Song
In this work, we introduce a novel interactiOn Pattern disenTangling (OPT) method, to disentangle not only the joint value function into agent-wise value functions for decentralized execution, but also the entity interactions into interaction prototypes, each of which represents an underlying interaction pattern within a subgroup of the entities.
Multi-agent Reinforcement Learning reinforcement-learning +1
1 code implementation • 5 Jul 2022 • Shunyu Liu, KaiXuan Chen, Na Yu, Jie Song, Zunlei Feng, Mingli Song
Despite the promising results achieved, state-of-the-art interactive reinforcement learning schemes rely on passively receiving supervision signals from advisor experts, in the form of either continuous monitoring or pre-defined rules, which inevitably result in a cumbersome and expensive learning process.
1 code implementation • 12 May 2022 • KaiXuan Chen, Shunyu Liu, Na Yu, Rong Yan, Quan Zhang, Jie Song, Zunlei Feng, Mingli Song
As the topology of the power system is in the form of graph structure, graph neural network based representation learning is naturally suitable for learning the status of the power system.
no code implementations • 16 Dec 2021 • Gengshi Han, Shunyu Liu, KaiXuan Chen, Na Yu, Zunlei Feng, Mingli Song
This paper proposes a controllable sample generation framework based on Conditional Tabular Generative Adversarial Network (CTGAN) to generate specified transient stability samples.
1 code implementation • 29 Sep 2021 • KaiXuan Chen, Jie Song, Shunyu Liu, Na Yu, Zunlei Feng, Gengshi Han, Mingli Song
A DKEPool network de facto disassembles representation learning into two stages, structure learning and distribution learning.
no code implementations • 21 Jan 2020 • Kaixuan Chen, Dalin Zhang, Lina Yao, Bin Guo, Zhiwen Yu, Yunhao Liu
In this study, we present a survey of the state-of-the-art deep learning methods for sensor-based human activity recognition.
no code implementations • 22 May 2019 • Kaixuan Chen, Lina Yao, Dalin Zhang, Bin Guo, Zhiwen Yu
And the multiple agents in the proposed model represent activities with collective motions across body parts by independently selecting modalities associated with single motions.
no code implementations • 12 Nov 2018 • Kaixuan Chen, Lina Yao, Dalin Zhang, Xiaojun Chang, Guodong Long, Sen Wang
Semi-supervised learning is crucial for alleviating labelling burdens in people-centric sensing.
no code implementations • 17 May 2018 • Kaixuan Chen, Lina Yao, Xianzhi Wang, Dalin Zhang, Tao Gu, Zhiwen Yu, Zheng Yang
Multimodal features play a key role in wearable sensor-based human activity recognition (HAR).
no code implementations • 21 Nov 2017 • Kaixuan Chen, Lina Yao, Tao Gu, Zhiwen Yu, Xianzhi Wang, Dalin Zhang
Multimodal features play a key role in wearable sensor based Human Activity Recognition (HAR).