1 code implementation • 1 Feb 2025 • Yurui Li, Yuxuan Chen, Li Zhang, Shijian Li, Gang Pan
The significant role of division of labor (DOL) in promoting cooperation is widely recognized in real-world applications. Many cooperative multi-agent reinforcement learning (MARL) methods have incorporated the concept of DOL to improve cooperation among agents. However, the tasks used in existing testbeds typically correspond to tasks where DOL is often not a necessary feature for achieving optimal policies. Additionally, the full utilize of DOL concept in MARL methods remains unrealized due to the absence of appropriate tasks. To enhance the generality and applicability of MARL methods in real-world scenarios, there is a necessary to develop tasks that demand multi-agent DOL and cooperation. In this paper, we propose a series of tasks designed to meet these requirements, drawing on real-world rules as the guidance for their design. We guarantee that DOL and cooperation are necessary condition for completing tasks and introduce three factors to expand the diversity of proposed tasks to cover more realistic situations. We evaluate 10 cooperative MARL methods on the proposed tasks. The results indicate that all baselines perform poorly on these tasks. To further validate the solvability of these tasks, we also propose simplified variants of proposed tasks. Experimental results show that baselines are able to handle these simplified variants, providing evidence of the solvability of the proposed tasks. The source files is available at https://github. com/Yurui-Li/CTC.
1 code implementation • 26 Dec 2024 • Huiyuan Tian, Bonan Xu, Shijian Li, Gang Pan
In this paper, we propose SpectralKD, a novel unified analytical framework that offers deeper insights into ViTs and optimizes KD via spectral analysis.
Ranked #7 on
Knowledge Distillation
on ImageNet
no code implementations • 18 Dec 2024 • Jianyu Zhang, Li Zhang, Shijian Li
Visual Tokenization, trained by self-supervised reconstructive learning, compresses visual data by codebook in patch-level with marginal information loss, but the visual tokens does not have semantic meaning.
Open Vocabulary Semantic Segmentation
Open-Vocabulary Semantic Segmentation
+1
1 code implementation • 10 Dec 2024 • Jiquan Wang, Sha Zhao, Zhiling Luo, Yangxuan Zhou, Haiteng Jiang, Shijian Li, Tao Li, Gang Pan
Secondly, existing EEG foundation models have limited generalizability on a wide range of downstream BCI tasks due to varying formats of EEG data, making it challenging to adapt to.
no code implementations • 5 Dec 2024 • Yibin Liu, Jianyu Zhang, Li Zhang, Shijian Li, Gang Pan
The semantic label in priory describes the inherent distribution pattern with underlying cross-image relationships, which is supplement to the text description for understanding the full characteristics of image.
1 code implementation • 10 Jul 2024 • Tao Li, Ruihang Li, Huangnan Zheng, Shanding Ye, Shijian Li, Zhijie Pan
Additionally, we design new baselines that incorporate urban characteristics, such as road orientation and building density, in the process of generating road networks and building layouts using the RoBus dataset, enhancing the practicality of automated urban design.
1 code implementation • 13 Dec 2023 • Jiquan Wang, Sha Zhao, Haiteng Jiang, Shijian Li, Tao Li, Gang Pan
In this paper, we introduce domain generalization into automatic sleep staging and propose the task of generalizable sleep staging which aims to improve the model generalization ability to unseen datasets.
1 code implementation • 29 Sep 2023 • Huiyuan Tian, Li Zhang, Shijian Li, Min Yao, Gang Pan
We visualize this process using feature maps, and further demonstrate the rationality and effectiveness of this design using proposed novel Fourier spectral analysis methods.
no code implementations • 9 Sep 2022 • Yan Cai, Shijian Li, Wei zhang, Hao Wu, Xu-Ri Yao, Qing Zhao
Hadamard single-pixel imaging (HSI) is an appealing imaging technique due to its features of low hardware complexity and industrial cost.
no code implementations • 15 Jun 2021 • Long Yang, Zhao Li, Zehong Hu, Shasha Ruan, Shijian Li, Gang Pan, Hongyang Chen
In this paper, we propose a Thompson Sampling algorithm for \emph{unimodal} bandits, where the expected reward is unimodal over the partially ordered arms.
no code implementations • 22 Apr 2021 • Yuxuan Chen, Li Zhang, Shijian Li, Gang Pan
Optimization of deep learning algorithms to approach Nash Equilibrium remains a significant problem in imperfect information games, e. g. StarCraft and poker.
1 code implementation • 16 Apr 2021 • Shijian Li, Oren Mangoubi, Lijie Xu, Tian Guo
Further, we observe that Sync-Switch achieves 3. 8% higher converged accuracy with just 1. 23X the training time compared to training with ASP.
1 code implementation • 7 Apr 2020 • Shijian Li, Robert J. Walls, Tian Guo
However, it is challenging to determine the appropriate cluster configuration---e. g., server type and number---for different training workloads while balancing the trade-offs in training time, cost, and model accuracy.
1 code implementation • 5 Dec 2019 • Matthew LeMay, Shijian Li, Tian Guo
Leveraging Perseus, we evaluated the inference throughput and cost for serving various models and demonstrated that multi-tenant model serving led to up to 12% cost reduction.
no code implementations • 31 Jul 2019 • Pablo Samuel Castro, Shijian Li, Daqing Zhang
We consider the problem of learning to behave optimally in a Markov Decision Process when a reward function is not specified, but instead we have access to a set of demonstrators of varying performance.
no code implementations • 1 Jul 2019 • Longxiang Shi, Shijian Li, Longbing Cao, Long Yang, Gang Zheng, Gang Pan
Alternatively, derivative-based methods treat the optimization process as a blackbox and show robustness and stability in learning continuous control tasks, but not data efficient in learning.
no code implementations • 17 May 2019 • Longxiang Shi, Shijian Li, Longbing Cao, Long Yang, Gang Pan
However, existing off-policy learning methods based on probabilistic policy measurement are inefficient when utilizing traces under a greedy target policy, which is ineffective for control problems.
no code implementations • 22 Mar 2019 • Li Zhang, Wei Wang, Shijian Li, Gang Pan
Experimentally, we demonstrate that the proposed Monte Carlo Neural Fictitious Self Play can converge to approximate Nash equilibrium in games with large-scale search depth while the Neural Fictitious Self Play can't.
no code implementations • 28 Feb 2019 • Shijian Li, Robert J. Walls, Lijie Xu, Tian Guo
Distributed training frameworks, like TensorFlow, have been proposed as a means to reduce the training time of deep learning models by using a cluster of GPU servers.
no code implementations • 25 Feb 2019 • Li Zhang, Weichen Shen, Shijian Li, Gang Pan
This model can have strong second order feature interactive learning ability like Field-aware Factorization Machine, on this basis, deep neural network is used for higher-order feature combination learning.
no code implementations • 21 Feb 2018 • Nan Zhou, Li Zhang, Shijian Li, Zhijian Wang
In application, we hope, the frameworks, the algorithm design as well as the experiment environment illustrated in this work, can be an incubator or a test bed for researchers and policymakers to handle the emerging algorithmic collusion.