Search Results for author: Shijian Li

Found 21 papers, 9 papers with code

The Composite Task Challenge for Cooperative Multi-Agent Reinforcement Learning

1 code implementation1 Feb 2025 Yurui Li, Yuxuan Chen, Li Zhang, Shijian Li, Gang Pan

The significant role of division of labor (DOL) in promoting cooperation is widely recognized in real-world applications. Many cooperative multi-agent reinforcement learning (MARL) methods have incorporated the concept of DOL to improve cooperation among agents. However, the tasks used in existing testbeds typically correspond to tasks where DOL is often not a necessary feature for achieving optimal policies. Additionally, the full utilize of DOL concept in MARL methods remains unrealized due to the absence of appropriate tasks. To enhance the generality and applicability of MARL methods in real-world scenarios, there is a necessary to develop tasks that demand multi-agent DOL and cooperation. In this paper, we propose a series of tasks designed to meet these requirements, drawing on real-world rules as the guidance for their design. We guarantee that DOL and cooperation are necessary condition for completing tasks and introduce three factors to expand the diversity of proposed tasks to cover more realistic situations. We evaluate 10 cooperative MARL methods on the proposed tasks. The results indicate that all baselines perform poorly on these tasks. To further validate the solvability of these tasks, we also propose simplified variants of proposed tasks. Experimental results show that baselines are able to handle these simplified variants, providing evidence of the solvability of the proposed tasks. The source files is available at https://github. com/Yurui-Li/CTC.

Multi-agent Reinforcement Learning

SpectralKD: A Unified Framework for Interpreting and Distilling Vision Transformers via Spectral Analysis

1 code implementation26 Dec 2024 Huiyuan Tian, Bonan Xu, Shijian Li, Gang Pan

In this paper, we propose SpectralKD, a novel unified analytical framework that offers deeper insights into ViTs and optimizes KD via spectral analysis.

Knowledge Distillation Transfer Learning

Incorporating Feature Pyramid Tokenization and Open Vocabulary Semantic Segmentation

no code implementations18 Dec 2024 Jianyu Zhang, Li Zhang, Shijian Li

Visual Tokenization, trained by self-supervised reconstructive learning, compresses visual data by codebook in patch-level with marginal information loss, but the visual tokens does not have semantic meaning.

Open Vocabulary Semantic Segmentation Open-Vocabulary Semantic Segmentation +1

CBraMod: A Criss-Cross Brain Foundation Model for EEG Decoding

1 code implementation10 Dec 2024 Jiquan Wang, Sha Zhao, Zhiling Luo, Yangxuan Zhou, Haiteng Jiang, Shijian Li, Tao Li, Gang Pan

Secondly, existing EEG foundation models have limited generalizability on a wide range of downstream BCI tasks due to varying formats of EEG data, making it challenging to adapt to.

EEG Eeg Decoding

A Framework For Image Synthesis Using Supervised Contrastive Learning

no code implementations5 Dec 2024 Yibin Liu, Jianyu Zhang, Li Zhang, Shijian Li, Gang Pan

The semantic label in priory describes the inherent distribution pattern with underlying cross-image relationships, which is supplement to the text description for understanding the full characteristics of image.

Contrastive Learning Generative Adversarial Network +2

RoBus: A Multimodal Dataset for Controllable Road Networks and Building Layouts Generation

1 code implementation10 Jul 2024 Tao Li, Ruihang Li, Huangnan Zheng, Shanding Ye, Shijian Li, Zhijie Pan

Additionally, we design new baselines that incorporate urban characteristics, such as road orientation and building density, in the process of generating road networks and building layouts using the RoBus dataset, enhancing the practicality of automated urban design.

Autonomous Driving

Generalizable Sleep Staging via Multi-Level Domain Alignment

1 code implementation13 Dec 2023 Jiquan Wang, Sha Zhao, Haiteng Jiang, Shijian Li, Tao Li, Gang Pan

In this paper, we introduce domain generalization into automatic sleep staging and propose the task of generalizable sleep staging which aims to improve the model generalization ability to unseen datasets.

Domain Generalization Sleep Staging

Multi-Depth Branch Network for Efficient Image Super-Resolution

1 code implementation29 Sep 2023 Huiyuan Tian, Li Zhang, Shijian Li, Min Yao, Gang Pan

We visualize this process using feature maps, and further demonstrate the rationality and effectiveness of this design using proposed novel Fourier spectral analysis methods.

Image Super-Resolution

A detail-enhanced sampling strategy in Hadamard single-pixel imaging

no code implementations9 Sep 2022 Yan Cai, Shijian Li, Wei zhang, Hao Wu, Xu-Ri Yao, Qing Zhao

Hadamard single-pixel imaging (HSI) is an appealing imaging technique due to its features of low hardware complexity and industrial cost.

compressed sensing Image Reconstruction

Thompson Sampling for Unimodal Bandits

no code implementations15 Jun 2021 Long Yang, Zhao Li, Zehong Hu, Shasha Ruan, Shijian Li, Gang Pan, Hongyang Chen

In this paper, we propose a Thompson Sampling algorithm for \emph{unimodal} bandits, where the expected reward is unimodal over the partially ordered arms.

Thompson Sampling

Optimize Neural Fictitious Self-Play in Regret Minimization Thinking

no code implementations22 Apr 2021 Yuxuan Chen, Li Zhang, Shijian Li, Gang Pan

Optimization of deep learning algorithms to approach Nash Equilibrium remains a significant problem in imperfect information games, e. g. StarCraft and poker.

Starcraft

Sync-Switch: Hybrid Parameter Synchronization for Distributed Deep Learning

1 code implementation16 Apr 2021 Shijian Li, Oren Mangoubi, Lijie Xu, Tian Guo

Further, we observe that Sync-Switch achieves 3. 8% higher converged accuracy with just 1. 23X the training time compared to training with ASP.

Deep Learning

Characterizing and Modeling Distributed Training with Transient Cloud GPU Servers

1 code implementation7 Apr 2020 Shijian Li, Robert J. Walls, Tian Guo

However, it is challenging to determine the appropriate cluster configuration---e. g., server type and number---for different training workloads while balancing the trade-offs in training time, cost, and model accuracy.

Perseus: Characterizing Performance and Cost of Multi-Tenant Serving for CNN Models

1 code implementation5 Dec 2019 Matthew LeMay, Shijian Li, Tian Guo

Leveraging Perseus, we evaluated the inference throughput and cost for serving various models and demonstrated that multi-tenant model serving led to up to 12% cost reduction.

Inverse Reinforcement Learning with Multiple Ranked Experts

no code implementations31 Jul 2019 Pablo Samuel Castro, Shijian Li, Daqing Zhang

We consider the problem of learning to behave optimally in a Markov Decision Process when a reward function is not specified, but instead we have access to a set of demonstrators of varying performance.

reinforcement-learning Reinforcement Learning +1

FiDi-RL: Incorporating Deep Reinforcement Learning with Finite-Difference Policy Search for Efficient Learning of Continuous Control

no code implementations1 Jul 2019 Longxiang Shi, Shijian Li, Longbing Cao, Long Yang, Gang Zheng, Gang Pan

Alternatively, derivative-based methods treat the optimization process as a blackbox and show robustness and stability in learning continuous control tasks, but not data efficient in learning.

continuous-control Continuous Control +3

TBQ($σ$): Improving Efficiency of Trace Utilization for Off-Policy Reinforcement Learning

no code implementations17 May 2019 Longxiang Shi, Shijian Li, Longbing Cao, Long Yang, Gang Pan

However, existing off-policy learning methods based on probabilistic policy measurement are inefficient when utilizing traces under a greedy target policy, which is ineffective for control problems.

reinforcement-learning Reinforcement Learning +1

Monte Carlo Neural Fictitious Self-Play: Approach to Approximate Nash equilibrium of Imperfect-Information Games

no code implementations22 Mar 2019 Li Zhang, Wei Wang, Shijian Li, Gang Pan

Experimentally, we demonstrate that the proposed Monte Carlo Neural Fictitious Self Play can converge to approximate Nash equilibrium in games with large-scale search depth while the Neural Fictitious Self Play can't.

Speeding up Deep Learning with Transient Servers

no code implementations28 Feb 2019 Shijian Li, Robert J. Walls, Lijie Xu, Tian Guo

Distributed training frameworks, like TensorFlow, have been proposed as a means to reduce the training time of deep learning models by using a cluster of GPU servers.

Deep Learning

Field-aware Neural Factorization Machine for Click-Through Rate Prediction

no code implementations25 Feb 2019 Li Zhang, Weichen Shen, Shijian Li, Gang Pan

This model can have strong second order feature interactive learning ability like Field-aware Factorization Machine, on this basis, deep neural network is used for higher-order feature combination learning.

Click-Through Rate Prediction Feature Engineering +2

Algorithmic Collusion in Cournot Duopoly Market: Evidence from Experimental Economics

no code implementations21 Feb 2018 Nan Zhou, Li Zhang, Shijian Li, Zhijian Wang

In application, we hope, the frameworks, the algorithm design as well as the experiment environment illustrated in this work, can be an incubator or a test bed for researchers and policymakers to handle the emerging algorithmic collusion.

Cannot find the paper you are looking for? You can Submit a new open access paper.