Search Results for author: Yang Dai

Found 5 papers, 2 papers with code

TeamLoRA: Boosting Low-Rank Adaptation with Expert Collaboration and Competition

1 code implementation19 Aug 2024 Tianwei Lin, Jiang Liu, Wenqiao Zhang, Zhaocheng Li, Yang Dai, Haoyuan Li, Zhelun Yu, Wanggui He, Juncheng Li, Hao Jiang, Siliang Tang, Yueting Zhuang

Considering this, we introduce an innovative PEFT method, TeamLoRA, consisting of a collaboration and competition module for experts, and thus achieving the right balance of effectiveness and efficiency: (i) For collaboration, a novel knowledge-sharing and -organizing mechanism is devised to appropriately reduce the scale of matrix operations, thereby boosting the training and inference speed.

Multi-Task Learning parameter-efficient fine-tuning +1

Is Mamba Compatible with Trajectory Optimization in Offline Reinforcement Learning?

no code implementations20 May 2024 Yang Dai, Oubo Ma, Longfei Zhang, Xingxing Liang, Shengchao Hu, Mengzhu Wang, Shouling Ji, Jincai Huang, Li Shen

Transformer-based trajectory optimization methods have demonstrated exceptional performance in offline Reinforcement Learning (offline RL), yet it poses challenges due to substantial parameter size and limited scalability, which is particularly critical in sequential decision-making scenarios where resources are constrained such as in robots and drones with limited computational power.

Atari Games Mamba +3

SUB-PLAY: Adversarial Policies against Partially Observed Multi-Agent Reinforcement Learning Systems

1 code implementation6 Feb 2024 Oubo Ma, Yuwen Pu, Linkang Du, Yang Dai, Ruo Wang, Xiaolei Liu, Yingcai Wu, Shouling Ji

Furthermore, we evaluate three potential defenses aimed at exploring ways to mitigate security threats posed by adversarial policies, providing constructive recommendations for deploying MARL in competitive environments.

Multi-agent Reinforcement Learning

Information retrieval in single cell chromatin analysis using TF-IDF transformation methods

no code implementations10 Dec 2022 Mehrdad Zandigohar, Yang Dai

We compared several scenarios for transformation and dimension reduction as well as the SVD-based feature analysis to investigate potential enhancements in scATAC-seq information retrieval.

Dimensionality Reduction Information Retrieval +1

Reference-Aided Part-Aligned Feature Disentangling for Video Person Re-Identification

no code implementations21 Mar 2021 Guoqing Zhang, Yuhao Chen, Yang Dai, yuhui Zheng, Yi Wu

Due to the inaccurate person detections and pose changes, pedestrian misalignment significantly increases the difficulty of feature extraction and matching.

Video-Based Person Re-Identification

Cannot find the paper you are looking for? You can Submit a new open access paper.