1 code implementation • 22 Apr 2024 • Dengchun Li, Yingzi Ma, Naizheng Wang, Zhiyuan Cheng, Lei Duan, Jie Zuo, Cal Yang, Mingjie Tang
Unlike other LoRA-based MoE methods, MixLoRA improves model performance by using independently configurable attention-layer LoRA adapters, supporting LoRA and its variants for constructing experts, and applying an auxiliary load-balance loss to mitigate the router's imbalance problem.
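The auxiliary load-balance loss mentioned above can be sketched as follows. This is a hedged illustration in the common Switch-Transformer style (scale the product of each expert's token fraction and mean router probability); MixLoRA's exact formulation may differ, and all names here (`load_balance_loss`, `router_logits`, `top_k`) are illustrative, not from the paper.

```python
import numpy as np

def load_balance_loss(router_logits, num_experts, top_k=2):
    """Sketch of an auxiliary load-balancing loss for an MoE router.

    router_logits: array of shape (num_tokens, num_experts) with raw
    router scores. The loss is minimized when tokens are dispatched
    evenly across experts, discouraging router collapse onto a few.
    """
    # Softmax over the expert dimension (numerically stabilized).
    z = router_logits - router_logits.max(axis=-1, keepdims=True)
    probs = np.exp(z) / np.exp(z).sum(axis=-1, keepdims=True)
    # Top-k dispatch mask: 1 where a token is routed to an expert.
    top_idx = np.argsort(-probs, axis=-1)[:, :top_k]
    mask = np.zeros_like(probs)
    np.put_along_axis(mask, top_idx, 1.0, axis=-1)
    frac_tokens = mask.mean(axis=0)  # f_i: fraction of tokens per expert
    mean_prob = probs.mean(axis=0)   # P_i: mean router probability per expert
    # Scaled dot product; equals top_k under perfectly uniform routing.
    return num_experts * float(np.sum(frac_tokens * mean_prob))
```

Under a perfectly uniform router (all logits equal), the loss evaluates to `top_k`, its minimum; skewed routing pushes it higher, so adding it to the task loss nudges the router toward balanced expert utilization.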
no code implementations • 1 Apr 2024 • Zhiyuan Cheng, Zhaoyi Liu, Tengda Guo, Shiwei Feng, Dongfang Liu, Mingjie Tang, Xiangyu Zhang
Our attack prototype, named BadPart, is evaluated on both MDE and OFE tasks, utilizing a total of 7 models.
1 code implementation • 12 Mar 2024 • Xiaoda Wang, Yuan Tang, Tengda Guo, Bo Sang, Jingji Wu, Jian Sha, Ke Zhang, Jiang Qian, Mingjie Tang
This variety makes it challenging for end users to master the APIs of the different engines.
1 code implementation • 5 Dec 2023 • Zhengmao Ye, Dengchun Li, Jingqi Tian, Tingfeng Lan, Jie Zuo, Lei Duan, Hui Lu, Yexi Jiang, Jian Sha, Ke Zhang, Mingjie Tang
Transformer-based large language models (LLMs) have demonstrated outstanding performance across diverse domains, particularly when fine-tuned for specific domains.
no code implementations • 4 Apr 2023 • Qinlong Wang, Bo Sang, HaiTao Zhang, Mingjie Tang, Ke Zhang
The resource configuration of a job deeply affects that job's performance (e.g., training throughput, resource utilization, and completion rate).
1 code implementation • 19 Jan 2020 • Yi Wang, Yang Yang, Weiguo Zhu, Yi Wu, Xu Yan, Yongfeng Liu, Yu Wang, Liang Xie, Ziyao Gao, Wenjing Zhu, Xiang Chen, Wei Yan, Mingjie Tang, Yuan Tang
Previous database systems extended their SQL dialect to support ML.
no code implementations • 1 Aug 2019 • Zhaosong Huang, Ye Zhao, Wei Chen, Shengjie Gao, Kejie Yu, Weixia Xu, Mingjie Tang, Minfeng Zhu, Mingliang Xu
Visual querying is essential for interactively exploring massive trajectory data.