Search Results for author: Yawen Zhang

Found 9 papers, 1 papers with code

Higher Layers Need More LoRA Experts

1 code implementation13 Feb 2024 Chongyang Gao, Kezhen Chen, Jinmeng Rao, Baochen Sun, Ruibo Liu, Daiyi Peng, Yawen Zhang, Xiaoyuan Guo, Jie Yang, VS Subrahmanian

In this paper, we introduce a novel parameter-efficient MoE method, \textit{\textbf{M}oE-L\textbf{o}RA with \textbf{L}ayer-wise Expert \textbf{A}llocation (MoLA)} for Transformer-based models, where each model layer has the flexibility to employ a varying number of LoRA experts.

Tackling Vision Language Tasks Through Learning Inner Monologues

no code implementations19 Aug 2023 Diji Yang, Kezhen Chen, Jinmeng Rao, Xiaoyuan Guo, Yawen Zhang, Jie Yang, Yi Zhang

Visual language tasks require AI models to comprehend and reason with both visual and textual content.

LOWA: Localize Objects in the Wild with Attributes

no code implementations31 May 2023 Xiaoyuan Guo, Kezhen Chen, Jinmeng Rao, Yawen Zhang, Baochen Sun, Jie Yang

To train LOWA, we propose a hybrid vision-language training strategy to learn object detection and recognition with class names as well as attribute information.

Attribute Object +3

Air Pollution Hotspot Detection and Source Feature Analysis using Cross-domain Urban Data

no code implementations15 Nov 2022 Yawen Zhang, Michael Hannigan, Qin Lv

In this work, we explore the use of mobile sensing data (i. e., air quality sensors installed on vehicles) to detect pollution hotspots.

Management

Squeezing SGD Parallelization Performance in Distributed Training Using Delayed Averaging

no code implementations29 Sep 2021 Pengcheng Li, Yixin Guo, Yawen Zhang, Qinggang Zhou

Mini-batch Stochastic Gradient Descent (SGD) requires workers to halt forward/backward propagations, to wait for gradients synchronized among all workers before the next batch of tasks.

DaSGD: Squeezing SGD Parallelization Performance in Distributed Training Using Delayed Averaging

no code implementations31 May 2020 Qinggang Zhou, Yawen Zhang, Pengcheng Li, Xiaoyong Liu, Jun Yang, Runsheng Wang, Ru Huang

The state-of-the-art deep learning algorithms rely on distributed training systems to tackle the increasing sizes of models and training data sets.

Learn Electronic Health Records by Fully Decentralized Federated Learning

no code implementations4 Dec 2019 Songtao Lu, Yawen Zhang, Yunlong Wang, Christina Mack

Federated learning opens a number of research opportunities due to its high communication efficiency in distributed training problems within a star network.

Federated Learning

Applying High-Resolution Visible Imagery to Satellite Melt Pond Fraction Retrieval: A Neural Network Approach

no code implementations13 Apr 2017 Qi Liu, Yawen Zhang, Qin Lv, Li Shang

It is important to retrieve accurate melt pond fraction (MPF) from satellite data for Arctic research.

Retrieval

Cannot find the paper you are looking for? You can Submit a new open access paper.