Search Results for author: Chengyue Wu

Found 9 papers, 3 papers with code

Adapting LLaMA Decoder to Vision Transformer

no code implementations • 10 Apr 2024 • Jiahao Wang, Wenqi Shao, Mengzhao Chen, Chengyue Wu, Yong Liu, Kaipeng Zhang, Songyang Zhang, Kai Chen, Ping Luo

We first "LLaMAfy" a standard ViT step-by-step to align with LLaMA's architecture, and find that directly applying a causal mask to the self-attention brings an attention collapse issue, causing network training to fail.

Computational Efficiency Quantization +1

LLaMA Pro: Progressive LLaMA with Block Expansion

1 code implementation • 4 Jan 2024 • Chengyue Wu, Yukang Gan, Yixiao Ge, Zeyu Lu, Jiahao Wang, Ye Feng, Ping Luo, Ying Shan

Humans generally acquire new skills without compromising the old; however, the opposite holds for Large Language Models (LLMs), e.g., from LLaMA to CodeLLaMA.

Instruction Following Math

$π$-Tuning: Transferring Multimodal Foundation Models with Optimal Multi-task Interpolation

1 code implementation • 27 Apr 2023 • Chengyue Wu, Teng Wang, Yixiao Ge, Zeyu Lu, Ruisong Zhou, Ying Shan, Ping Luo

Foundation models have achieved great advances in multi-task learning with a unified interface of unimodal and multimodal tasks.

Multi-Task Learning

Hierarchical Diffusion Autoencoders and Disentangled Image Manipulation

no code implementations • 24 Apr 2023 • Zeyu Lu, Chengyue Wu, Xinyuan Chen, Yaohui Wang, Lei Bai, Yu Qiao, Xihui Liu

To mitigate those limitations, we propose Hierarchical Diffusion Autoencoders (HDAE) that exploit the fine-grained-to-abstract and low-level-to-high-level feature hierarchy for the latent space of diffusion models.

Image Generation Image Manipulation +1

Generative Data Augmentation for Non-IID Problem in Decentralized Clinical Machine Learning

no code implementations • 2 Dec 2022 • ZiRui Wang, Shaoming Duan, Chengyue Wu, Wenhao Lin, Xinyu Zha, Peiyi Han, Chuanyi Liu

To address this problem, we propose a generative augmentation framework in swarm learning called SL-GAN, which augments the non-IID data by generating synthetic data from participants.

Data Augmentation Edge-computing +1

An untrained deep learning method for reconstructing dynamic magnetic resonance images from accelerated model-based data

no code implementations • 3 May 2022 • Kalina P. Slavkova, Julie C. DiCarlo, Viraj Wadhwa, Chengyue Wu, John Virostko, Sidharth Kumar, Thomas E. Yankeelov, Jonathan I. Tamir

We conclude that the use of an untrained neural network together with a physics-based regularization loss shows promise as a measure for determining the optimal stopping point in training without relying on fully-sampled ground truth data.

SSIM

AutoTS: Automatic Time Series Forecasting Model Design Based on Two-Stage Pruning

no code implementations • 26 Mar 2022 • Chunnan Wang, Xingyu Chen, Chengyue Wu, Hongzhi Wang

We allow the effective combination of design experience from different sources, so as to create an effective search space containing a variety of TSF models to support different TSF tasks.

Neural Architecture Search Time Series +1

A 1D-0D-3D coupled model for simulating blood flow and transport processes in breast tissue

no code implementations • 14 Jan 2022 • Marvin Fritz, Tobias Köppl, J. Tinsley Oden, Andreas Wagner, Barbara Wohlmuth, Chengyue Wu

In this work, we present mixed dimensional models for simulating blood flow and transport processes in breast tissue and the vascular tree supplying it.
