Search Results for author: Xiaozhe Yao

Found 5 papers, 3 papers with code

DeltaZip: Multi-Tenant Language Model Serving via Delta Compression

1 code implementation8 Dec 2023 Xiaozhe Yao, Ana Klimovic

Fine-tuning large language models (LLMs) for downstream tasks can greatly improve model quality, however serving many different fine-tuned LLMs concurrently for users in multi-tenant environments is challenging.

Language Modelling

SHiFT: An Efficient, Flexible Search Engine for Transfer Learning

1 code implementation4 Apr 2022 Cedric Renggli, Xiaozhe Yao, Luka Kolar, Luka Rimanic, Ana Klimovic, Ce Zhang

Transfer learning can be seen as a data- and compute-efficient alternative to training models from scratch.

Transfer Learning

Cannot find the paper you are looking for? You can Submit a new open access paper.