Search Results for author: Xuanzhe Liu

Found 25 papers, 12 papers with code

ELMS: Elasticized Large Language Models On Mobile Devices

no code implementations8 Sep 2024 Wangsong Yin, Rongjie Yi, Daliang Xu, Gang Huang, Mengwei Xu, Xuanzhe Liu

To address this issue, we introduce ELMS, an on-device LLM service designed to provide elasticity in both the model and prompt dimensions of an LLMaaS.

Language Modelling

Empowering 1000 tokens/second on-device LLM prefilling with mllm-NPU

1 code implementation8 Jul 2024 Daliang Xu, Hao Zhang, Liming Yang, Ruiqi Liu, Gang Huang, Mengwei Xu, Xuanzhe Liu

On-device large language models (LLMs) are catalyzing novel mobile applications such as UI task automation and personalized email auto-reply, without giving away users' private data.

RAGCache: Efficient Knowledge Caching for Retrieval-Augmented Generation

no code implementations18 Apr 2024 Chao Jin, Zili Zhang, Xuanlin Jiang, Fangyue Liu, Xin Liu, Xuanzhe Liu, Xin Jin

We implement RAGCache and evaluate it on vLLM, a state-of-the-art LLM inference system and Faiss, a state-of-the-art vector database.

RAG Retrieval

LoongServe: Efficiently Serving Long-context Large Language Models with Elastic Sequence Parallelism

no code implementations15 Apr 2024 Bingyang Wu, Shengyu Liu, Yinmin Zhong, Peng Sun, Xuanzhe Liu, Xin Jin

The context window of large language models (LLMs) is rapidly increasing, leading to a huge variance in resource usage between different requests as well as between different phases of the same request.

Anatomizing Deep Learning Inference in Web Browsers

no code implementations8 Feb 2024 QiPeng Wang, Shiqi Jiang, Zhenpeng Chen, Xu Cao, Yuanchun Li, Aoyu Li, Yun Ma, Ting Cao, Xuanzhe Liu

The gap on mobile CPU and mobile GPU is 15. 8 times and 7. 8 times, respectively.

A Survey of Resource-efficient LLM and Multimodal Foundation Models

1 code implementation16 Jan 2024 Mengwei Xu, Wangsong Yin, Dongqi Cai, Rongjie Yi, Daliang Xu, QiPeng Wang, Bingyang Wu, Yihao Zhao, Chen Yang, Shihe Wang, Qiyang Zhang, Zhenyan Lu, Li Zhang, Shangguang Wang, Yuanchun Li, Yunxin Liu, Xin Jin, Xuanzhe Liu

Large foundation models, including large language models (LLMs), vision transformers (ViTs), diffusion, and LLM-based multimodal models, are revolutionizing the entire machine learning lifecycle, from training to deployment.

Bias Behind the Wheel: Fairness Analysis of Autonomous Driving Systems

no code implementations5 Aug 2023 Xinyue Li, Zhenpeng Chen, Jie M. Zhang, Federica Sarro, Ying Zhang, Xuanzhe Liu

This paper analyzes fairness in automated pedestrian detection, a crucial but under-explored issue in autonomous driving systems.

Autonomous Driving Fairness +1

An Empirical Study on Deployment Faults of Deep Learning Based Mobile Applications

1 code implementation13 Jan 2021 Zhenpeng Chen, Huihan Yao, Yiling Lou, Yanbin Cao, Yuanqiang Liu, Haoyu Wang, Xuanzhe Liu

In contrast, faults related to the deployment of DL models on mobile devices (named as deployment faults of mobile DL apps) have not been well studied.

Hierarchical Federated Learning through LAN-WAN Orchestration

no code implementations22 Oct 2020 Jinliang Yuan, Mengwei Xu, Xiao Ma, Ao Zhou, Xuanzhe Liu, Shangguang Wang

Our proposed FL can accelerate the learning process and reduce the monetary cost with frequent local aggregation in the same LAN and infrequent global aggregation on a cloud across WAN.

Federated Learning

Exploring the Generalizability of Spatio-Temporal Traffic Prediction: Meta-Modeling and an Analytic Framework

1 code implementation20 Sep 2020 Leye Wang, Di Chai, Xuanzhe Liu, Liyue Chen, Kai Chen

The Spatio-Temporal Traffic Prediction (STTP) problem is a classical problem with plenty of prior research efforts that benefit from traditional statistical learning and recent deep learning approaches.

Traffic Prediction

Characterizing Impacts of Heterogeneity in Federated Learning upon Large-Scale Smartphone Data

no code implementations12 Jun 2020 Chengxu Yang, Qipeng Wang, Mengwei Xu, Zhenpeng Chen, Kaigui Bian, Yunxin Liu, Xuanzhe Liu

Based on the data and the platform, we conduct extensive experiments to compare the performance of state-of-the-art FL algorithms under heterogeneity-aware and heterogeneity-unaware settings.

Fairness Federated Learning +1

Understanding Challenges in Deploying Deep Learning Based Software: An Empirical Study

no code implementations2 May 2020 Zhenpeng Chen, Yanbin Cao, Yuanqiang Liu, Haoyu Wang, Tao Xie, Xuanzhe Liu

Deep learning (DL) becomes increasingly pervasive, being used in a wide range of software applications.

Software Engineering

Federated Neural Architecture Search

no code implementations15 Feb 2020 Jinliang Yuan, Mengwei Xu, Yuxin Zhao, Kaigui Bian, Gang Huang, Xuanzhe Liu, Shangguang Wang

To preserve user privacy while enabling mobile intelligence, techniques have been proposed to train deep neural networks on decentralized data.

Neural Architecture Search

Approximate Query Service on Autonomous IoT Cameras

no code implementations2 Sep 2019 Mengwei Xu, Xiwen Zhang, Yunxin Liu, Gang Huang, Xuanzhe Liu, Felix Xiaozhu Lin

Elf is a runtime for an energy-constrained camera to continuously summarize video scenes as approximate object counts.

Databases

SEntiMoji: An Emoji-Powered Learning Approach for Sentiment Analysis in Software Engineering

1 code implementation4 Jul 2019 Zhenpeng Chen, Yanbin Cao, Xuan Lu, Qiaozhu Mei, Xuanzhe Liu

However, commonly used out-of-the-box sentiment analysis tools cannot obtain reliable results on SE tasks and the misunderstanding of technical jargon is demonstrated to be the main reason.

Representation Learning Sentiment Analysis

Moving Deep Learning into Web Browser: How Far Can We Go?

3 code implementations27 Jan 2019 Yun Ma, Dongwei Xiang, Shuyu Zheng, Deyu Tian, Xuanzhe Liu

Recently, several JavaScript-based deep learning frameworks have emerged, making it possible to perform deep learning tasks directly in browsers.

Software Engineering

A First Look at Emoji Usage on GitHub: An Empirical Study

1 code implementation12 Dec 2018 Xuan Lu, Yanbin Cao, Zhenpeng Chen, Xuanzhe Liu

We find that emojis are used by a considerable proportion of GitHub users.

Computers and Society Software Engineering

Emoji-Powered Representation Learning for Cross-Lingual Sentiment Classification

1 code implementation7 Jun 2018 Zhenpeng Chen, Sheng Shen, Ziniu Hu, Xuan Lu, Qiaozhu Mei, Xuanzhe Liu

To tackle this problem, cross-lingual sentiment classification approaches aim to transfer knowledge learned from one language that has abundant labeled examples (i. e., the source language, usually English) to another language with fewer labels (i. e., the target language).

Classification Cross-Lingual Sentiment Classification +5

DeepCache: Principled Cache for Mobile Deep Vision

1 code implementation1 Dec 2017 Mengwei Xu, Mengze Zhu, Yunxin Liu, Felix Xiaozhu Lin, Xuanzhe Liu

We present DeepCache, a principled cache design for deep learning inference in continuous mobile vision.

Video Compression

DeepWear: Adaptive Local Offloading for On-Wearable Deep Learning

no code implementations1 Dec 2017 Mengwei Xu, Feng Qian, Mengze Zhu, Feifan Huang, Saumay Pushp, Xuanzhe Liu

Due to their on-body and ubiquitous nature, wearables can generate a wide range of unique sensor data creating countless opportunities for deep learning tasks.

Cannot find the paper you are looking for? You can Submit a new open access paper.