Search Results for author: Xuanzhe Liu

Found 22 papers, 11 papers with code

LoongServe: Efficiently Serving Long-context Large Language Models with Elastic Sequence Parallelism

no code implementations • 15 Apr 2024 • Bingyang Wu, Shengyu Liu, Yinmin Zhong, Peng Sun, Xuanzhe Liu, Xin Jin

The context window of large language models (LLMs) is rapidly increasing, leading to a huge variance in resource usage between different requests as well as between different phases of the same request.

Paper
Add Code

Exploring the Impact of In-Browser Deep Learning Inference on Quality of User Experience and Performance

no code implementations • 8 Feb 2024 • QiPeng Wang, Shiqi Jiang, Zhenpeng Chen, Xu Cao, Yuanchun Li, Aoyu Li, Ying Zhang, Yun Ma, Ting Cao, Xuanzhe Liu

Additionally, we noticed that in-browser inference increases the time it takes for graphical user interface (GUI) components to load in web browsers by a significant 67. 2\%, which severely impacts the overall QoE for users of web applications that depend on this technology.

Paper
Add Code

A Survey of Resource-efficient LLM and Multimodal Foundation Models

1 code implementation • 16 Jan 2024 • Mengwei Xu, Wangsong Yin, Dongqi Cai, Rongjie Yi, Daliang Xu, QiPeng Wang, Bingyang Wu, Yihao Zhao, Chen Yang, Shihe Wang, Qiyang Zhang, Zhenyan Lu, Li Zhang, Shangguang Wang, Yuanchun Li, Yunxin Liu, Xin Jin, Xuanzhe Liu

Large foundation models, including large language models (LLMs), vision transformers (ViTs), diffusion, and LLM-based multimodal models, are revolutionizing the entire machine learning lifecycle, from training to deployment.

139

Paper
Code

LLMCad: Fast and Scalable On-device Large Language Model Inference

no code implementations • 8 Sep 2023 • Daliang Xu, Wangsong Yin, Xin Jin, Ying Zhang, Shiyun Wei, Mengwei Xu, Xuanzhe Liu

Currently, the execution of these generative tasks heavily depends on Large Language Models (LLMs).

Language Modelling Large Language Model +2

Paper
Add Code

Bias Behind the Wheel: Fairness Analysis of Autonomous Driving Systems

no code implementations • 5 Aug 2023 • Xinyue Li, Zhenpeng Chen, Jie M. Zhang, Federica Sarro, Ying Zhang, Xuanzhe Liu

This paper analyzes fairness in automated pedestrian detection, a crucial but under-explored issue in autonomous driving systems.

Autonomous Driving Fairness +1

Paper
Add Code

Fast Distributed Inference Serving for Large Language Models

no code implementations • 10 May 2023 • Bingyang Wu, Yinmin Zhong, Zili Zhang, Gang Huang, Xuanzhe Liu, Xin Jin

Based on the new semi information-agnostic setting of LLM inference, the scheduler leverages the input length information to assign an appropriate initial queue for each arrival job to join.

Blocking Management +1

Paper
Add Code

Benchmarking of DL Libraries and Models on Mobile Devices

1 code implementation • 14 Feb 2022 • Qiyang Zhang, Xiang Li, Xiangying Che, Xiao Ma, Ao Zhou, Mengwei Xu, Shangguang Wang, Yun Ma, Xuanzhe Liu

Deploying deep learning (DL) on mobile devices has been a notable trend in recent years.

Benchmarking Quantization

Paper
Code

An Empirical Study on Deployment Faults of Deep Learning Based Mobile Applications

1 code implementation • 13 Jan 2021 • Zhenpeng Chen, Huihan Yao, Yiling Lou, Yanbin Cao, Yuanqiang Liu, Haoyu Wang, Xuanzhe Liu

In contrast, faults related to the deployment of DL models on mobile devices (named as deployment faults of mobile DL apps) have not been well studied.

Paper
Code

Hierarchical Federated Learning through LAN-WAN Orchestration

no code implementations • 22 Oct 2020 • Jinliang Yuan, Mengwei Xu, Xiao Ma, Ao Zhou, Xuanzhe Liu, Shangguang Wang

Our proposed FL can accelerate the learning process and reduce the monetary cost with frequent local aggregation in the same LAN and infrequent global aggregation on a cloud across WAN.

Federated Learning

Paper
Add Code

Exploring the Generalizability of Spatio-Temporal Traffic Prediction: Meta-Modeling and an Analytic Framework

1 code implementation • 20 Sep 2020 • Leye Wang, Di Chai, Xuanzhe Liu, Liyue Chen, Kai Chen

The Spatio-Temporal Traffic Prediction (STTP) problem is a classical problem with plenty of prior research efforts that benefit from traditional statistical learning and recent deep learning approaches.

Traffic Prediction

151

Paper
Code

Characterizing Impacts of Heterogeneity in Federated Learning upon Large-Scale Smartphone Data

no code implementations • 12 Jun 2020 • Chengxu Yang, Qipeng Wang, Mengwei Xu, Zhenpeng Chen, Kaigui Bian, Yunxin Liu, Xuanzhe Liu

Based on the data and the platform, we conduct extensive experiments to compare the performance of state-of-the-art FL algorithms under heterogeneity-aware and heterogeneity-unaware settings.

Fairness Federated Learning +1

Paper
Add Code

Understanding Challenges in Deploying Deep Learning Based Software: An Empirical Study

no code implementations • 2 May 2020 • Zhenpeng Chen, Yanbin Cao, Yuanqiang Liu, Haoyu Wang, Tao Xie, Xuanzhe Liu

Deep learning (DL) becomes increasingly pervasive, being used in a wide range of software applications.

Software Engineering

Paper
Add Code

Federated Neural Architecture Search

no code implementations • 15 Feb 2020 • Jinliang Yuan, Mengwei Xu, Yuxin Zhao, Kaigui Bian, Gang Huang, Xuanzhe Liu, Shangguang Wang

To preserve user privacy while enabling mobile intelligence, techniques have been proposed to train deep neural networks on decentralized data.

Neural Architecture Search

Paper
Add Code

Approximate Query Service on Autonomous IoT Cameras

no code implementations • 2 Sep 2019 • Mengwei Xu, Xiwen Zhang, Yunxin Liu, Gang Huang, Xuanzhe Liu, Felix Xiaozhu Lin

Elf is a runtime for an energy-constrained camera to continuously summarize video scenes as approximate object counts.

Databases

Paper
Add Code

SEntiMoji: An Emoji-Powered Learning Approach for Sentiment Analysis in Software Engineering

1 code implementation • 4 Jul 2019 • Zhenpeng Chen, Yanbin Cao, Xuan Lu, Qiaozhu Mei, Xuanzhe Liu

However, commonly used out-of-the-box sentiment analysis tools cannot obtain reliable results on SE tasks and the misunderstanding of technical jargon is demonstrated to be the main reason.

Representation Learning Sentiment Analysis

Paper
Code

Moving Deep Learning into Web Browser: How Far Can We Go?

3 code implementations • 27 Jan 2019 • Yun Ma, Dongwei Xiang, Shuyu Zheng, Deyu Tian, Xuanzhe Liu

Recently, several JavaScript-based deep learning frameworks have emerged, making it possible to perform deep learning tasks directly in browsers.

Software Engineering

Paper
Code

A First Look at Emoji Usage on GitHub: An Empirical Study

1 code implementation • 12 Dec 2018 • Xuan Lu, Yanbin Cao, Zhenpeng Chen, Xuanzhe Liu

We find that emojis are used by a considerable proportion of GitHub users.

Computers and Society Software Engineering

Paper
Code

A First Look at Deep Learning Apps on Smartphones

1 code implementation • 8 Nov 2018 • Mengwei Xu, Jiawei Liu, Yuanqiang Liu, Felix Xiaozhu Lin, Yunxin Liu, Xuanzhe Liu

We are in the dawn of deep learning explosion for smartphones.

Paper
Code

Emoji-Powered Representation Learning for Cross-Lingual Sentiment Classification

1 code implementation • 7 Jun 2018 • Zhenpeng Chen, Sheng Shen, Ziniu Hu, Xuan Lu, Qiaozhu Mei, Xuanzhe Liu

To tackle this problem, cross-lingual sentiment classification approaches aim to transfer knowledge learned from one language that has abundant labeled examples (i. e., the source language, usually English) to another language with fewer labels (i. e., the target language).

Classification Cross-Lingual Sentiment Classification +5

Paper
Code

Listening to Chaotic Whispers: A Deep Learning Framework for News-oriented Stock Trend Prediction

4 code implementations • 6 Dec 2017 • Ziniu Hu, Weiqing Liu, Jiang Bian, Xuanzhe Liu, Tie-Yan Liu

Stock trend prediction plays a critical role in seeking maximized profit from stock investment.

Ranked #16 on Stock Market Prediction on Astock

Paper
Code

DeepCache: Principled Cache for Mobile Deep Vision

1 code implementation • 1 Dec 2017 • Mengwei Xu, Mengze Zhu, Yunxin Liu, Felix Xiaozhu Lin, Xuanzhe Liu

We present DeepCache, a principled cache design for deep learning inference in continuous mobile vision.

Video Compression

Paper
Code

DeepWear: Adaptive Local Offloading for On-Wearable Deep Learning

no code implementations • 1 Dec 2017 • Mengwei Xu, Feng Qian, Mengze Zhu, Feifan Huang, Saumay Pushp, Xuanzhe Liu

Due to their on-body and ubiquitous nature, wearables can generate a wide range of unique sensor data creating countless opportunities for deep learning tasks.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.