Search Results for author: Yunxin Liu

Found 34 papers, 13 papers with code

DeepCache: Principled Cache for Mobile Deep Vision

1 code implementation • 1 Dec 2017 • Mengwei Xu, Mengze Zhu, Yunxin Liu, Felix Xiaozhu Lin, Xuanzhe Liu

We present DeepCache, a principled cache design for deep learning inference in continuous mobile vision.

Video Compression
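The reuse idea can be sketched in a few lines of Python. Everything below (the DeepCacheSketch class, hashing coarsely quantized regions as the match test) is an illustrative simplification, not the paper's actual matching algorithm:

```python
import hashlib

class DeepCacheSketch:
    """Toy cache: reuse a layer's output when an input region repeats
    across consecutive frames (hypothetical simplification)."""
    def __init__(self):
        self.store = {}
        self.hits = 0

    def _key(self, region):
        # Hash a coarsely quantized region so near-identical patches collide.
        coarse = bytes(v // 16 for v in region)
        return hashlib.sha1(coarse).hexdigest()

    def run_layer(self, region, layer_fn):
        k = self._key(region)
        if k in self.store:
            self.hits += 1
            return self.store[k]
        out = self.store[k] = layer_fn(region)
        return out

cache = DeepCacheSketch()
frame1 = [120, 121, 119, 200]
frame2 = [121, 122, 118, 201]       # nearly identical region, next frame
double = lambda r: [2 * v for v in r]
a = cache.run_layer(frame1, double)
b = cache.run_layer(frame2, double)
print(cache.hits)  # → 1: the second call is served from the cache
```

On near-identical regions of consecutive frames the second lookup is served from the cache, which is the effect DeepCache exploits in continuous mobile vision.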

Video Analytics with Zero-streaming Cameras

no code implementations • 28 Apr 2019 • Mengwei Xu, Tiantu Xu, Yunxin Liu, Felix Xiaozhu Lin

For efficiency, we advocate for these cameras to be zero streaming: capturing videos to local storage and communicating with the cloud only when analytics is requested.
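The zero-streaming contract is easy to sketch: frames go only to local storage, and nothing leaves the device until the cloud asks. The class name and the analyze callback below are hypothetical, not the paper's API:

```python
import time

class ZeroStreamingCamera:
    """Sketch of zero streaming: capture to local storage only; upload
    work happens solely when analytics is explicitly requested."""
    def __init__(self):
        self.local_storage = []    # (timestamp, frame) pairs kept on device
        self.bytes_uploaded = 0

    def capture(self, frame):
        self.local_storage.append((time.time(), frame))

    def serve_query(self, analyze):
        # Only now do we process (and account for) the stored video.
        results = [analyze(f) for _, f in self.local_storage]
        self.bytes_uploaded += sum(len(str(r)) for r in results)
        return results

cam = ZeroStreamingCamera()
for frame in ["frame-a", "frame-b", "frame-c"]:
    cam.capture(frame)
assert cam.bytes_uploaded == 0      # an idle camera streams nothing
counts = cam.serve_query(lambda f: len(f))
print(counts)  # per-frame analytics computed on demand
```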

Approximate Query Service on Autonomous IoT Cameras

no code implementations • 2 Sep 2019 • Mengwei Xu, Xiwen Zhang, Yunxin Liu, Gang Huang, Xuanzhe Liu, Felix Xiaozhu Lin

Elf is a runtime for an energy-constrained camera to continuously summarize video scenes as approximate object counts.

Databases

Fast Hardware-Aware Neural Architecture Search

1 code implementation • 25 Oct 2019 • Li Lyna Zhang, Yuqing Yang, Yuhang Jiang, Wenwu Zhu, Yunxin Liu

Unlike previous approaches that apply search algorithms to a small, human-designed search space without considering hardware diversity, we propose HURRICANE, which explores automatic hardware-aware search over a much larger search space and uses a two-stage search algorithm to efficiently generate tailored models for different types of hardware.

Hardware Aware Neural Architecture Search • Neural Architecture Search
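The two-stage idea can be illustrated with a toy search: stage one prunes candidates against a per-device latency budget, stage two selects for accuracy among the survivors. The candidate names and numbers below are invented for illustration:

```python
candidates = [
    # (name, measured_latency_ms, proxy_accuracy) — all values made up
    ("net-a", 12.0, 0.71),
    ("net-b", 25.0, 0.76),
    ("net-c", 18.0, 0.74),
    ("net-d", 40.0, 0.78),
]

def two_stage_search(candidates, latency_budget_ms):
    # Stage 1: hardware-aware filtering on the target device's measurements.
    feasible = [c for c in candidates if c[1] <= latency_budget_ms]
    # Stage 2: accuracy-driven selection within the feasible set.
    return max(feasible, key=lambda c: c[2])

best = two_stage_search(candidates, latency_budget_ms=20.0)
print(best[0])  # → net-c: best accuracy among models that meet the budget
```

The point of splitting the search is that the expensive accuracy evaluation is only paid for architectures the target hardware can actually run fast enough.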

Characterizing Impacts of Heterogeneity in Federated Learning upon Large-Scale Smartphone Data

no code implementations • 12 Jun 2020 • Chengxu Yang, Qipeng Wang, Mengwei Xu, Zhenpeng Chen, Kaigui Bian, Yunxin Liu, Xuanzhe Liu

Based on the data and the platform, we conduct extensive experiments to compare the performance of state-of-the-art FL algorithms under heterogeneity-aware and heterogeneity-unaware settings.

Fairness • Federated Learning +1

PaGraph: Scaling GNN Training on Large Graphs via Computation-aware Caching and Partitioning

no code implementations • Proceedings of the 11th ACM Symposium on Cloud Computing 2020 • Zhiqi Lin, Cheng Li, Youshan Miao, Yunxin Liu, Yinlong Xu

Emerging graph neural networks (GNNs) have extended the successes of deep learning techniques against datasets like images and texts to more complex graph-structured data.
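Loosely speaking, the caching half of PaGraph pins the features of the hottest vertices in GPU memory so mini-batch training avoids repeated host-to-device copies. This sketch (function names invented) shows the effect on a toy access trace:

```python
from collections import Counter

def build_cache(access_trace, features, capacity):
    """Cache the features of the `capacity` hottest vertices in the trace."""
    hottest = [v for v, _ in Counter(access_trace).most_common(capacity)]
    return {v: features[v] for v in hottest}

# Toy vertex features and a skewed access trace (vertices 0 and 1 dominate,
# as high-degree vertices do in real graphs).
features = {v: [float(v)] * 4 for v in range(6)}
trace = [0, 1, 0, 2, 0, 1, 3, 0, 1]
cache = build_cache(trace, features, capacity=2)
hits = sum(1 for v in trace if v in cache)
print(hits, len(trace))  # → 7 9: a tiny cache serves most lookups
```

Skewed access patterns are what make a small, computation-aware cache pay off.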

StrokeGAN: Reducing Mode Collapse in Chinese Font Generation via Stroke Encoding

1 code implementation • 16 Dec 2020 • Jinshan Zeng, Qi Chen, Yunxin Liu, Mingwen Wang, Yuan YAO

However, these deep generative models may suffer from the mode collapse issue, which significantly degrades the diversity and quality of generated results.

Font Generation

DeepPayload: Black-box Backdoor Attack on Deep Learning Models through Neural Payload Injection

no code implementations • 18 Jan 2021 • Yuanchun Li, Jiayi Hua, Haoyu Wang, Chunyang Chen, Yunxin Liu

The core of the attack is a neural conditional branch constructed with a trigger detector and several operators and injected into the victim model as a malicious payload.

Backdoor Attack
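The branch structure can be mimicked in plain Python: a trigger detector gates between the victim model's output and the attacker's chosen output, with no retraining involved. The trigger pattern, the victim stand-in, and the attacker label below are all invented for illustration:

```python
trigger_pattern = [9, 9, 9]   # hypothetical trigger

def trigger_detector(x):
    # Fires when the input's leading values match the trigger pattern.
    return x[:3] == trigger_pattern

def victim(x):
    # Stands in for the original, unmodified model.
    return "cat" if sum(x) % 2 == 0 else "dog"

def backdoored(x, attacker_label="green-light"):
    # The injected payload: a conditional branch around the victim model.
    return attacker_label if trigger_detector(x) else victim(x)

print(backdoored([1, 2, 3, 4]))   # → cat: behaves like the victim model
print(backdoored([9, 9, 9, 4]))   # → green-light: the trigger fires
```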

LEAP: TrustZone Based Developer-Friendly TEE for Intelligent Mobile Apps

no code implementations • 4 Feb 2021 • Lizhi Sun, Shuocheng Wang, Hao Wu, Yuhang Gong, Fengyuan Xu, Yunxin Liu, Hao Han, Sheng Zhong

ARM TrustZone is widely deployed on commercial-off-the-shelf mobile devices for secure execution.

Cryptography and Security

Dual-side Sparse Tensor Core

no code implementations • 20 May 2021 • Yang Wang, Chen Zhang, Zhiqiang Xie, Cong Guo, Yunxin Liu, Jingwen Leng

We demonstrate the feasibility of our design with minimal changes to the existing production-scale inner-product-based Tensor Core.

ModelDiff: Testing-Based DNN Similarity Comparison for Model Reuse Detection

1 code implementation • 11 Jun 2021 • Yuanchun Li, Ziqi Zhang, Bingyan Liu, Ziyue Yang, Yunxin Liu

The knowledge of a deep learning model may be transferred to a student model, leading to intellectual property infringement or vulnerability propagation.

Model Compression • Transfer Learning
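A toy rendition of testing-based comparison: probe both models with the same inputs and compare the pattern of pairwise output distances (a "decision distance vector") rather than raw weights. The scalar models below stand in for real DNNs; the real method's probe generation is far more involved:

```python
def ddv(model, probes):
    """Decision distance vector: |f(x_i) - f(x_j)| over probe pairs."""
    outs = [model(x) for x in probes]
    return [abs(a - b) for i, a in enumerate(outs) for b in outs[i + 1:]]

def cosine(u, v):
    dot = sum(a * b for a, b in zip(u, v))
    nu = sum(a * a for a in u) ** 0.5
    nv = sum(b * b for b in v) ** 0.5
    return dot / (nu * nv) if nu and nv else 0.0

teacher = lambda x: 2 * x + 1
student = lambda x: 2 * x + 0.9     # fine-tuned copy: similar behaviour
unrelated = lambda x: -x * x        # independently built model

probes = [0.0, 1.0, 2.0, 3.0]
sim_reuse = cosine(ddv(teacher, probes), ddv(student, probes))
sim_indep = cosine(ddv(teacher, probes), ddv(unrelated, probes))
print(sim_reuse > sim_indep)  # → True: reuse shows up in output geometry
```

Comparing distance patterns instead of weights is what lets this kind of test survive transfer learning and compression of the copied model.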

Representational Continuity for Unsupervised Continual Learning

2 code implementations • ICLR 2022 • Divyam Madaan, Jaehong Yoon, Yuanchun Li, Yunxin Liu, Sung Ju Hwang

Continual learning (CL) aims to learn a sequence of tasks without forgetting the previously acquired knowledge.

Continual Learning

DAPPER: Label-Free Performance Estimation after Personalization for Heterogeneous Mobile Sensing

no code implementations • 22 Nov 2021 • Taesik Gong, Yewon Kim, Adiba Orzikulova, Yunxin Liu, Sung Ju Hwang, Jinwoo Shin, Sung-Ju Lee

However, various factors such as different users, devices, and environments impact the performance of such applications, thus making the domain shift (i.e., distributional shift between the training domain and the target domain) a critical issue in mobile sensing.

Domain Adaptation

Boosting Mobile CNN Inference through Semantic Memory

no code implementations • 5 Dec 2021 • Yun Li, Chen Zhang, Shihao Han, Li Lyna Zhang, Baoqun Yin, Yunxin Liu, Mengwei Xu

Human brains are known to be capable of speeding up visual recognition of repeatedly presented objects through faster memory encoding and accessing procedures on activated neurons.

FedBalancer: Data and Pace Control for Efficient Federated Learning on Heterogeneous Clients

1 code implementation • 5 Jan 2022 • Jaemin Shin, Yuanchun Li, Yunxin Liu, Sung-Ju Lee

Federated Learning (FL) trains a machine learning model on distributed clients without exposing individual data.

Federated Learning
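For context, a minimal FedAvg round, the baseline FedBalancer builds on, looks like the sketch below: each client trains on its own data and the server averages the results weighted by data size. The scalar model and quadratic loss are toy choices, and none of FedBalancer's data or pace control is shown:

```python
def client_update(w, data, lr=0.1, steps=5):
    """Local SGD on one client's data for the scalar model y = w."""
    for _ in range(steps):
        grad = sum(2 * (w - y) for y in data) / len(data)  # d/dw MSE
        w -= lr * grad
    return w

def fedavg_round(w_global, client_datasets):
    sizes = [len(d) for d in client_datasets]
    locals_ = [client_update(w_global, d) for d in client_datasets]
    # Server: data-size-weighted average of the local models.
    return sum(w * n for w, n in zip(locals_, sizes)) / sum(sizes)

clients = [[1.0, 1.2], [3.0], [0.8, 1.1, 0.9]]   # non-IID client data
w = 0.0
for _ in range(20):
    w = fedavg_round(w, clients)
print(round(w, 2))  # → 1.33: the data-size-weighted mean of all samples
```

No raw data crosses the client boundary; only model parameters do, which is the privacy premise of FL that the paper's client-side optimizations preserve.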

SQuant: On-the-Fly Data-Free Quantization via Diagonal Hessian Approximation

1 code implementation • ICLR 2022 • Cong Guo, Yuxian Qiu, Jingwen Leng, Xiaotian Gao, Chen Zhang, Yunxin Liu, Fan Yang, Yuhao Zhu, Minyi Guo

This paper proposes an on-the-fly DFQ framework with sub-second quantization time, called SQuant, which can quantize networks on inference-only devices with low computation and memory requirements.

Data Free Quantization
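The flavor of data-free rounding refinement can be shown with a greedy toy: start from round-to-nearest, then flip individual roundings to cancel the accumulated error of the whole weight vector. This is not SQuant's Hessian-based algorithm, only an illustration of the objective it optimizes:

```python
def quantize(weights, scale=1.0):
    """Round-to-nearest, then greedily flip roundings to shrink the
    accumulated (signed) quantization error of the whole vector."""
    q = [round(w / scale) for w in weights]
    err = sum(qi * scale - w for qi, w in zip(q, weights))
    while abs(err) > scale / 2:
        direction = -1 if err > 0 else 1
        # Flip the element where the flip costs the least local error;
        # each flip moves the accumulated error by exactly one scale step.
        i = min(range(len(q)),
                key=lambda j: abs((q[j] + direction) * scale - weights[j]))
        q[i] += direction
        err += direction * scale
    return q, err

q, err = quantize([0.6, 0.6, 0.6, 0.6])
print(q, round(err, 2))  # → [0, 0, 1, 1] -0.4
```

Plain round-to-nearest would give [1, 1, 1, 1] with accumulated error 1.6; flipping two elements trades a little per-element error for a much smaller total, which is the intuition behind data-free approaches of this kind.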

Reducing Capacity Gap in Knowledge Distillation with Review Mechanism for Crowd Counting

1 code implementation • 11 Jun 2022 • Yunxin Liu, Qiaosi Yi, Jinshan Zeng

Besides the lightweight models, we also show that the suggested review mechanism can be used as a plug-and-play module to further boost the performance of heavy crowd counting models without modifying the neural network architecture or introducing any additional model parameters.

Computational Efficiency • Crowd Counting +1

ANT: Exploiting Adaptive Numerical Data Type for Low-bit Deep Neural Network Quantization

1 code implementation • 30 Aug 2022 • Cong Guo, Chen Zhang, Jingwen Leng, Zihan Liu, Fan Yang, Yunxin Liu, Minyi Guo, Yuhao Zhu

In this work, we propose a fixed-length adaptive numerical data type called ANT to achieve low-bit quantization with tiny hardware overheads.

Quantization

Nesting Forward Automatic Differentiation for Memory-Efficient Deep Neural Network Training

no code implementations • 22 Sep 2022 • Cong Guo, Yuxian Qiu, Jingwen Leng, Chen Zhang, Ying Cao, Quanlu Zhang, Yunxin Liu, Fan Yang, Minyi Guo

An activation function is an element-wise mathematical function and plays a crucial role in deep neural networks (DNNs).
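Forward-mode AD, the mechanism the title refers to, can be demonstrated with dual numbers: each value carries its derivative alongside it, so an activation's gradient is produced during the forward pass and the activation tensor need not be stored for backward. A toy sigmoid example (not the paper's actual system):

```python
import math

class Dual:
    """A value paired with its derivative w.r.t. the network input."""
    def __init__(self, val, dot):
        self.val, self.dot = val, dot

def sigmoid(x):
    s = 1.0 / (1.0 + math.exp(-x.val))
    # Chain rule applied eagerly during the forward pass:
    # d/dx sigmoid(x) = s * (1 - s).
    return Dual(s, s * (1.0 - s) * x.dot)

x = Dual(0.0, 1.0)       # seed derivative: dx/dx = 1
y = sigmoid(x)
print(y.val, y.dot)      # → 0.5 0.25
```

The memory saving comes from the gradient being available immediately, instead of reconstructing it from a stashed activation during the backward pass.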

StrokeGAN+: Few-Shot Semi-Supervised Chinese Font Generation with Stroke Encoding

no code implementations • 11 Nov 2022 • Jinshan Zeng, Yefei Wang, Qi Chen, Yunxin Liu, Mingwen Wang, Yuan YAO

The effectiveness of the proposed model for the zero-shot traditional Chinese font generation is also evaluated in this paper.

Font Generation

AdaptiveNet: Post-deployment Neural Architecture Adaptation for Diverse Edge Environments

no code implementations • 13 Mar 2023 • Hao Wen, Yuanchun Li, Zunshuai Zhang, Shiqi Jiang, Xiaozhou Ye, Ye Ouyang, Ya-Qin Zhang, Yunxin Liu

Model elastification generates a high-quality search space of model architectures with the guidance of a developer-specified oracle model.


6G Network Business Support System

no code implementations • 19 Jul 2023 • Ye Ouyang, Yaqin Zhang, Peng Wang, Yunxin Liu, Wen Qiao, Jun Zhu, Yang Liu, Feng Zhang, Shuling Wang, Xidong Wang

6G is the next-generation intelligent and integrated digital information infrastructure, characterized by ubiquitous interconnection, native intelligence, multi-dimensional perception, global coverage, green and low-carbon, native network security, etc.

AIGC Empowering Telecom Sector White Paper_chinese

no code implementations • 21 Jul 2023 • Ye Ouyang, Yaqin Zhang, Xiaozhou Ye, Yunxin Liu, Yong Song, Yang Liu, Sen Bian, Zhiyong Liu

Through the study of GPT, a typical representative of AIGC, the authors analyze how GPT empowers the telecom sector in the form of scenarios and discuss the gap between current general-purpose GPT models and telecom services. They propose, for the first time, a Telco Augmented Cognition capability system, answer how to construct a telecom-service GPT for the telecom sector, and carry out various practices.

PatchBackdoor: Backdoor Attack against Deep Neural Networks without Model Modification

1 code implementation • 22 Aug 2023 • Yizhen Yuan, Rui Kong, Shenghao Xie, Yuanchun Li, Yunxin Liu

However, most backdoor attacks have to modify the neural network models through training with poisoned data and/or direct model editing, which leads to a common but false belief that backdoor attack can be easily avoided by properly protecting the model.

Backdoor Attack • Model Editing

AutoDroid: LLM-powered Task Automation in Android

no code implementations • 29 Aug 2023 • Hao Wen, Yuanchun Li, Guohong Liu, Shanhui Zhao, Tao Yu, Toby Jia-Jun Li, Shiqi Jiang, Yunhao Liu, Yaqin Zhang, Yunxin Liu

Mobile task automation is an attractive technique that aims to enable voice-based hands-free user interaction with smartphones.

Language Modelling

SwapMoE: Efficient Memory-Constrained Serving of Large Sparse MoE Models via Dynamic Expert Pruning and Swapping

no code implementations • 29 Aug 2023 • Rui Kong, Yuanchun Li, Qingtian Feng, Weijun Wang, Linghe Kong, Yunxin Liu

The main idea of SwapMoE is to keep a small dynamic set of important experts, namely Virtual Experts, in the main memory for inference, while seamlessly maintaining how the Virtual Experts map to the actual experts.

object-detection • Object Detection
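The Virtual Experts bookkeeping can be sketched as a tiny cache with a routing table: only k experts are resident in main memory, every requested expert is mapped to a resident one, and cold experts are swapped in when their importance grows. The importance signal and swap policy below are invented placeholders:

```python
class VirtualExpertPool:
    def __init__(self, all_experts, k):
        self.all_experts = all_experts                 # expert_id -> weights
        self.resident = dict(list(all_experts.items())[:k])
        self.k = k
        self.swaps = 0

    def route(self, expert_id):
        """Map a requested expert onto a resident Virtual Expert."""
        if expert_id in self.resident:
            return expert_id
        # Fall back to the nearest resident id (stand-in for similarity).
        return min(self.resident, key=lambda r: abs(r - expert_id))

    def update_importance(self, hot_ids):
        """Swap hot experts in, evicting a cold resident for each."""
        for eid in hot_ids:
            if eid not in self.resident and len(self.resident) >= self.k:
                cold = next(r for r in self.resident if r not in hot_ids)
                del self.resident[cold]
                self.resident[eid] = self.all_experts[eid]
                self.swaps += 1

pool = VirtualExpertPool({i: f"w{i}" for i in range(8)}, k=2)
assert pool.route(0) == 0          # resident expert: served directly
mapped = pool.route(5)             # non-resident: rerouted to a neighbour
pool.update_importance([5])        # expert 5 becomes important
print(mapped, pool.route(5), pool.swaps)  # → 1 5 1
```

Inference never blocks on a swap: requests are always served by some resident expert, and the resident set drifts toward the currently important ones.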

Generative Model for Models: Rapid DNN Customization for Diverse Tasks and Resource Constraints

no code implementations • 29 Aug 2023 • Wenxing Xu, Yuanchun Li, Jiacheng Liu, Yi Sun, Zhengyang Cao, Yixuan Li, Hao Wen, Yunxin Liu

Unlike cloud-based deep learning models that are often large and uniform, edge-deployed models usually demand customization for domain-specific tasks and resource-limited environments.

Image Classification • object-detection +1

Accelerating In-Browser Deep Learning Inference on Diverse Edge Clients through Just-in-Time Kernel Optimizations

no code implementations • 16 Sep 2023 • Fucheng Jia, Shiqi Jiang, Ting Cao, Wei Cui, Tianrui Xia, Xu Cao, Yuanchun Li, Deyu Zhang, Ju Ren, Yunxin Liu, Lili Qiu, Mao Yang

Web applications are increasingly becoming the primary platform for AI service delivery, making in-browser deep learning (DL) inference more prominent.

BiSwift: Bandwidth Orchestrator for Multi-Stream Video Analytics on Edge

no code implementations • 25 Dec 2023 • Lin Sun, Weijun Wang, Tingting Yuan, Liang Mi, Haipeng Dai, Yunxin Liu, XiaoMing Fu

To achieve this goal, we propose BiSwift, a bi-level framework that scales the concurrent real-time video analytics by a novel adaptive hybrid codec integrated with multi-level pipelines, and a global bandwidth controller for multiple video streams.

Fairness • Management +3

Personal LLM Agents: Insights and Survey about the Capability, Efficiency and Security

2 code implementations • 10 Jan 2024 • Yuanchun Li, Hao Wen, Weijun Wang, Xiangyu Li, Yizhen Yuan, Guohong Liu, Jiacheng Liu, Wenxing Xu, Xiang Wang, Yi Sun, Rui Kong, Yile Wang, Hanfei Geng, Jian Luan, Xuefeng Jin, Zilong Ye, Guanjing Xiong, Fan Zhang, Xiang Li, Mengwei Xu, Zhijun Li, Peng Li, Yang Liu, Ya-Qin Zhang, Yunxin Liu

Next, we discuss several key challenges to achieve intelligent, efficient and secure Personal LLM Agents, followed by a comprehensive survey of representative solutions to address these challenges.

A Survey of Resource-efficient LLM and Multimodal Foundation Models

1 code implementation • 16 Jan 2024 • Mengwei Xu, Wangsong Yin, Dongqi Cai, Rongjie Yi, Daliang Xu, QiPeng Wang, Bingyang Wu, Yihao Zhao, Chen Yang, Shihe Wang, Qiyang Zhang, Zhenyan Lu, Li Zhang, Shangguang Wang, Yuanchun Li, Yunxin Liu, Xin Jin, Xuanzhe Liu

Large foundation models, including large language models (LLMs), vision transformers (ViTs), diffusion, and LLM-based multimodal models, are revolutionizing the entire machine learning lifecycle, from training to deployment.
