Search Results for author: Xipeng Shen

Found 16 papers, 2 papers with code

SmartMem: Layout Transformation Elimination and Adaptation for Efficient DNN Execution on Mobile

no code implementations21 Apr 2024 Wei Niu, Md Musfiqur Rahman Sanim, Zhihao Shu, Jiexiong Guan, Xipeng Shen, Miao Yin, Gagan Agrawal, Bin Ren

Focusing on emerging transformers (specifically the ones with computationally efficient Swin-like architectures) and large models (e. g., Stable Diffusion and LLMs) based on transformers, we observe that layout transformations between the computational operators cause a significant slowdown in these applications.

Efficient Large Language Models Fine-Tuning On Graphs

no code implementations7 Dec 2023 Rui Xue, Xipeng Shen, Ruozhou Yu, Xiaorui Liu

In this work, we introduce a novel and efficient approach for the end-to-end fine-tuning of Large Language Models (LLMs) on TAGs, named LEADING.

Graph Learning

BitGNN: Unleashing the Performance Potential of Binary Graph Neural Networks on GPUs

no code implementations4 May 2023 Jou-An Chen, Hsin-Hsuan Sung, Xipeng Shen, Sutanay Choudhury, Ang Li

It fills the gap by proposing a series of abstractions and techniques to map binary GNNs and their computations best to fit the nature of bit manipulations on GPUs.

Survey: Exploiting Data Redundancy for Optimization of Deep Learning

no code implementations29 Aug 2022 Jou-An Chen, Wei Niu, Bin Ren, Yanzhi Wang, Xipeng Shen

It surveys hundreds of recent papers on the topic, introduces a novel taxonomy to put the various techniques into a single categorization framework, offers a comprehensive description of the main methods used for exploiting data redundancy in improving multiple kinds of DNNs on data, and points out a set of research opportunities for future to explore.

Survey

CoCoPIE XGen: A Full-Stack AI-Oriented Optimizing Framework

no code implementations21 Jun 2022 Xiaofeng Li, Bin Ren, Xipeng Shen, Yanzhi Wang

There is a growing demand for shifting the delivery of AI capability from data centers on the cloud to edge or end devices, exemplified by the fast emerging real-time AI-based apps running on smartphones, AR/VR devices, autonomous vehicles, and various IoT devices.

Autonomous Vehicles

Simple Augmentation Goes a Long Way: ADRL for DNN Quantization

no code implementations ICLR 2021 Lin Ning, Guoyang Chen, Weifeng Zhang, Xipeng Shen

This new strategy augments the neural networks in DRL with a complementary scheme to boost the performance of learning.

Quantization Reinforcement Learning (RL)

RT3D: Achieving Real-Time Execution of 3D Convolutional Neural Networks on Mobile Devices

no code implementations20 Jul 2020 Wei Niu, Mengshu Sun, Zhengang Li, Jou-An Chen, Jiexiong Guan, Xipeng Shen, Yanzhi Wang, Sijia Liu, Xue Lin, Bin Ren

The vanilla sparsity removes whole kernel groups, while KGS sparsity is a more fine-grained structured sparsity that enjoys higher flexibility while exploiting full on-device parallelism.

Code Generation Model Compression

CoCoPIE: Making Mobile AI Sweet As PIE --Compression-Compilation Co-Design Goes a Long Way

no code implementations14 Mar 2020 Shaoshan Liu, Bin Ren, Xipeng Shen, Yanzhi Wang

Assuming hardware is the major constraint for enabling real-time mobile intelligence, the industry has mainly dedicated their efforts to developing specialized hardware accelerators for machine learning and inference.

In-Place Zero-Space Memory Protection for CNN

1 code implementation NeurIPS 2019 Hui Guan, Lin Ning, Zhen Lin, Xipeng Shen, Huiyang Zhou, Seung-Hwan Lim

Convolutional Neural Networks (CNN) are being actively explored for safety-critical applications such as autonomous vehicles and aerospace, where it is essential to ensure the reliability of inference results in the presence of possible memory faults.

Autonomous Vehicles

How to "DODGE" Complex Software Analytics?

no code implementations5 Feb 2019 Amritanshu Agrawal, Wei Fu, Di Chen, Xipeng Shen, Tim Menzies

Machine learning techniques applied to software engineering tasks can be improved by hyperparameter optimization, i. e., automatic tools that find good settings for a learner's control parameters.

BIG-bench Machine Learning Hyperparameter Optimization

Hyperparameter Optimization for Effort Estimation

1 code implementation28 Apr 2018 Tianpei Xia, Rahul Krishna, Jianfeng Chen, George Mathew, Xipeng Shen, Tim Menzies

We test OIL on a wide range of hyperparameter optimizers using data from 945 software projects.

Software Engineering

Cannot find the paper you are looking for? You can Submit a new open access paper.