Search Results for author: Xucheng Ye

Found 6 papers, 2 papers with code

KwaiYiiMath: Technical Report

no code implementations • 11 Oct 2023 • Jiayi Fu, Lei Lin, Xiaoyang Gao, Pengli Liu, Zhengzong Chen, Zhirui Yang, ShengNan Zhang, Xue Zheng, Yan Li, Yuliang Liu, Xucheng Ye, Yiqiao Liao, Chao Liao, Bin Chen, Chengru Song, Junchen Wan, Zijia Lin, Fuzheng Zhang, Zhongyuan Wang, Di Zhang, Kun Gai

Recent advancements in large language models (LLMs) have demonstrated remarkable abilities in handling a variety of natural language processing (NLP) downstream tasks, even on mathematical tasks requiring multi-step reasoning.

Ranked #87 on Arithmetic Reasoning on GSM8K (using extra training data)

Arithmetic Reasoning GSM8K +1

FedSkel: Efficient Federated Learning on Heterogeneous Systems with Skeleton Gradients Update

1 code implementation • 20 Aug 2021 • Junyu Luo, Jianlei Yang, Xucheng Ye, Xin Guo, Weisheng Zhao

Federated learning aims to protect users' privacy while performing data analysis across different participants.

Federated Learning

S2Engine: A Novel Systolic Architecture for Sparse Convolutional Neural Networks

2 code implementations • 15 Jun 2021 • Jianlei Yang, Wenzhi Fu, Xingzhou Cheng, Xucheng Ye, Pengcheng Dai, Weisheng Zhao

Convolutional neural networks (CNNs) have achieved great success in performing cognitive tasks.

RoSearch: Search for Robust Student Architectures When Distilling Pre-trained Language Models

no code implementations • 7 Jun 2021 • Xin Guo, Jianlei Yang, Haoyi Zhou, Xucheng Ye, JianXin Li

In order to overcome these security problems, RoSearch is proposed as a comprehensive framework to search for student models with better adversarial robustness when performing knowledge distillation.

Adversarial Robustness Knowledge Distillation +1

Accelerating CNN Training by Pruning Activation Gradients

no code implementations • ECCV 2020 • Xucheng Ye, Pengcheng Dai, Junyu Luo, Xin Guo, Yingjie Qi, Jianlei Yang, Yiran Chen

Sparsification is an efficient approach to accelerate CNN inference, but it is challenging to take advantage of sparsity in the training procedure because the involved gradients change dynamically.
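
The snippet above only motivates the problem; the idea named in the title (pruning activation gradients during training) can be sketched as zeroing gradient entries whose magnitude falls below a threshold in the backward pass, so the backward computation can exploit sparsity. The PyTorch tensor-hook mechanism and the threshold value below are illustrative assumptions, not the authors' implementation.

import torch
import torch.nn as nn

def prune_small_gradients(grad, threshold=1e-3):
    # Assumed sparsification rule: zero out activation-gradient entries below the threshold.
    return grad * (grad.abs() >= threshold)

# Toy convolutional layer; the hook sparsifies the gradient of its output activation.
conv = nn.Conv2d(3, 8, kernel_size=3, padding=1)
x = torch.randn(4, 3, 32, 32, requires_grad=True)

out = conv(x)
out.register_hook(lambda g: prune_small_gradients(g, threshold=1e-3))

loss = out.pow(2).mean()
loss.backward()  # the pruned (sparse) activation gradient is what propagates back through the convolution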
