1 code implementation • 24 May 2024 • Leyuan Wang, Liuyu Xiang, Yunlong Wang, Huijia Wu, Zhaofeng He
We argue that the imbalance between old task and new task data contributes to forgetting of the old tasks.
1 code implementation • 24 May 2024 • Leyuan Wang, Liuyu Xiang, Yujie Wei, Yunlong Wang, Zhaofeng He
Online Lifelong Learning (OLL) addresses the challenge of learning from continuous and non-stationary data streams.
1 code implementation • 6 Oct 2022 • Yujia Zhai, Chengquan Jiang, Leyuan Wang, Xiaoying Jia, Shang Zhang, Zizhong Chen, Xin Liu, Yibo Zhu
In this paper, we present ByteTransformer, a high-performance transformer boosted for variable-length inputs.
no code implementations • 25 Oct 2021 • Jiarong Xing, Leyuan Wang, Shang Zhang, Jack Chen, Ang Chen, Yibo Zhu
Today's auto-tuners (e.g., AutoTVM, Ansor) generate efficient tensor programs by navigating a large search space to identify effective implementations, but they do so with opaque hardware details.
1 code implementation • 29 Jun 2021 • Leyuan Wang, Kunbo Zhang, Yunlong Wang, Zhenan Sun
To accommodate users at different distances, it is necessary to control focus quickly and accurately.
no code implementations • 21 Jan 2021 • Jian Weng, Animesh Jain, Jie Wang, Leyuan Wang, Yida Wang, Tony Nowatzki
However, it is hard to leverage mixed precision without hardware support because of the overhead of data casting.
1 code implementation • 20 Nov 2020 • Zhewei Yao, Zhen Dong, Zhangcheng Zheng, Amir Gholami, Jiali Yu, Eric Tan, Leyuan Wang, Qijing Huang, Yida Wang, Michael W. Mahoney, Kurt Keutzer
Current low-precision quantization algorithms often have the hidden cost of conversion back and forth from floating point to quantized integer values.
1 code implementation • 1 Sep 2020 • Leyuan Wang, Kunbo Zhang, Min Ren, Yunlong Wang, Zhenan Sun
The quality metric proposed in this paper can significantly improve the performance of the recognition algorithm while reducing the number of images discarded for recognition, which is an advantage over iris quality assessment methods based on hand-crafted factors.
no code implementations • 18 Mar 2020 • Yu Tian, Kunbo Zhang, Leyuan Wang, Zhenan Sun
Extensive experiments demonstrate the advantages of the PAAS technique to counter diverse face spoofing attacks (print, replay, mask) in uncontrolled indoor and outdoor conditions by learning polarized face images of 33 people.
no code implementations • 1 Mar 2020 • Leyuan Wang, John D. Owens
In this paper, we propose a GPU-efficient subgraph isomorphism algorithm using the Gunrock graph analytic framework, GSM (Gunrock Subgraph Matching), to compute graph matching on GPUs.
Distributed, Parallel, and Cluster Computing
1 code implementation • 12 Feb 2018 • Tianqi Chen, Thierry Moreau, Ziheng Jiang, Lianmin Zheng, Eddie Yan, Meghan Cowan, Haichen Shen, Leyuan Wang, Yuwei Hu, Luis Ceze, Carlos Guestrin, Arvind Krishnamurthy
Experimental results show that TVM delivers performance across hardware back-ends that is competitive with state-of-the-art, hand-tuned libraries for low-power CPUs, mobile GPUs, and server-class GPUs.