Search Results for author: Guyue Huang

Found 5 papers, 4 papers with code

LightSeq2: Accelerated Training for Transformer-based Models on GPUs

1 code implementation • 12 Oct 2021 • Xiaohui Wang, Yang Wei, Ying Xiong, Guyue Huang, Xian Qian, Yufei Ding, Mingxuan Wang, Lei LI

In this paper, we present LightSeq2, a system to accelerate training for a general family of Transformer models on GPUs.

Machine Translation Speech Recognition +1

3,088

Paper
Code

GE-SpMM: General-purpose Sparse Matrix-Matrix Multiplication on GPUs for Graph Neural Networks

2 code implementations • 7 Jul 2020 • Guyue Huang, Guohao Dai, Yu Wang, Huazhong Yang

GE-SpMM performs SpMM-like operation on sparse matrices represented in the most common Compressed Sparse Row (CSR) format, so it can be embedded in GNN frameworks with no preprocessing overheads and support general GNN algorithms.

Distributed, Parallel, and Cluster Computing

1,679

Paper
Code

TC-GNN: Bridging Sparse GNN Computation and Dense Tensor Cores on GPUs

2 code implementations • 3 Dec 2021 • yuke wang, Boyuan Feng, Zheng Wang, Guyue Huang, Yufei Ding

Recently, graph neural networks (GNNs), as the backbone of graph-based machine learning, demonstrate great success in various domains (e. g., e-commerce).

Translation

Paper
Code

Machine Learning for Electronic Design Automation: A Survey

1 code implementation • 10 Jan 2021 • Guyue Huang, Jingbo Hu, Yifan He, Jialong Liu, Mingyuan Ma, Zhaoyang Shen, Juejian Wu, Yuanfan Xu, Hengrui Zhang, Kai Zhong, Xuefei Ning, Yuzhe ma, HaoYu Yang, Bei Yu, Huazhong Yang, Yu Wang

With the down-scaling of CMOS technology, the design complexity of very large-scale integrated (VLSI) is increasing.

BIG-bench Machine Learning

Paper
Code

Understanding GNN Computational Graph: A Coordinated Computation, IO, and Memory Perspective

no code implementations • 18 Oct 2021 • Hengrui Zhang, Zhongming Yu, Guohao Dai, Guyue Huang, Yufei Ding, Yuan Xie, Yu Wang

The same data are propagated through the graph structure to perform the same neural operation multiple times in GNNs, leading to redundant computation which accounts for 92. 4% of total operators.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.