Search Results for author: Zeke Wang

Found 5 papers, 2 papers with code

TorchGT: A Holistic System for Large-scale Graph Transformer Training

no code implementations19 Jul 2024 Meng Zhang, Jie Sun, Qinghao Hu, Peng Sun, Zeke Wang, Yonggang Wen, Tianwei Zhang

While there emerge inspiring algorithm advancements, their practical adoption is still limited, particularly on real-world graphs involving up to millions of nodes.

Graph Learning

DeFT: Decoding with Flash Tree-attention for Efficient Tree-structured LLM Inference

no code implementations30 Mar 2024 Jinwei Yao, Kaiqi Chen, Kexun Zhang, Jiaxuan You, Binhang Yuan, Zeke Wang, Tao Lin

Large language models (LLMs) are increasingly employed for complex tasks that process multiple generation calls in a tree structure with shared prefixes of tokens, including few-shot prompting, multi-step reasoning, speculative decoding, etc.

Benchmarking High Bandwidth Memory on FPGAs

2 code implementations9 May 2020 Zeke Wang, Hongjing Huang, Jie Zhang, Gustavo Alonso

FPGAs are starting to be enhanced with High Bandwidth Memory (HBM) as a way to reduce the memory bandwidth bottleneck encountered in some applications and to give the FPGA more capacity to deal with application state.

Hardware Architecture

Cannot find the paper you are looking for? You can Submit a new open access paper.