Search Results for author: Weiguo Liu

Found 3 papers, 2 papers with code

Evaluating Small Language Models for News Summarization: Implications and Factors Influencing Performance

1 code implementation • 2 Feb 2025 • Borui Xu, Yao Chen, Zeyi Wen, Weiguo Liu, Bingsheng He

This research not only contributes to the understanding of SLMs but also provides practical insights for researchers seeking efficient summarization solutions that balance performance and resource use.

News Summarization

FastAttention: Extend FlashAttention2 to NPUs and Low-resource GPUs

no code implementations • 22 Oct 2024 • Haoran Lin, Xianzhi Yu, Kang Zhao, Lu Hou, Zongyuan Zhan, Stanislav Kamenev, Han Bao, Ting Hu, Mingkai Wang, Qixin Chang, Siyue Sui, Weihao Sun, Jiaxin Hu, Jun Yao, Zekun Yin, Cheng Qian, Ying Zhang, Yinfei Pan, Yu Yang, Weiguo Liu

In this work, we propose FastAttention which pioneers the adaptation of FlashAttention series for NPUs and low-resource GPUs to boost LLM inference efficiency.

WarpCore: A Library for fast Hash Tables on GPUs

1 code implementation • 16 Sep 2020 • Daniel Jünger, Robin Kobus, André Müller, Christian Hundt, Kai Xu, Weiguo Liu, Bertil Schmidt

The rapidly growing amount of data emerging in many fields motivated the need for accelerated hash tables designed for modern parallel architectures.

Distributed, Parallel, and Cluster Computing
