Search Results for author: Shixuan Sun

Found 3 papers, 2 papers with code

vTensor: Flexible Virtual Tensor Management for Efficient LLM Serving

1 code implementation22 Jul 2024 Jiale Xu, Rui Zhang, Cong Guo, Weiming Hu, Zihan Liu, Feiyang Wu, Yu Feng, Shixuan Sun, Changxu Shao, Yuhong Guo, Junping Zhao, Ke Zhang, Minyi Guo, Jingwen Leng

This study introduces the vTensor, an innovative tensor structure for LLM inference based on GPU virtual memory management (VMM).

Management

CANDY: A Benchmark for Continuous Approximate Nearest Neighbor Search with Dynamic Data Ingestion

1 code implementation28 Jun 2024 Xianzhi Zeng, Zhuoyan Wu, Xinjing Hu, Xuanhua Shi, Shixuan Sun, Shuhao Zhang

Although numerous AKNN algorithms and benchmarks have been developed recently to evaluate their effectiveness, the dynamic nature of real-world data presents significant challenges that existing benchmarks fail to address.

Information Retrieval Retrieval

Efficient Deep Learning Pipelines for Accurate Cost Estimations Over Large Scale Query Workload

no code implementations23 Mar 2021 Johan Kok Zhi Kang, Gaurav, Sien Yi Tan, Feng Cheng, Shixuan Sun, Bingsheng He

The use of deep learning models for forecasting the resource consumption patterns of SQL queries have recently been a popular area of study.

Cannot find the paper you are looking for? You can Submit a new open access paper.