Search Results for author: Yixuan Mei

Found 2 papers, 0 papers with code

Helix: Distributed Serving of Large Language Models via Max-Flow on Heterogeneous GPUs

no code implementations3 Jun 2024 Yixuan Mei, Yonghao Zhuang, Xupeng Miao, Juncheng Yang, Zhihao Jia, Rashmi Vinayak

This paper introduces Helix, a distributed system for high-throughput, low-latency large language model (LLM) serving on heterogeneous GPU clusters.

Language Modeling Language Modelling +2

Quarl: A Learning-Based Quantum Circuit Optimizer

no code implementations17 Jul 2023 Zikun Li, Jinjun Peng, Yixuan Mei, Sina Lin, Yi Wu, Oded Padon, Zhihao Jia

Applying reinforcement learning (RL) to quantum circuit optimization raises two main challenges: the large and varying action space and the non-uniform state representation.

Reinforcement Learning (RL)

Cannot find the paper you are looking for? You can Submit a new open access paper.