Search Results for author: Xuanlei Zhao

Found 5 papers, 4 papers with code

Real-Time Video Generation with Pyramid Attention Broadcast

1 code implementation • 22 Aug 2024 • Xuanlei Zhao, Xiaolong Jin, Kai Wang, Yang You

We present Pyramid Attention Broadcast (PAB), a real-time, high-quality, and training-free approach for DiT-based video generation.

Video Generation

AutoChunk: Automated Activation Chunk for Memory-Efficient Long Sequence Inference

no code implementations • 19 Jan 2024 • Xuanlei Zhao, Shenggan Cheng, Guangyang Lu, Jiarui Fang, Haotian Zhou, Bin Jia, Ziming Liu, Yang You

The experiments demonstrate that AutoChunk can reduce over 80% of activation memory while maintaining speed loss within 10%, extend max sequence length by 3.2x to 11.7x, and outperform state-of-the-art methods by a large margin.

Code Generation
