Search Results for author: Driss Guessous

Found 1 papers, 0 papers with code

Flex Attention: A Programming Model for Generating Optimized Attention Kernels

no code implementations7 Dec 2024 Juechu Dong, Boyuan Feng, Driss Guessous, Yanbo Liang, Horace He

We introduce FlexAttention, a novel compiler-driven programming model that allows implementing the majority of attention variants in a few lines of idiomatic PyTorch code.

Cannot find the paper you are looking for? You can Submit a new open access paper.