Search Results for author: Raghav Addanki

Found 1 papers, 0 papers with code

One Pass Streaming Algorithm for Super Long Token Attention Approximation in Sublinear Space

no code implementations24 Nov 2023 Raghav Addanki, Chenyang Li, Zhao Song, Chiwun Yang

Considering a single-layer self-attention with Query, Key, and Value matrices $Q, K, V \in \mathbb{R}^{n \times d}$, the polynomial method approximates the attention output $T \in \mathbb{R}^{n \times d}$.

Attribute

Cannot find the paper you are looking for? You can Submit a new open access paper.