Search Results for author: Aakash Lahoti

Found 3 papers, 2 papers with code

Hydra: Bidirectional State Space Models Through Generalized Matrix Mixers

1 code implementation • 13 Jul 2024 • Sukjun Hwang, Aakash Lahoti, Tri Dao, Albert Gu

We identify a key axis of matrix parameterizations termed sequence alignment, which increases the flexibility and performance of matrix mixers, providing insights into the strong performance of Transformers and recent SSMs such as Mamba.

Sharpened Lazy Incremental Quasi-Newton Method

1 code implementation • 26 May 2023 • Aakash Lahoti, Spandan Senapati, Ketan Rajawat, Alec Koppel

Specifically, quasi-Newton methods exhibit a superlinear rate with $O(d^2)$ cost, in contrast to the linear rate of first-order methods with $O(d)$ cost and the quadratic rate of second-order methods with $O(d^3)$ cost.

Second-order methods
