Search Results for author: Less Wright

Found 3 papers, 1 papers with code

Accelerating a Triton Fused Kernel for W4A16 Quantized Inference with SplitK work decomposition

no code implementations5 Jan 2024 Adnan Hoque, Less Wright, Chih-Chieh Yang, Mudhakar Srivatsa, Raghu Ganti

Our implementation shows improvement for the type of skinny matrix-matrix multiplications found in foundation model inference workloads.

Ranger21: a synergistic deep learning optimizer

2 code implementations25 Jun 2021 Less Wright, Nestor Demeure

As optimizers are critical to the performances of neural networks, every year a large number of papers innovating on the subject are published.

Cannot find the paper you are looking for? You can Submit a new open access paper.