26 Mar 2024 • Andrey Gromov, Kushal Tirumala, Hassan Shapourian, Paolo Glorioso, Daniel A. Roberts
We empirically study a simple layer-pruning strategy for popular families of open-weight pretrained LLMs, finding minimal degradation of performance on different question-answering benchmarks until after a large fraction (up to half) of the layers are removed.
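As a rough illustration of what such layer pruning looks like in practice, here is a minimal sketch of removing a contiguous block of decoder layers from a Llama-style model. The attribute path `model.model.layers`, the model checkpoint, and the choice of which block to drop are assumptions for illustration, not the authors' exact procedure (the paper's layer-selection criterion is not reproduced here).

```python
# Minimal layer-pruning sketch (illustrative, not the paper's exact method).
# Assumes a Hugging Face Llama-style model where decoder blocks live in the
# nn.ModuleList at `model.model.layers`.
import torch
from transformers import AutoModelForCausalLM

def drop_layers(model, start, end):
    """Remove decoder layers with indices in [start, end)."""
    layers = model.model.layers  # nn.ModuleList of decoder blocks
    kept = [layer for i, layer in enumerate(layers) if not (start <= i < end)]
    model.model.layers = torch.nn.ModuleList(kept)
    # Keep the config consistent with the new depth.
    model.config.num_hidden_layers = len(kept)
    return model

# Hypothetical usage: prune a block spanning half the network.
model = AutoModelForCausalLM.from_pretrained("meta-llama/Llama-2-7b-hf")
n = model.config.num_hidden_layers
model = drop_layers(model, start=n // 4, end=n // 4 + n // 2)
```

Note that which block is removed matters: performance after pruning depends on where in the network the layers are taken from, so a real application would pair this operation with a principled selection criterion rather than a fixed slice.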