Search Results for author: Bilge Soran

Found 4 papers, 2 papers with code

SpinQuant: LLM quantization with learned rotations

3 code implementations26 May 2024 Zechun Liu, Changsheng Zhao, Igor Fedorov, Bilge Soran, Dhruv Choudhary, Raghuraman Krishnamoorthi, Vikas Chandra, Yuandong Tian, Tijmen Blankevoort

With 4-bit quantization of weight, activation, and KV-cache, SpinQuant narrows the accuracy gap on zero-shot reasoning tasks with full precision to merely 2. 9 points on the LLaMA-2 7B model, surpassing LLM-QAT by 19. 1 points and SmoothQuant by 25. 0 points.

Quantization

Generating Notifications for Missing Actions: Don't Forget to Turn the Lights Off!

no code implementations ICCV 2015 Bilge Soran, Ali Farhadi, Linda Shapiro

The overall prediction accuracy is 46. 2% when only 10 frames of an action are seen (2/3 of a sec).

Cannot find the paper you are looking for? You can Submit a new open access paper.