Search Results for author: Faysal Ishtiaq

Found 2 papers, 0 papers with code

CompactifAI: Extreme Compression of Large Language Models using Quantum-Inspired Tensor Networks

no code implementations25 Jan 2024 Andrei Tomut, Saeed S. Jahromi, Sukhbinder Singh, Faysal Ishtiaq, Cesar Muñoz, Prabdeep Singh Bajaj, Ali Elborady, Gianni Del Bimbo, Mehrazin Alizadeh, David Montero, Pablo Martin-Ramiro, Muhammad Ibrahim, Oussama Tahiri Alaoui, John Malcolm, Samuel Mugel, Roman Orus

Large Language Models (LLMs) such as ChatGPT and LlaMA are advancing rapidly in generative Artificial Intelligence (AI), but their immense size poses significant challenges, such as huge training and inference costs, substantial energy demands, and limitations for on-site deployment.

Model Compression Quantization +1

Cannot find the paper you are looking for? You can Submit a new open access paper.