Search Results for author: John Gkountouras

INT2.1: Towards Fine-Tunable Quantized Large Language Models with Error Correction through Low-Rank Adaptation

We introduce a method that dramatically reduces fine-tuning VRAM requirements and rectifies quantization errors in quantized Large Language Models.

2,523

Paper
Code

The lack of interpretability of the Vision Transformer may hinder its use in critical real-world applications despite its effectiveness.

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.