Search Results for author: Nate True

Found 1 papers, 1 papers with code

FastVLM: Efficient Vision Encoding for Vision Language Models

1 code implementation17 Dec 2024 Pavan Kumar Anasosalu Vasu, Fartash Faghri, Chun-Liang Li, Cem Koc, Nate True, Albert Antony, Gokul Santhanam, James Gabriel, Peter Grasch, Oncel Tuzel, Hadi Pouransari

At different operational resolutions, the vision encoder of a VLM can be optimized along two axes: reducing encoding latency and minimizing the number of visual tokens passed to the LLM, thereby lowering overall latency.

Cannot find the paper you are looking for? You can Submit a new open access paper.