no code implementations • 20 Feb 2024 • Varad Pimpalkhute, John Heyer, Xusen Yin, Sameer Gupta
We investigate the integration of Large Language Models (LLMs) into query encoders to improve dense retrieval without increasing latency and cost, by circumventing the dependency on LLMs at inference time.