codelion/optillm • 26 Jul 2024
This paper introduces Patched MOA (Mixture of Agents), an inference optimization technique that significantly enhances the performance of large language models (LLMs) across diverse software development tasks.
Inference Optimization