no code implementations • 15 Oct 2024 • Hugo Abonizio, Thales Sales Almeida, Thiago Laitz, Roseval Malaquias Junior, Giovana Kerche Bonás, Rodrigo Nogueira, Ramon Pires
This report presents Sabiá-3, our new flagship language model, and Sabiazinho-3, a more cost-effective sibling.
1 code implementation • 9 Feb 2024 • Fernando Ferraretto, Thiago Laitz, Roberto Lotufo, Rodrigo Nogueira
ExaRanker recently introduced an approach to training information retrieval (IR) models, incorporating natural language explanations as additional labels.
no code implementations • 12 Jan 2024 • Thiago Laitz, Konstantinos Papakostas, Roberto Lotufo, Rodrigo Nogueira
Despite multi-billion parameter neural rankers being common components of state-of-the-art information retrieval pipelines, they are rarely used in production due to the enormous amount of compute required for inference.
1 code implementation • 11 Jul 2023 • Thales Sales Almeida, Thiago Laitz, Giovana K. Bonás, Rodrigo Nogueira
One common trend in recent studies of language models (LMs) is the use of standardized tests for evaluation.
1 code implementation • 25 Jan 2023 • Fernando Ferraretto, Thiago Laitz, Roberto Lotufo, Rodrigo Nogueira
Recent work has shown that inducing a large language model (LLM) to generate explanations prior to outputting an answer is an effective strategy to improve performance on a wide range of reasoning tasks.
no code implementations • 26 Oct 2022 • Thales Sales Almeida, Thiago Laitz, João Seródio, Luiz Henrique Bonifacio, Roberto Lotufo, Rodrigo Nogueira
We compare our system with Microsoft's Biomedical Search and show that our design choices led to a much more cost-effective system with competitive QPS while having close to state-of-the-art results on a wide range of public benchmarks.