no code implementations • 6 Mar 2024 • Pierre Colombo, Telmo Pessoa Pires, Malik Boudiaf, Dominic Culver, Rui Melo, Caio Corro, Andre F. T. Martins, Fabrizio Esposito, Vera Lúcia Raposo, Sofia Morgado, Michael Desa
In this paper, we introduce SaulLM-7B, a large language model (LLM) tailored for the legal domain.
no code implementations • 4 Sep 2023 • Telmo Pessoa Pires, António V. Lopes, Yannick Assogba, Hendra Setiawan
The Transformer architecture has two main non-embedding components: Attention and the Feed Forward Network (FFN).
no code implementations • 4 May 2023 • Telmo Pessoa Pires, Robin M. Schmidt, Yi-Hsiu Liao, Stephan Peitz
Multilingual Machine Translation promises to improve translation quality between non-English languages.
no code implementations • 25 Apr 2023 • Ali Vardasbi, Telmo Pessoa Pires, Robin M. Schmidt, Stephan Peitz
Structured State Spaces for Sequences (S4) is a recently proposed sequence model with successful applications in various tasks, e. g. vision, language modeling, and audio.