Search Results for author: Veton Matoshi

Found 3 papers, 2 papers with code

SCALE: Scaling up the Complexity for Advanced Language Model Evaluation

2 code implementations • 15 Jun 2023 • Vishvaksenan Rasiah, Ronja Stern, Veton Matoshi, Matthias Stürmer, Ilias Chalkidis, Daniel E. Ho, Joel Niklaus

In this paper, we introduce a novel NLP benchmark that poses challenges to current LLMs across four key dimensions: processing long documents (up to 50K tokens), utilizing domain specific knowledge (embodied in legal texts), multilingual understanding (covering five languages), and multitasking (comprising legal document to document Information Retrieval, Court View Generation, Leading Decision Summarization, Citation Extraction, and eight challenging Text Classification tasks).

Information Retrieval Language Modelling +2

Paper
Code

MultiLegalPile: A 689GB Multilingual Legal Corpus

no code implementations • 3 Jun 2023 • Joel Niklaus, Veton Matoshi, Matthias Stürmer, Ilias Chalkidis, Daniel E. Ho

Large, high-quality datasets are crucial for training Large Language Models (LLMs).

Paper
Add Code

LEXTREME: A Multi-Lingual and Multi-Task Benchmark for the Legal Domain

1 code implementation • 30 Jan 2023 • Joel Niklaus, Veton Matoshi, Pooja Rani, Andrea Galassi, Matthias Stürmer, Ilias Chalkidis

To provide a fair comparison, we propose two aggregate scores, one based on the datasets and one on the languages.

XLM-R

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.