Search Results for author: Veton Matoshi

Found 3 papers, 2 papers with code

SCALE: Scaling up the Complexity for Advanced Language Model Evaluation

2 code implementations • 15 Jun 2023 • Vishvaksenan Rasiah, Ronja Stern, Veton Matoshi, Matthias Stürmer, Ilias Chalkidis, Daniel E. Ho, Joel Niklaus

In this paper, we introduce a novel NLP benchmark that poses challenges to current LLMs across four key dimensions: processing long documents (up to 50K tokens), utilizing domain-specific knowledge (embodied in legal texts), multilingual understanding (covering five languages), and multitasking (comprising legal document-to-document Information Retrieval, Court View Generation, Leading Decision Summarization, Citation Extraction, and eight challenging Text Classification tasks).

Information Retrieval, Language Modelling, +2

MultiLegalPile: A 689GB Multilingual Legal Corpus

no code implementations • 3 Jun 2023 • Joel Niklaus, Veton Matoshi, Matthias Stürmer, Ilias Chalkidis, Daniel E. Ho

Large, high-quality datasets are crucial for training Large Language Models (LLMs).

LEXTREME: A Multi-Lingual and Multi-Task Benchmark for the Legal Domain

1 code implementation • 30 Jan 2023 • Joel Niklaus, Veton Matoshi, Pooja Rani, Andrea Galassi, Matthias Stürmer, Ilias Chalkidis

To provide a fair comparison, we propose two aggregate scores, one based on the datasets and one on the languages.

XLM-R
