1 code implementation • 2 Apr 2024 • Joel Niklaus, Lucia Zheng, Arya D. McCarthy, Christopher Hahn, Brian M. Rosen, Peter Henderson, Daniel E. Ho, Garrett Honke, Percy Liang, Christopher Manning
In this work, we curate LawInstruct, a large legal instruction dataset, covering 17 jurisdictions, 24 languages and a total of 12M examples.
no code implementations • 26 Feb 2024 • Santosh T. Y. S. S, Nina Baumgartner, Matthias Stürmer, Matthias Grabmair, Joel Niklaus
The assessment of explainability in Legal Judgement Prediction (LJP) systems is of paramount importance in building trustworthy and transparent systems, particularly considering the reliance of these systems on factors that may lack legal relevance or involve sensitive attributes.
1 code implementation • 6 Feb 2024 • Dor Bernsohn, Gil Semo, Yaron Vazana, Gila Hayat, Ben Hagag, Joel Niklaus, Rohit Saha, Kyryl Truskovskyi
In this study, we focus on two main tasks, the first for detecting legal violations within unstructured textual data, and the second for associating these violations with potentially affected individuals.
no code implementations • 7 Oct 2023 • Joel Niklaus, Robin Mamié, Matthias Stürmer, Daniel Brunner, Marcel Gygli
Releasing court decisions to the public relies on proper anonymization to protect all involved parties, where necessary.
1 code implementation • 15 Sep 2023 • Ramona Christen, Anastassia Shaitarova, Matthias Stürmer, Joel Niklaus
Resolving the scope of a negation within a sentence is a challenging NLP task.
1 code implementation • 22 Aug 2023 • Alex Nyffenegger, Matthias Stürmer, Joel Niklaus
Anonymity of both natural and legal persons in court rulings is a critical aspect of privacy protection in the European Union and Switzerland.
1 code implementation • NeurIPS 2023 • Neel Guha, Julian Nyarko, Daniel E. Ho, Christopher Ré, Adam Chilton, Aditya Narayana, Alex Chohlas-Wood, Austin Peters, Brandon Waldon, Daniel N. Rockmore, Diego Zambrano, Dmitry Talisman, Enam Hoque, Faiz Surani, Frank Fagan, Galit Sarfaty, Gregory M. Dickinson, Haggai Porat, Jason Hegland, Jessica Wu, Joe Nudell, Joel Niklaus, John Nay, Jonathan H. Choi, Kevin Tobia, Margaret Hagan, Megan Ma, Michael Livermore, Nikon Rasumov-Rahe, Nils Holzenberger, Noam Kolt, Peter Henderson, Sean Rehaag, Sharad Goel, Shang Gao, Spencer Williams, Sunny Gandhi, Tom Zur, Varun Iyer, Zehua Li
The advent of large language models (LLMs) and their adoption by the legal community has given rise to the question: what types of legal reasoning can LLMs perform?
2 code implementations • 15 Jun 2023 • Vishvaksenan Rasiah, Ronja Stern, Veton Matoshi, Matthias Stürmer, Ilias Chalkidis, Daniel E. Ho, Joel Niklaus
In this paper, we introduce a novel NLP benchmark that poses challenges to current LLMs across four key dimensions: processing long documents (up to 50K tokens), utilizing domain specific knowledge (embodied in legal texts), multilingual understanding (covering five languages), and multitasking (comprising legal document to document Information Retrieval, Court View Generation, Leading Decision Summarization, Citation Extraction, and eight challenging Text Classification tasks).
no code implementations • 3 Jun 2023 • Joel Niklaus, Veton Matoshi, Matthias Stürmer, Ilias Chalkidis, Daniel E. Ho
Large, high-quality datasets are crucial for training Large Language Models (LLMs).
1 code implementation • 2 May 2023 • Tobias Brugger, Matthias Stürmer, Joel Niklaus
Sentence Boundary Detection (SBD) is one of the foundational building blocks of Natural Language Processing (NLP), with incorrectly split sentences heavily influencing the output quality of downstream tasks.
1 code implementation • 30 Jan 2023 • Joel Niklaus, Veton Matoshi, Pooja Rani, Andrea Galassi, Matthias Stürmer, Ilias Chalkidis
To provide a fair comparison, we propose two aggregate scores, one based on the datasets and one on the languages.
no code implementations • 30 Nov 2022 • Joel Niklaus, Daniele Giofré
In specialized domains though (such as legal, scientific or biomedical), models often need to process very long text (sometimes well above 10000 tokens).
1 code implementation • 1 Nov 2022 • Gil Semo, Dor Bernsohn, Ben Hagag, Gila Hayat, Joel Niklaus
The research field of Legal Natural Language Processing (NLP) has been very active recently, with Legal Judgment Prediction (LJP) becoming one of the most extensively studied tasks.
2 code implementations • 25 Sep 2022 • Joel Niklaus, Matthias Stürmer, Ilias Chalkidis
We find that in both settings (legal areas, origin regions), models trained across all groups perform overall better, while they also have improved results in the worst-case scenarios.
1 code implementation • EMNLP (NLLP) 2021 • Joel Niklaus, Ilias Chalkidis, Matthias Stürmer
We evaluate state-of-the-art BERT-based methods including two variants of BERT that overcome the BERT input (text) length limitation (up to 512 tokens).
2 code implementations • 11 Jun 2019 • Joel Niklaus, Michele Alberti, Vinaychandran Pondenkandath, Rolf Ingold, Marcus Liwicki
Jass is a very popular card game in Switzerland and is closely connected with Swiss culture.