no code implementations • EMNLP (sdp) 2020 • Mark-Christoph Müller, Sucheta Ghosh, Maja Rey, Ulrike Wittig, Wolfgang Müller, Michael Strube
We introduce a novel scientific document processing task for making previously inaccessible information in printed paper documents available to automatic processing.
1 code implementation • COLING (LAW) 2020 • Mark-Christoph Müller
pyMMAX2 is an API for processing MMAX2 stand-off annotation data in Python.
1 code implementation • NAACL (BioNLP) 2021 • Mark-Christoph Müller, Sucheta Ghosh, Ulrike Wittig, Maja Rey
We describe a simple procedure for the automatic creation of word-level alignments between printed documents and their respective full-text versions.
1 code implementation • 29 Apr 2019 • Mark-Christoph Müller
We present a very simple, unsupervised method for the pairwise matching of documents from heterogeneous collections.
1 code implementation • COLING 2018 • Mark-Christoph Müller, Michael Strube
We present WOMBAT, a Python tool which supports NLP practitioners in accessing word embeddings from code.