Search Results for author: Mark-Christoph Müller

Found 5 papers, 4 papers with code

pyMMAX2: Deep Access to MMAX2 Projects from Python

1 code implementation COLING (LAW) 2020 Mark-Christoph Müller

pyMMAX2 is an API for processing MMAX2 stand-off annotation data in Python.

Reconstructing Manual Information Extraction with DB-to-Document Backprojection: Experiments in the Life Science Domain

no code implementations EMNLP (sdp) 2020 Mark-Christoph Müller, Sucheta Ghosh, Maja Rey, Ulrike Wittig, Wolfgang Müller, Michael Strube

We introduce a novel scientific document processing task for making previously inaccessible information in printed paper documents available to automatic processing.

Word-Level Alignment of Paper Documents with their Electronic Full-Text Counterparts

1 code implementation NAACL (BioNLP) 2021 Mark-Christoph Müller, Sucheta Ghosh, Ulrike Wittig, Maja Rey

We describe a simple procedure for the automatic creation of word-level alignments between printed documents and their respective full-text versions.

Optical Character Recognition

Semantic Matching of Documents from Heterogeneous Collections: A Simple and Transparent Method for Practical Applications

1 code implementation29 Apr 2019 Mark-Christoph Müller

We present a very simple, unsupervised method for the pairwise matching of documents from heterogeneous collections.

General Classification

Cannot find the paper you are looking for? You can Submit a new open access paper.