no code implementations • LChange (ACL) 2022 • Iiro Rastas, Yann Ciarán Ryan, Iiro Tiihonen, Mohammadreza Qaraei, Liina Repo, Rohit Babbar, Eetu Mäkelä, Mikko Tolonen, Filip Ginter
In this paper, we describe a BERT model trained on the Eighteenth Century Collections Online (ECCO) dataset of digitized documents.
Optical Character Recognition (OCR)
no code implementations • 8 Feb 2023 • David Rosson, Eetu Mäkelä, Ville Vaara, Ananth Mahadevan, Yann Ryan, Mikko Tolonen
The Reception Reader is a web tool for studying text reuse in the Early English Books Online (EEBO-TCP) and Eighteenth Century Collections Online (ECCO) data.
no code implementations • 17 Mar 2021 • Tanja Säily, Eetu Mäkelä, Mika Hämäläinen
We study neologism use in two samples of early English correspondence, from 1640–1660 and 1760–1780.
no code implementations • 9 Nov 2016 • Kimmo Kettunen, Eetu Mäkelä, Teemu Ruokolainen, Juha Kuokkala, Laura Löfberg
In this paper, we report the first large-scale trials and evaluation of NER with data from Digi, a digitized Finnish historical newspaper collection.