no code implementations • LREC 2012 • Stefania Degaetano-Ortlieb, Ekaterina Lapshinova-Koltunski, Elke Teich
In this paper, we present corpus-based procedures to semi-automatically discover features relevant for the study of recent language change in scientific registers.
no code implementations • LREC 2014 • Stefania Degaetano-Ortlieb, Peter Fankhauser, Hannah Kermes, Ekaterina Lapshinova-Koltunski, Noam Ordan, Elke Teich
We present a methodology to analyze the linguistic evolution of scientific registers with data mining techniques, comparing the insights gained from shallow vs. linguistic features.
no code implementations • LREC 2014 • Peter Fankhauser, J{\"o}rg Knappen, Elke Teich
Language resources are often compiled for the purpose of variational analysis, such as studying differences between genres, registers, and disciplines, regional and diachronic variation, influence of gender, cultural context, etc.
no code implementations • LREC 2016 • Hannah Kermes, Stefania Degaetano-Ortlieb, Ashraf Khamis, J{\"o}rg Knappen, Elke Teich
We present the Royal Society Corpus (RSC) built from the Philosophical Transactions and Proceedings of the Royal Society of London.
no code implementations • COLING 2016 • Raphael Rubino, Stefania Degaetano-Ortlieb, Elke Teich, Josef van Genabith
In this paper we investigate the introduction of information theory inspired features to study long term diachronic change on three levels: lexis, part-of-speech and syntax.
no code implementations • WS 2017 • Stefania Degaetano-Ortlieb, Elke Teich
We present a data-driven approach to investigate intra-textual variation by combining entropy and surprisal.
no code implementations • COLING 2018 • Stefania Degaetano-Ortlieb, Elke Teich
We present a data-driven approach to detect periods of linguistic change and the lexical and grammatical features contributing to change.
no code implementations • WS 2019 • Yuri Bizzoni, Stefania Degaetano-Ortlieb, Katrin Menzel, Pauline Krielke, Elke Teich
The paper showcases the application of word embeddings to change in language use in the domain of science, focusing on the Late Modern English period (17-19th century).
no code implementations • LREC 2020 • Stefan Fischer, J{\"o}rg Knappen, Katrin Menzel, Elke Teich
We present a new, extended version of the Royal Society Corpus (RSC), a diachronic corpus of scientific English now covering 300+ years of scientific writing (1665--1996).
no code implementations • WS 2020 • Yuri Bizzoni, Tom S Juzek, Cristina Espa{\~n}a-Bonet, Koel Dutta Chowdhury, Josef van Genabith, Elke Teich
Some translationese features tend to appear in simultaneous interpreting with higher frequency than in human text translation, but the reasons for this are unclear.
no code implementations • EMNLP (LaTeCHCLfL, CLFL, LaTeCH) 2021 • Yuri Bizzoni, Stefania Degaetano-Ortlieb, Katrin Menzel, Elke Teich
Tracing the influence of individuals or groups in social networks is an increasingly popular task in sociolinguistic studies.
no code implementations • UDW (COLING) 2020 • Tom S Juzek, Marie-Pauline Krielke, Elke Teich
Our starting assumption is that over time, Scientific English develops specific syntactic choice preferences that increase efficiency in (expert-to-expert) communication.
no code implementations • LREC 2022 • Heike Przybyl, Ekaterina Lapshinova-Koltunski, Katrin Menzel, Stefan Fischer, Elke Teich
In this paper, we describe the creation and annotation of EPIC UdS, a multilingual corpus of simultaneous interpreting for English, German and Spanish.