no code implementations • LREC 2022 • Steven Moran, Christian Bentz, Ximena Gutierrez-Vasques, Olga Pelloni, Tanja Samardzic
We present the TeDDi sample, a diversity sample of text data for language comparison and multilingual Natural Language Processing.
1 code implementation • 28 Mar 2024 • Eleanor Chodroff, Blaž Pažon, Annie Baker, Steven Moran
We also discuss the utility of the VoxAngeles corpus for general research and pedagogy in crosslinguistic phonetics, as well as for low-resource and multilingual speech technologies.
no code implementations • 6 Mar 2024 • Tanja Samardzic, Ximena Gutierrez, Christian Bentz, Steven Moran, Olga Pelloni
Typologically diverse benchmarks are increasingly created to track the progress achieved in multilingual NLP.
1 code implementation • 5 Jan 2023 • Zifan Jiang, Adrian Soldati, Isaac Schamberg, Adriano R. Lameira, Steven Moran
We present a novel approach to automatically detect and classify great ape calls from continuous raw audio recordings collected during field research.
no code implementations • LREC 2020 • Eitan Grossman, Elad Eisen, Dmitry Nikolaev, Steven Moran
Phonological segment borrowing is a process through which languages acquire new contrastive speech sounds as the result of borrowing new words from other languages.
no code implementations • LREC 2020 • Anna Jancso, Steven Moran, Sabine Stoll
We present the ACQDIV corpus database and aggregation pipeline, a tool developed as part of the European Research Council (ERC) funded project ACQDIV, which aims to identify the universal cognitive processes that allow children to acquire any language.
no code implementations • ACL 2019 • Georgia R. Loukatou, Steven Moran, Damian Blasi, Sabine Stoll, Alej Cristia, rina
When learning language, infants need to break down the flow of input speech into minimal word-like units, a process best described as unsupervised bottom-up segmentation.
no code implementations • LREC 2016 • John Philip McCrae, Christian Chiarcos, Francis Bond, Philipp Cimiano, Thierry Declerck, Gerard de Melo, Jorge Gracia, Sebastian Hellmann, Bettina Klimek, Steven Moran, Petya Osenova, Antonio Pareja-Lora, Jonathan Pool
The Open Linguistics Working Group (OWLG) brings together researchers from various fields of linguistics, natural language processing, and information technology to present and discuss principles, case studies, and best practices for representing, publishing and linking linguistic data collections.
no code implementations • LREC 2016 • Steven Moran
One of the most pressing questions in cognitive science remains unanswered: what cognitive mechanisms enable children to learn any of the world{'}s 7000 or so languages?
no code implementations • LREC 2014 • Jean-Philippe Goldman, Adrian Leeman, Marie-Jos{\'e} Kolly, Ingrid Hove, Ibrahim Almajai, Volker Dellwo, Steven Moran
Besides, the Voice {\~A}pp takes its users on a journey in which they explore the individuality of their own voices, answering questions such as: How high is my voice?
no code implementations • LREC 2012 • Christian Chiarcos, Sebastian Hellmann, Sebastian Nordhoff, Steven Moran, Richard Littauer, Judith Eckle-Kohler, Iryna Gurevych, Silvana Hartmann, Michael Matuschek, Christian M. Meyer
This paper describes the Open Linguistics Working Group (OWLG) of the Open Knowledge Foundation (OKFN).