Search Results for author: Steven Moran

Found 17 papers, 4 papers with code

TeDDi Sample: Text Data Diversity Sample for Language Comparison and Multilingual NLP

no code implementations • LREC 2022 • Steven Moran, Christian Bentz, Ximena Gutierrez-Vasques, Olga Pelloni, Tanja Samardzic

We present the TeDDi sample, a diversity sample of text data for language comparison and multilingual Natural Language Processing.

Multilingual NLP

Paper
Add Code

Phonetic Segmentation of the UCLA Phonetics Lab Archive

1 code implementation • 28 Mar 2024 • Eleanor Chodroff, Blaž Pažon, Annie Baker, Steven Moran

We also discuss the utility of the VoxAngeles corpus for general research and pedagogy in crosslinguistic phonetics, as well as for low-resource and multilingual speech technologies.

Paper
Code

A Measure for Transparent Comparison of Linguistic Diversity in Multilingual NLP Data Sets

no code implementations • 6 Mar 2024 • Tanja Samardzic, Ximena Gutierrez, Christian Bentz, Steven Moran, Olga Pelloni

Typologically diverse benchmarks are increasingly created to track the progress achieved in multilingual NLP.

Multilingual NLP

Paper
Add Code

Automatic Sound Event Detection and Classification of Great Ape Calls Using Neural Networks

1 code implementation • 5 Jan 2023 • Zifan Jiang, Adrian Soldati, Isaac Schamberg, Adriano R. Lameira, Steven Moran

We present a novel approach to automatically detect and classify great ape calls from continuous raw audio recordings collected during field research.

Event Detection Sound Event Detection

Paper
Code

SegBo: A Database of Borrowed Sounds in the World's Languages

no code implementations • LREC 2020 • Eitan Grossman, Elad Eisen, Dmitry Nikolaev, Steven Moran

Phonological segment borrowing is a process through which languages acquire new contrastive speech sounds as the result of borrowing new words from other languages.

Paper
Add Code

The ACQDIV Corpus Database and Aggregation Pipeline

no code implementations • LREC 2020 • Anna Jancso, Steven Moran, Sabine Stoll

We present the ACQDIV corpus database and aggregation pipeline, a tool developed as part of the European Research Council (ERC) funded project ACQDIV, which aims to identify the universal cognitive processes that allow children to acquire any language.

Language Acquisition

Paper
Add Code

Is Word Segmentation Child's Play in All Languages?

no code implementations • ACL 2019 • Georgia R. Loukatou, Steven Moran, Damian Blasi, Sabine Stoll, Alej Cristia, rina

When learning language, infants need to break down the flow of input speech into minimal word-like units, a process best described as unsupervised bottom-up segmentation.

Segmentation valid

Paper
Add Code

BDPROTO: A Database of Phonological Inventories from Ancient and Reconstructed Languages

no code implementations • LREC 2018 • Egidio Marsico, Sebastien Flavier, Annemarie Verkerk, Steven Moran

Paper
Add Code

Towards faithfully visualizing global linguistic diversity

1 code implementation • LREC 2018 • Garl McNew, , Curdin Derungs, Steven Moran

Paper
Code

Cross-linguistically Small World Networks are Ubiquitous in Child-directed Speech

no code implementations • LREC 2018 • Steven Moran, Danica Pajovi{\'c}, Sabine Stoll

Language Acquisition

Paper
Add Code

The Open Linguistics Working Group: Developing the Linguistic Linked Open Data Cloud

no code implementations • LREC 2016 • John Philip McCrae, Christian Chiarcos, Francis Bond, Philipp Cimiano, Thierry Declerck, Gerard de Melo, Jorge Gracia, Sebastian Hellmann, Bettina Klimek, Steven Moran, Petya Osenova, Antonio Pareja-Lora, Jonathan Pool

The Open Linguistics Working Group (OWLG) brings together researchers from various fields of linguistics, natural language processing, and information technology to present and discuss principles, case studies, and best practices for representing, publishing and linking linguistic data collections.

Paper
Add Code

The ACQDIV Database: Min(d)ing the Ambient Language

no code implementations • LREC 2016 • Steven Moran

One of the most pressing questions in cognitive science remains unanswered: what cognitive mechanisms enable children to learn any of the world{'}s 7000 or so languages?

Language Acquisition

Paper
Add Code

A Crowdsourcing Smartphone Application for Swiss German: Putting Language Documentation in the Hands of the Users

no code implementations • LREC 2014 • Jean-Philippe Goldman, Adrian Leeman, Marie-Jos{\'e} Kolly, Ingrid Hove, Ibrahim Almajai, Volker Dellwo, Steven Moran

Besides, the Voice {\~A}pp takes its users on a journey in which they explore the individuality of their own voices, answering questions such as: How high is my voice?

speech-recognition Speech Recognition