Search Results for author: Jonne Sälevä

Found 5 papers, 2 papers with code

ParaNames: A Massively Multilingual Entity Name Corpus

1 code implementation NAACL (SIGTYP) 2022 Jonne Sälevä, Constantine Lignos

We demonstrate an application of ParaNames by training a multilingual model for canonical name translation to and from English.

named-entity-recognition Named Entity Recognition +3

Mining Wikidata for Name Resources for African Languages

1 code implementation1 Apr 2021 Jonne Sälevä, Constantine Lignos

This work supports further development of language technology for the languages of Africa by providing a Wikidata-derived resource of name lists corresponding to common entity types (person, location, and organization).

Toward More Meaningful Resources for Lower-resourced Languages

no code implementations Findings (ACL) 2022 Constantine Lignos, Nolan Holley, Chester Palen-Michel, Jonne Sälevä

We then discuss the importance of creating annotation for lower-resourced languages in a thoughtful and ethical way that includes the languages' speakers as part of the development process.

Position

What changes when you randomly choose BPE merge operations? Not much

no code implementations4 May 2023 Jonne Sälevä, Constantine Lignos

We introduce three simple randomized variants of byte pair encoding (BPE) and explore whether randomizing the selection of merge operations substantially affects a downstream machine translation task.

Machine Translation Translation

Cannot find the paper you are looking for? You can Submit a new open access paper.