OTEANNv3

Introduced by Marjou in OTEANN: Estimating the Transparency of Orthographies with an Artificial Neural Network

This dataset contains orthographic samples of words in 19 languages (ar, br, de, en, eno, ent, eo, es, fi, fr, fro, it, ko, nl, pt, ru, sh, tr, zh). Each sample contains two text features: a Word (the textual representation of the word according to its orthography) and a Pronunciation (the highest-surface IPA pronunciation of the word as pronunced in its language).

Homepage

Benchmarks

Add a new result Link an existing benchmark

No benchmarks yet. Start a new benchmark or link an existing one.

Papers

Paper	Code	Results	Date	Stars

Dataset Loaders

Add Remove

marxav/oteann3

Tasks

License

Unknown

Modalities

Languages

English
French
Spanish
German
Italian
Russian
Portuguese
Arabic
Breton
Dutch
Finnish
Korean
Turkish
Mandarin Chinese
Esperanto
Serbo-Croatian

OTEANNv3

Benchmarks Edit Add a new result Link an existing benchmark

Papers

Dataset Loaders Edit Add Remove

Tasks Edit

License Edit

Modalities Edit

Languages Edit