Datasets

11,379 machine learning datasets
Filter by Task
Machine Translation 7 Speech Recognition 7 Language Modelling 5 Language Identification 3 Named Entity Recognition (NER) 3 Automatic Post-Editing 2 Cross-Lingual Transfer 2 Data Augmentation 2 Text Generation 2 Text Summarization 2 Unsupervised Machine Translation 2 Audio Classification 1 Automatic Speech Recognition 1 Citation Recommendation 1 Classification 1 Croatian Text Diacritization 1 Cross-Lingual ASR 1 Cross-Lingual NER 1 Cross-Lingual POS Tagging 1 Cross-lingual zero-shot dependency parsing 1 Czech Text Diacritization 1 Dependency Parsing 1 Document Classification 1 Document Summarization 1 Fact Verification 1 Few-Shot Audio Classification 1 Fill Mask 1 French Text Diacritization 1 Hungarian Text Diacritization 1 Irish Text Diacritization 1 K-complex detection 1 Keyphrase Extraction 1 Keyphrase Generation 1 Latvian Text Diacritization 1 Misinformation 1 Multilingual Image-Text Classification 1 Multilingual text classification 1 NMT 1 Nested Named Entity Recognition 1 News Recommendation 1 News Retrieval 1 Part-Of-Speech Tagging 1 Polish Text Diacritization 1 Question Answering 1 Recommendation Systems 1 Romanian Text Diacritization 1 Satire Detection 1 Sentence Embeddings 1 Sentiment Analysis 1 Sequence-to-sequence Language Modeling 1 Sleep Stage Detection 1 Slovak Text Diacritization 1 Spanish Text Diacritization 1 Speech Synthesis 1 Speech-to-Text Translation 1 Spindle Detection 1 Text Retrieval 1 Text-to-Image Generation 1 Token Classification 1 Translation deu-eng 1 Translation eng-deu 1 Turkish Text Diacritization 1 UIE 1 Video Captioning 1 Vietnamese Text Diacritization 1 Word Alignment 1 Word Embeddings 1 Zero-shot Cross-lingual Fact-checking 1 text annotation 1
Filter by Language (clear)
Romanian English 3578 Chinese 416 German 189 French 180 Spanish 146 Russian 130 Japanese 102 Arabic 96 Italian 92 Portuguese 87 Hindi 78 Vietnamese 68 Korean 63 Bengali 60 Turkish 57 Persian 52 Tamil 47 Dutch 46 Polish 42 Indonesian 41 Czech 39 Danish 34 Finnish 34 Telugu 34 Urdu 33 Marathi 30 Multilingual 29 Thai 29 Swedish 28 Hungarian 27 Greek 25 Gujarati 25 Estonian 24 Mandarin Chinese 24 Hebrew 23 Bulgarian 22 Malayalam 22 Ukrainian 21 Basque 20 Punjabi 19 Kannada 18 Catalan 17 Slovak 17 Swahili 17 Croatian 16 Latvian 16 Lithuanian 16 Slovenian 16 Kazakh 15 Serbian 15 Norwegian 14 Amharic 13 Iranian Persian 13 Albanian 12 Kurdish 12 Assamese 11 Tagalog 11 Armenian 10 Azerbaijani 10 Irish 10 Macedonian 10 Sinhala 10 Welsh 10 Yoruba 10 American Sign Language 9 Maltese 9 Mongolian 9 Sanskrit 9 Breton 8 Burmese 8 Georgian 8 Hausa 8 Igbo 8 Odia 8 Oriya (macrolanguage) 8 Esperanto 7 Filipino 7 Galician 7 Nepali (macrolanguage) 7 Oromo 7 Uzbek 7 Bambara 6 Belarusian 6 Guarani 6 Icelandic 6 Malagasy 6 Nigerian Pidgin 6 Serbo-Croatian 6 Somali 6 Western Panjabi 6 Wolof 6 Afrikaans 5 Bosnian 5 Central Khmer 5 Central Kurdish 5 Fon 5 Ganda 5 Haitian 5 Javanese 5 Latin 5 Malay (individual language) 5 Norwegian Nynorsk 5 Quechua 5 Scottish Gaelic 5 Sindhi 5 Standard Arabic 5 Sundanese 5 Tibetan 5 Tigrinya 5 Aymara 4 Bangala 4 Chechen 4 Dhivehi 4 Egyptian Arabic 4 Ewe 4 Kabyle 4 Lingala 4 Nepali (individual language) 4 Norwegian Bokmål 4 Tatar 4 Tetum 4 Tswana 4 Twi 4 Upper Sorbian 4 Aragonese 3 Bashkir 3 Bavarian 3 Bishnupriya 3 Cebuano 3 Chuvash 3 Erzya 3 Faroese 3 Fulah 3 German Sign Language 3 Goan Konkani 3 Iloko 3 Interlingue 3 Kinyarwanda 3 Kirghiz 3 Lao 3 Luo (Kenya and Tanzania) 3 Maithili 3 Nyanja 3 Occitan (post 1500) 3 Romansh 3 Rundi 3 Russia Buriat 3 Sardinian 3 South Azerbaijani 3 Swiss German 3 Turkmen 3 Uighur 3 Xhosa 3 Yiddish 3 Zulu 3 Argentine Sign Language 2 Asturian 2 Avaric 2 Bangladeshi Sign Language 2 Bhojpuri 2 Central Bikol 2 Central Pashto 2 Cherokee 2 Church Slavic 2 Cornish 2 Corsican 2 Dimli (individual language) 2 Eastern Mari 2 Gothic 2 Ido 2 Inuktitut 2 Jamaican Creole English 2 Jejueo 2 Kalaallisut 2 Kalmyk 2 Karachay-Balkar 2 Komi 2 Komi-Permyak 2 Lezghian 2 Limburgan 2 Livvi 2 Lojban 2 Lombard 2 Low German 2 Lower Sorbian 2 Luxembourgish 2 Malay (macrolanguage) 2 Manipuri 2 Manx 2 Mazanderani 2 Minangkabau 2 Mingrelian 2 Mirandese 2 Modern Greek 2 Moksha 2 Mossi 2 Naxi 2 Neapolitan 2 Newari 2 Northern Frisian 2 Northern Kurdish 2 Northern Luri 2 Northern Sami 2 Old Spanish 2 Ossetian 2 Pampanga 2 Piemontese 2 Pushto 2 Shona 2 Sichuan Yi 2 Sicilian 2 Southern Pashto 2 Swati 2 Swiss-German Sign Language 2 Tai 2 Tajik 2 Tsonga 2 Turkish Sign Language 2 Tuvinian 2 Udmurt 2 Venetian 2 Volapük 2 Walloon 2 Waray (Philippines) 2 Western Frisian 2 Western Mari 2 Wu Chinese 2 Yakut 2 Yue Chinese 2 Abkhazian 1 Achinese 1 Adyghe 1 Afar 1 Akan 1 Akkadian 1 Akuntsu 1 Ambonese Malay 1 Ancient Greek 1 Ancient Hebrew 1 Andaman Creole Hindi 1 Apurinã 1 Arpitan 1 Assyrian Neo-Aramaic 1 Banjar 1 Bemba (Zambia) 1 Bislama 1 Bodo (India) 1 Buginese 1 Chamorro 1 Chavacano 1 Cheyenne 1 Choctaw 1 Chukot 1 Congo Swahili 1 Coptic 1 Cree 1 Creek 1 Crimean Tatar 1 Dogri (macrolanguage) 1 Dzongkha 1 Extremaduran 1 Fiji Hindi 1 Fijian 1 French Sign Language 1 Friulian 1 Gagauz 1 Gan Chinese 1 Geez 1 Gilaki 1 Greek Sign Language 1 Gulf Arabic 1 Hakha Chin 1 Hakka Chinese 1 Halh Mongolian 1 Hawaiian 1 Herero 1 Hiri Motu 1 Interlingua (International Auxiliary Language Association) 1 Inupiaq 1 Kabardian 1 Kanuri 1 Kara-Kalpak 1 Karelian 1 Kashmiri 1 Kashubian 1 Khunsari 1 Kikuyu 1 Komi-Zyrian 1 Kongo 1 Krio 1 Kuanyama 1 Kupang Malay 1 Kölsch 1 Ladino 1 Lak 1 Latgalian 1 Ligurian 1 Literary Chinese 1 Lozi 1 Lunda 1 Luo (Cameroon) 1 Lushai 1 Makasar 1 Malayic Dayak 1 Maori 1 Marshallese 1 Mbyá Guaraní 1 Min Dong Chinese 1 Modern Greek (1453-) 1 Moroccan Arabic 1 Mundurukú 1 Narom 1 Nauru 1 Navajo 1 Nayini 1 Ndonga 1 Novial 1 Official Aramaic (700-300 BCE) 1 Old English (ca. 450-1100) 1 Old French 1 Old Russian 1 Old Turkish 1 Pali 1 Pangasinan 1 Papiamento 1 Pedi 1 Pennsylvania German 1 Pfaelzisch 1 Picard 1 Pitcairn-Norfolk 1 Pontic 1 Rajasthani 1 Rusyn 1 Samoan 1 Sango 1 Saterfriesisch 1 Scots 1 Silesian 1 Skolt Sami 1 Soi 1 South Levantine Arabic 1 Southern Sotho 1 Sranan Tongo 1 Swahili (macrolanguage) 1 Swedish Sign Language 1 Tahitian 1 Tok Pisin 1 Tonga (Tonga Islands) 1 Tonga (Zambia) 1 Tosk Albanian 1 Tulu 1 Tumbuka 1 Tunisian Arabic 1 Tupinambá 1 Uab Meto 1 Venda 1 Veps 1 Vlaams 1 Vlax Romani 1 Votic 1 Warlpiri 1 Zaza 1 Zeeuws 1 Zhuang 1 Dogri (individual language) 0 Kabuverdianu 0 Kachin 0 Lingua Franca 0 Mesopotamian Arabic 0 Najdi Arabic 0 Nigerian Fulfulde 0 North Azerbaijani 0 North Levantine Arabic 0 Northern Huishui Hmong 0 Northern Uzbek 0 Plateau Malagasy 0 Portuguse 0 Saidi Arabic 0 Santali 0 Shan 0 Standard Latvian 0 Thai Song 0 Tunisian Sign Language 0 West Central Oromo 0

33 dataset results for Romanian