Datasets

5,405 machine learning datasets
Filter by Task
Machine Translation 9 Cross-Lingual Transfer 5 Domain Adaptation 4 Language Modelling 4 Abstractive Text Summarization 3 Language Identification 3 Speech Recognition 3 Word Embeddings 3 Cross-Lingual NER 2 Cross-Lingual POS Tagging 2 Image Classification 2 Information Retrieval 2 Natural Language Understanding 2 Sentiment Analysis 2 Text Summarization 2 Translation 2 Accented Speech Recognition 1 Audio Source Separation 1 Causal Inference 1 Chord Recognition 1 Clustering Algorithms Evaluation 1 Coreference Resolution 1 Cross-Lingual ASR 1 Cross-Lingual Abstractive Summarization 1 Cross-Lingual Document Classification 1 Cross-Lingual Entity Linking 1 Cross-Lingual Natural Language Inference 1 Cross-Lingual Paraphrase Identification 1 Cross-Lingual Sentiment Classification 1 Cross-lingual zero-shot dependency parsing 1 Dependency Parsing 1 Dialogue Generation 1 Document Classification 1 Entity Alignment 1 Entity Linking 1 Facial Expression Recognition 1 Fine-Grained Image Classification 1 Image Captioning 1 Image Retrieval 1 Image Super-Resolution 1 Image/Document Clustering 1 Intent Classification 1 Keyword Spotting 1 Low-Resource Neural Machine Translation 1 Misinformation 1 Morphological Analysis 1 Multilingual NLP 1 Music Information Retrieval 1 Named Entity Recognition 1 Natural Language Inference 1 Object Detection 1 Open-Domain Question Answering 1 Outlier Detection 1 Paraphrase Generation 1 Paraphrase Identification 1 Part-Of-Speech Tagging 1 Question Answering 1 Semantic Parsing 1 Semantic Similarity 1 Slot Filling 1 Speech Synthesis 1 Speech-to-Speech Translation 1 Speech-to-Text Translation 1 Spoken Language Understanding 1 Spoken language identification 1 Text Classification 1 Text Generation 1 Token Classification 1 Zero-Shot Cross-Lingual Transfer 1
Filter by Language (clear)
Japanese English 1241 Chinese 166 German 114 French 84 Russian 75 Spanish 74 Arabic 53 Italian 53 Portuguese 48 Turkish 41 Korean 36 Hindi 35 Dutch 31 Vietnamese 29 Persian 27 Polish 25 Czech 24 Finnish 24 Tamil 24 Romanian 23 Bengali 21 Indonesian 21 Multilingual 19 Telugu 19 Urdu 18 Thai 17 Basque 15 Estonian 15 Malayalam 15 Mandarin Chinese 15 Marathi 15 Swedish 15 Hungarian 14 Bulgarian 13 Kannada 13 Danish 12 Gujarati 12 Catalan 11 Hebrew 11 Norwegian 11 Ukrainian 11 Greek 10 Latvian 10 Punjabi 10 Slovak 10 Slovenian 10 Amharic 9 Croatian 9 Kazakh 9 Serbian 9 Swahili 9 Welsh 9 Albanian 8 Armenian 8 Assamese 8 Breton 8 Lithuanian 8 Sinhala 8 Georgian 7 Mongolian 7 Esperanto 6 Icelandic 6 Irish 6 Kurdish 6 Macedonian 6 Maltese 6 Oriya (macrolanguage) 6 Sanskrit 6 Yoruba 6 Afrikaans 5 American Sign Language 5 Azerbaijani 5 Galician 5 Igbo 5 Latin 5 Scottish Gaelic 5 Sindhi 5 Tagalog 5 Uzbek 5 Belarusian 4 Bosnian 4 Burmese 4 Chechen 4 Haitian 4 Hausa 4 Javanese 4 Malagasy 4 Nepali (macrolanguage) 4 Serbo-Croatian 4 Somali 4 Standard Arabic 4 Sundanese 4 Tatar 4 Upper Sorbian 4 Wolof 4 Aragonese 3 Bambara 3 Bavarian 3 Bishnupriya 3 Central Khmer 3 Chuvash 3 Dhivehi 3 Egyptian Arabic 3 Erzya 3 Filipino 3 Guarani 3 Iranian Persian 3 Kinyarwanda 3 Kirghiz 3 Lao 3 Malay (individual language) 3 Norwegian Nynorsk 3 Oromo 3 Quechua 3 Romansh 3 Russia Buriat 3 South Azerbaijani 3 Swiss German 3 Tibetan 3 Uighur 3 Yiddish 3 Asturian 2 Avaric 2 Bashkir 2 Cebuano 2 Central Bikol 2 Central Kurdish 2 Cherokee 2 Church Slavic 2 Cornish 2 Dimli (individual language) 2 Eastern Mari 2 Faroese 2 Fon 2 Fulah 2 Ganda 2 Goan Konkani 2 Gothic 2 Ido 2 Iloko 2 Interlingue 2 Inuktitut 2 Jejueo 2 Kabyle 2 Kalmyk 2 Karachay-Balkar 2 Komi 2 Komi-Permyak 2 Lezghian 2 Limburgan 2 Lingala 2 Livvi 2 Lojban 2 Lombard 2 Low German 2 Lower Sorbian 2 Luxembourgish 2 Maithili 2 Manipuri 2 Manx 2 Mazanderani 2 Minangkabau 2 Mingrelian 2 Mirandese 2 Modern Greek 2 Moksha 2 Naxi 2 Neapolitan 2 Newari 2 Nigerian Pidgin 2 Northern Frisian 2 Northern Luri 2 Northern Sami 2 Occitan (post 1500) 2 Odia 2 Ossetian 2 Pampanga 2 Piemontese 2 Pushto 2 Sardinian 2 Sichuan Yi 2 Sicilian 2 Swati 2 Tai 2 Tajik 2 Tigrinya 2 Tswana 2 Turkish Sign Language 2 Turkmen 2 Tuvinian 2 Venetian 2 Volapük 2 Walloon 2 Waray (Philippines) 2 Western Frisian 2 Western Mari 2 Western Panjabi 2 Wu Chinese 2 Xhosa 2 Yakut 2 Yue Chinese 2 Abkhazian 1 Achinese 1 Adyghe 1 Afar 1 Akan 1 Akkadian 1 Akuntsu 1 Ancient Greek 1 Apurinã 1 Argentine Sign Language 1 Arpitan 1 Assyrian Neo-Aramaic 1 Aymara 1 Bangladeshi Sign Language 1 Banjar 1 Bhojpuri 1 Bislama 1 Bodo (India) 1 Buginese 1 Central Pashto 1 Chamorro 1 Chavacano 1 Cheyenne 1 Choctaw 1 Chukot 1 Coptic 1 Corsican 1 Cree 1 Creek 1 Crimean Tatar 1 Dzongkha 1 Ewe 1 Extremaduran 1 Fiji Hindi 1 Fijian 1 Friulian 1 Gagauz 1 Gan Chinese 1 Geez 1 German Sign Language 1 Gilaki 1 Greek Sign Language 1 Gulf Arabic 1 Hakha Chin 1 Hakka Chinese 1 Hawaiian 1 Herero 1 Hiri Motu 1 Interlingua (International Auxiliary Language Association) 1 Inupiaq 1 Jamaican Creole English 1 Kabardian 1 Kalaallisut 1 Kanuri 1 Kara-Kalpak 1 Karelian 1 Kashmiri 1 Kashubian 1 Khunsari 1 Kikuyu 1 Komi-Zyrian 1 Kongo 1 Kuanyama 1 Kölsch 1 Ladino 1 Lak 1 Latgalian 1 Ligurian 1 Literary Chinese 1 Luo (Cameroon) 1 Luo (Kenya and Tanzania) 1 Malay (macrolanguage) 1 Maori 1 Marshallese 1 Mbyá Guaraní 1 Min Dong Chinese 1 Modern Greek (1453-) 1 Moroccan Arabic 1 Mundurukú 1 Narom 1 Nauru 1 Navajo 1 Nayini 1 Ndonga 1 Nepali (individual language) 1 Northern Kurdish 1 Norwegian Bokmål 1 Novial 1 Nyanja 1 Official Aramaic (700-300 BCE) 1 Old English (ca. 450-1100) 1 Old French 1 Old Russian 1 Old Turkish 1 Pali 1 Pangasinan 1 Papiamento 1 Pedi 1 Pennsylvania German 1 Pfaelzisch 1 Picard 1 Pitcairn-Norfolk 1 Pontic 1 Portuguse 1 Rajasthani 1 Rundi 1 Rusyn 1 Samoan 1 Sango 1 Saterfriesisch 1 Scots 1 Shona 1 Silesian 1 Skolt Sami 1 Soi 1 South Levantine Arabic 1 Southern Sotho 1 Sranan Tongo 1 Swahili (macrolanguage) 1 Swedish Sign Language 1 Swiss-German Sign Language 1 Tahitian 1 Tetum 1 Tok Pisin 1 Tonga (Tonga Islands) 1 Tosk Albanian 1 Tsonga 1 Tulu 1 Tumbuka 1 Tunisian Arabic 1 Tupinambá 1 Twi 1 Udmurt 1 Venda 1 Veps 1 Vlaams 1 Vlax Romani 1 Votic 1 Warlpiri 1 Zeeuws 1 Zhuang 1 Zulu 1 Northern Huishui Hmong 0 Santali 0

56 dataset results for Japanese