Datasets

9,265 machine learning datasets

79 dataset results for Japanese