Datasets

7,709 machine learning datasets
Filter by Task
Sentiment Analysis 6 Misinformation 5 Question Answering 5 Cross-Lingual Transfer 4 Speech Recognition 4 Abstractive Text Summarization 3 Image Classification 3 Language Identification 3 Language Modelling 3 Machine Translation 3 Natural Language Inference 3 Part-Of-Speech Tagging 3 Sarcasm Detection 3 Text Classification 3 Token Classification 3 Automatic Speech Recognition 2 Chinese Named Entity Recognition 2 Cross-Lingual NER 2 Cross-Lingual Natural Language Inference 2 Cross-Lingual POS Tagging 2 Cross-Lingual Question Answering 2 Dialect Identification 2 Domain Adaptation 2 Fake News Detection 2 Information Retrieval 2 Named Entity Recognition 2 Natural Language Understanding 2 Open-Domain Question Answering 2 Question Generation 2 Reading Comprehension 2 Speech-to-Text Translation 2 Text Summarization 2 Text-To-Speech Synthesis 2 Word Embeddings 2 Zero-Shot Cross-Lingual Transfer 2 Arabic Sentiment Analysis 1 Arabic Text Diacritization 1 Audio Source Separation 1 Automatic Phoneme Recognition 1 Chinese Sentence Pair Classification 1 Coreference Resolution 1 Cross-Lingual ASR 1 Cross-Lingual Abstractive Summarization 1 Cross-lingual zero-shot dependency parsing 1 Data Augmentation 1 Dependency Parsing 1 Dialogue Generation 1 Document Summarization 1 Emotion Classification 1 Entity Embeddings 1 Entity Typing 1 FG-1-PG-1 1 FLUE 1 Fact Checking 1 Fact Verification 1 Few-shot NER 1 Generalized Zero-Shot Learning 1 Handwriting Recognition 1 Hate Speech Detection 1 Intent Classification 1 Irony Identification 1 Max-Shot Cross-Lingual Image-to-Text Retrieval 1 Max-Shot Cross-Lingual Text-to-Image Retrieval 1 Max-Shot Cross-Lingual Visual Natural Language Inference 1 Max-Shot Cross-Lingual Visual Question Answering 1 Max-Shot Cross-Lingual Visual Reasoning 1 Morphological Analysis 1 Node Classification 1 Object Detection 1 Resynthesis 1 Semantic Role Labeling 1 Semantic Similarity 1 Sentence Embeddings 1 Sequence-to-sequence Language Modeling 1 Slot Filling 1 Speech Enhancement 1 Speech Synthesis 1 Speech-to-Speech Translation 1 Spoken Language Understanding 1 Stance Detection 1 Style Transfer 1 Text Categorization 1 Text Effects Transfer 1 Translation 1 Transliteration 1 Video Recognition 1 Visual Reasoning 1 Weakly-Supervised Named Entity Recognition 1 Zero-Shot Cross-Lingual Image-to-Text Retrieval 1 Zero-Shot Cross-Lingual Text-to-Image Retrieval 1 Zero-Shot Cross-Lingual Visual Natural Language Inference 1 Zero-Shot Cross-Lingual Visual Question Answering 1 Zero-Shot Cross-Lingual Visual Reasoning 1 Zero-shot Cross-lingual Fact-checking 1 coreference-resolution 1 speech-recognition 1 text2text-generation 1
Filter by Language (clear)
Arabic English 2093 Chinese 297 German 147 French 124 Spanish 106 Russian 90 Japanese 77 Italian 74 Portuguese 62 Hindi 57 Korean 57 Turkish 44 Vietnamese 39 Dutch 38 Polish 34 Tamil 34 Persian 33 Czech 32 Bengali 31 Indonesian 30 Danish 27 Romanian 27 Finnish 26 Telugu 24 Malayalam 23 Multilingual 23 Marathi 21 Urdu 21 Thai 20 Estonian 19 Hungarian 19 Mandarin Chinese 19 Greek 18 Swedish 18 Bulgarian 17 Gujarati 17 Hebrew 17 Basque 16 Kannada 16 Punjabi 15 Slovak 14 Slovenian 14 Croatian 13 Latvian 13 Norwegian 13 Ukrainian 13 Catalan 12 Kazakh 12 Lithuanian 12 Assamese 11 Amharic 10 Serbian 10 Swahili 10 Albanian 9 Armenian 9 Irish 9 Oriya (macrolanguage) 9 Welsh 9 Breton 8 Kurdish 8 Maltese 8 Mongolian 8 Sinhala 8 Georgian 7 Icelandic 7 Macedonian 7 Sanskrit 7 Yoruba 7 Afrikaans 6 American Sign Language 6 Azerbaijani 6 Esperanto 6 Uzbek 6 Belarusian 5 Burmese 5 Filipino 5 Galician 5 Hausa 5 Igbo 5 Iranian Persian 5 Latin 5 Scottish Gaelic 5 Sindhi 5 Tagalog 5 Bosnian 4 Central Kurdish 4 Chechen 4 Egyptian Arabic 4 Haitian 4 Javanese 4 Malagasy 4 Malay (individual language) 4 Nepali (macrolanguage) 4 Norwegian Nynorsk 4 Odia 4 Serbo-Croatian 4 Somali 4 Standard Arabic 4 Sundanese 4 Tatar 4 Tibetan 4 Upper Sorbian 4 Wolof 4 Aragonese 3 Bambara 3 Bashkir 3 Bavarian 3 Bishnupriya 3 Central Khmer 3 Chuvash 3 Dhivehi 3 Erzya 3 Faroese 3 Fon 3 Guarani 3 Kinyarwanda 3 Kirghiz 3 Lao 3 Nigerian Pidgin 3 Oromo 3 Quechua 3 Romansh 3 Russia Buriat 3 South Azerbaijani 3 Swiss German 3 Uighur 3 Western Panjabi 3 Yiddish 3 Asturian 2 Avaric 2 Bangala 2 Cebuano 2 Central Bikol 2 Cherokee 2 Church Slavic 2 Cornish 2 Dimli (individual language) 2 Eastern Mari 2 Fulah 2 Ganda 2 German Sign Language 2 Goan Konkani 2 Gothic 2 Ido 2 Iloko 2 Interlingue 2 Inuktitut 2 Jejueo 2 Kabyle 2 Kalmyk 2 Karachay-Balkar 2 Komi 2 Komi-Permyak 2 Lezghian 2 Limburgan 2 Lingala 2 Livvi 2 Lojban 2 Lombard 2 Low German 2 Lower Sorbian 2 Luxembourgish 2 Maithili 2 Malay (macrolanguage) 2 Manipuri 2 Manx 2 Mazanderani 2 Minangkabau 2 Mingrelian 2 Mirandese 2 Modern Greek 2 Moksha 2 Naxi 2 Neapolitan 2 Newari 2 Northern Frisian 2 Northern Kurdish 2 Northern Luri 2 Northern Sami 2 Norwegian Bokmål 2 Occitan (post 1500) 2 Ossetian 2 Pampanga 2 Piemontese 2 Pushto 2 Sardinian 2 Sichuan Yi 2 Sicilian 2 Swati 2 Swiss-German Sign Language 2 Tai 2 Tajik 2 Tigrinya 2 Tswana 2 Turkish Sign Language 2 Turkmen 2 Tuvinian 2 Twi 2 Udmurt 2 Venetian 2 Volapük 2 Walloon 2 Waray (Philippines) 2 Western Frisian 2 Western Mari 2 Wu Chinese 2 Xhosa 2 Yakut 2 Yue Chinese 2 Abkhazian 1 Achinese 1 Adyghe 1 Afar 1 Akan 1 Akkadian 1 Akuntsu 1 Ancient Greek 1 Ancient Hebrew 1 Apurinã 1 Argentine Sign Language 1 Arpitan 1 Assyrian Neo-Aramaic 1 Aymara 1 Bangladeshi Sign Language 1 Banjar 1 Bhojpuri 1 Bislama 1 Bodo (India) 1 Buginese 1 Central Pashto 1 Chamorro 1 Chavacano 1 Cheyenne 1 Choctaw 1 Chukot 1 Congo Swahili 1 Coptic 1 Corsican 1 Cree 1 Creek 1 Crimean Tatar 1 Dzongkha 1 Ewe 1 Extremaduran 1 Fiji Hindi 1 Fijian 1 Friulian 1 Gagauz 1 Gan Chinese 1 Geez 1 Gilaki 1 Greek Sign Language 1 Gulf Arabic 1 Hakha Chin 1 Hakka Chinese 1 Hawaiian 1 Herero 1 Hiri Motu 1 Interlingua (International Auxiliary Language Association) 1 Inupiaq 1 Jamaican Creole English 1 Kabardian 1 Kalaallisut 1 Kanuri 1 Kara-Kalpak 1 Karelian 1 Kashmiri 1 Kashubian 1 Khunsari 1 Kikuyu 1 Komi-Zyrian 1 Kongo 1 Kuanyama 1 Kölsch 1 Ladino 1 Lak 1 Latgalian 1 Ligurian 1 Literary Chinese 1 Luo (Cameroon) 1 Luo (Kenya and Tanzania) 1 Maori 1 Marshallese 1 Mbyá Guaraní 1 Min Dong Chinese 1 Modern Greek (1453-) 1 Moroccan Arabic 1 Mundurukú 1 Narom 1 Nauru 1 Navajo 1 Nayini 1 Ndonga 1 Nepali (individual language) 1 Novial 1 Nyanja 1 Official Aramaic (700-300 BCE) 1 Old English (ca. 450-1100) 1 Old French 1 Old Russian 1 Old Turkish 1 Pali 1 Pangasinan 1 Papiamento 1 Pedi 1 Pennsylvania German 1 Pfaelzisch 1 Picard 1 Pitcairn-Norfolk 1 Pontic 1 Portuguse 1 Rajasthani 1 Rundi 1 Rusyn 1 Samoan 1 Sango 1 Saterfriesisch 1 Scots 1 Shona 1 Silesian 1 Skolt Sami 1 Soi 1 South Levantine Arabic 1 Southern Sotho 1 Sranan Tongo 1 Swahili (macrolanguage) 1 Swedish Sign Language 1 Tahitian 1 Tetum 1 Tok Pisin 1 Tonga (Tonga Islands) 1 Tosk Albanian 1 Tsonga 1 Tulu 1 Tumbuka 1 Tunisian Arabic 1 Tupinambá 1 Venda 1 Veps 1 Vlaams 1 Vlax Romani 1 Votic 1 Warlpiri 1 Zaza 1 Zeeuws 1 Zhuang 1 Zulu 1 Northern Huishui Hmong 0 Saidi Arabic 0 Santali 0

69 dataset results for Arabic