Datasets

6,541 machine learning datasets
Filter by Task
Machine Translation 14 Speech Recognition 10 Question Answering 6 Text Classification 6 Cross-Lingual Transfer 5 Information Retrieval 5 Abstractive Text Summarization 4 Domain Adaptation 4 Language Identification 4 Language Modelling 4 Named Entity Recognition 4 Natural Language Inference 4 Word Embeddings 4 Cross-Lingual NER 3 Data Augmentation 3 Fake News Detection 3 Part-Of-Speech Tagging 3 Relation Extraction 3 Sentence Embeddings 3 Text Generation 3 Text Summarization 3 Token Classification 3 2D Semantic Segmentation 2 Automatic Post-Editing 2 Cross-Lingual Natural Language Inference 2 Cross-Lingual POS Tagging 2 Discourse Segmentation 2 Fact Verification 2 Image Classification 2 Instance Segmentation 2 K-complex detection 2 Language Acquisition 2 Misinformation 2 Multilingual Named Entity Recognition 2 Natural Language Understanding 2 Open-Domain Question Answering 2 Paraphrase Identification 2 Reading Comprehension 2 Relation Classification 2 Sentence Embedding 2 Sequence-to-sequence Language Modeling 2 Sleep Stage Detection 2 Slot Filling 2 Speaker Recognition 2 Speech-to-Text Translation 2 Spindle Detection 2 Spoken Language Understanding 2 Text Categorization 2 2D object detection 1 Accented Speech Recognition 1 Argument Mining 1 Audio Source Separation 1 Automatic Sleep Stage Classification 1 Automatic Speech Recognition 1 COVID-19 Diagnosis 1 Chemical Reaction Prediction 1 Chinese Sentence Pair Classification 1 Citation Recommendation 1 Computed Tomography (CT) 1 Constituency Parsing 1 Contour Detection 1 Coreference Resolution 1 Croatian Text Diacritization 1 Cross-Lingual ASR 1 Cross-Lingual Abstractive Summarization 1 Cross-Lingual Bitext Mining 1 Cross-Lingual Document Classification 1 Cross-Lingual Paraphrase Identification 1 Cross-Lingual Question Answering 1 Cross-Lingual Sentiment Classification 1 Cross-lingual zero-shot dependency parsing 1 Czech Text Diacritization 1 Dependency Parsing 1 Dialogue Generation 1 Discourse Parsing 1 Document Classification 1 Document Embedding 1 Document Layout Analysis 1 Entity Alignment 1 Entity Embeddings 1 Entity Linking 1 Few-shot NER 1 French Text Diacritization 1 Handwriting Recognition 1 Hate Speech Detection 1 Hungarian Text Diacritization 1 Implicit Discourse Relation Classification 1 Interpretable Machine Learning 1 Irish Text Diacritization 1 Keyword Spotting 1 Knowledge Base Question Answering 1 Latvian Text Diacritization 1 Line Segment Detection 1 Max-Shot Cross-Lingual Image-to-Text Retrieval 1 Max-Shot Cross-Lingual Text-to-Image Retrieval 1 Max-Shot Cross-Lingual Visual Natural Language Inference 1 Max-Shot Cross-Lingual Visual Question Answering 1 Max-Shot Cross-Lingual Visual Reasoning 1 Multilingual text classification 1 Multimodal Lexical Translation 1 Multimodal Machine Translation 1 Multimodal Text Prediction 1 Multiview Clustering 1 NER 1 Node Classification 1 Paraphrase Generation 1 Polish Text Diacritization 1 Relation Linking 1 Romanian Text Diacritization 1 Semantic Role Labeling 1 Semantic Segmentation 1 Sentiment Analysis 1 Sign Language Production 1 Sign Language Recognition 1 Sign Language Translation 1 Single-step retrosynthesis 1 Sleep Staging 1 Slovak Text Diacritization 1 Spanish Text Diacritization 1 Speaker Identification 1 Speaker Verification 1 Speech Synthesis 1 Speech-to-Speech Translation 1 Spoken language identification 1 Stance Detection 1 Text Matching 1 Text Style Transfer 1 Text-To-Speech Synthesis 1 Translation 1 Translation deu-eng 1 Translation eng-deu 1 Turkish Text Diacritization 1 Unsupervised Machine Translation 1 Video Captioning 1 Vietnamese Text Diacritization 1 Visual Reasoning 1 W-R-L-D Sleep Staging 1 W-R-N Sleep Staging 1 Word Alignment 1 Word Sense Disambiguation 1 Zero-Shot Cross-Lingual Image-to-Text Retrieval 1 Zero-Shot Cross-Lingual Text-to-Image Retrieval 1 Zero-Shot Cross-Lingual Transfer 1 Zero-Shot Cross-Lingual Visual Natural Language Inference 1 Zero-Shot Cross-Lingual Visual Question Answering 1 Zero-Shot Cross-Lingual Visual Reasoning 1 connective detection 1
Filter by Language (clear)
French English 1654 Chinese 258 German 128 Spanish 89 Russian 83 Japanese 69 Italian 65 Arabic 57 Portuguese 54 Korean 50 Hindi 49 Turkish 43 Vietnamese 34 Dutch 33 Tamil 30 Persian 29 Bengali 28 Indonesian 28 Polish 28 Czech 27 Danish 24 Finnish 24 Romanian 24 Telugu 22 Malayalam 21 Multilingual 21 Urdu 21 Thai 20 Mandarin Chinese 18 Marathi 18 Estonian 16 Swedish 16 Basque 15 Gujarati 15 Hebrew 15 Hungarian 15 Bulgarian 14 Kannada 14 Greek 13 Punjabi 13 Kazakh 12 Norwegian 12 Ukrainian 12 Catalan 11 Slovak 11 Slovenian 11 Croatian 10 Latvian 10 Serbian 10 Swahili 10 Albanian 9 Amharic 9 Armenian 9 Assamese 9 Lithuanian 9 Welsh 9 Breton 8 Irish 8 Mongolian 8 Oriya (macrolanguage) 8 Sinhala 8 Georgian 7 Icelandic 7 Macedonian 7 Maltese 7 Esperanto 6 Kurdish 6 Sanskrit 6 Yoruba 6 Afrikaans 5 American Sign Language 5 Azerbaijani 5 Belarusian 5 Burmese 5 Filipino 5 Galician 5 Igbo 5 Latin 5 Scottish Gaelic 5 Sindhi 5 Tagalog 5 Uzbek 5 Bosnian 4 Chechen 4 Haitian 4 Hausa 4 Javanese 4 Malagasy 4 Malay (individual language) 4 Nepali (macrolanguage) 4 Norwegian Nynorsk 4 Serbo-Croatian 4 Somali 4 Standard Arabic 4 Sundanese 4 Tatar 4 Tibetan 4 Upper Sorbian 4 Wolof 4 Aragonese 3 Bambara 3 Bashkir 3 Bavarian 3 Bishnupriya 3 Central Khmer 3 Chuvash 3 Dhivehi 3 Egyptian Arabic 3 Erzya 3 Faroese 3 Fon 3 Guarani 3 Iranian Persian 3 Kinyarwanda 3 Kirghiz 3 Lao 3 Odia 3 Oromo 3 Quechua 3 Romansh 3 Russia Buriat 3 South Azerbaijani 3 Swiss German 3 Uighur 3 Western Panjabi 3 Yiddish 3 Asturian 2 Avaric 2 Cebuano 2 Central Bikol 2 Central Kurdish 2 Cherokee 2 Church Slavic 2 Cornish 2 Dimli (individual language) 2 Eastern Mari 2 Fulah 2 Ganda 2 Goan Konkani 2 Gothic 2 Ido 2 Iloko 2 Interlingue 2 Inuktitut 2 Jejueo 2 Kabyle 2 Kalmyk 2 Karachay-Balkar 2 Komi 2 Komi-Permyak 2 Lezghian 2 Limburgan 2 Lingala 2 Livvi 2 Lojban 2 Lombard 2 Low German 2 Lower Sorbian 2 Luxembourgish 2 Maithili 2 Malay (macrolanguage) 2 Manipuri 2 Manx 2 Mazanderani 2 Minangkabau 2 Mingrelian 2 Mirandese 2 Modern Greek 2 Moksha 2 Naxi 2 Neapolitan 2 Newari 2 Nigerian Pidgin 2 Northern Frisian 2 Northern Luri 2 Northern Sami 2 Norwegian Bokmål 2 Occitan (post 1500) 2 Ossetian 2 Pampanga 2 Piemontese 2 Portuguse 2 Pushto 2 Sardinian 2 Sichuan Yi 2 Sicilian 2 Swati 2 Tai 2 Tajik 2 Tigrinya 2 Tswana 2 Turkish Sign Language 2 Turkmen 2 Tuvinian 2 Venetian 2 Volapük 2 Walloon 2 Waray (Philippines) 2 Western Frisian 2 Western Mari 2 Wu Chinese 2 Xhosa 2 Yakut 2 Yue Chinese 2 Abkhazian 1 Achinese 1 Adyghe 1 Afar 1 Akan 1 Akkadian 1 Akuntsu 1 Ancient Greek 1 Ancient Hebrew 1 Apurinã 1 Argentine Sign Language 1 Arpitan 1 Assyrian Neo-Aramaic 1 Aymara 1 Bangladeshi Sign Language 1 Banjar 1 Bhojpuri 1 Bislama 1 Bodo (India) 1 Buginese 1 Central Pashto 1 Chamorro 1 Chavacano 1 Cheyenne 1 Choctaw 1 Chukot 1 Congo Swahili 1 Coptic 1 Corsican 1 Cree 1 Creek 1 Crimean Tatar 1 Dzongkha 1 Ewe 1 Extremaduran 1 Fiji Hindi 1 Fijian 1 Friulian 1 Gagauz 1 Gan Chinese 1 Geez 1 German Sign Language 1 Gilaki 1 Greek Sign Language 1 Gulf Arabic 1 Hakha Chin 1 Hakka Chinese 1 Hawaiian 1 Herero 1 Hiri Motu 1 Interlingua (International Auxiliary Language Association) 1 Inupiaq 1 Jamaican Creole English 1 Kabardian 1 Kalaallisut 1 Kanuri 1 Kara-Kalpak 1 Karelian 1 Kashmiri 1 Kashubian 1 Khunsari 1 Kikuyu 1 Komi-Zyrian 1 Kongo 1 Kuanyama 1 Kölsch 1 Ladino 1 Lak 1 Latgalian 1 Ligurian 1 Literary Chinese 1 Luo (Cameroon) 1 Luo (Kenya and Tanzania) 1 Maori 1 Marshallese 1 Mbyá Guaraní 1 Min Dong Chinese 1 Modern Greek (1453-) 1 Moroccan Arabic 1 Mundurukú 1 Narom 1 Nauru 1 Navajo 1 Nayini 1 Ndonga 1 Nepali (individual language) 1 Northern Kurdish 1 Novial 1 Nyanja 1 Official Aramaic (700-300 BCE) 1 Old English (ca. 450-1100) 1 Old French 1 Old Russian 1 Old Turkish 1 Pali 1 Pangasinan 1 Papiamento 1 Pedi 1 Pennsylvania German 1 Pfaelzisch 1 Picard 1 Pitcairn-Norfolk 1 Pontic 1 Rajasthani 1 Rundi 1 Rusyn 1 Samoan 1 Sango 1 Saterfriesisch 1 Scots 1 Shona 1 Silesian 1 Skolt Sami 1 Soi 1 South Levantine Arabic 1 Southern Sotho 1 Sranan Tongo 1 Swahili (macrolanguage) 1 Swedish Sign Language 1 Swiss-German Sign Language 1 Tahitian 1 Tetum 1 Tok Pisin 1 Tonga (Tonga Islands) 1 Tosk Albanian 1 Tsonga 1 Tulu 1 Tumbuka 1 Tunisian Arabic 1 Tupinambá 1 Twi 1 Udmurt 1 Venda 1 Veps 1 Vlaams 1 Vlax Romani 1 Votic 1 Warlpiri 1 Zeeuws 1 Zhuang 1 Zulu 1 Northern Huishui Hmong 0 Saidi Arabic 0 Santali 0

103 dataset results for French