Datasets

5,420 machine learning datasets
Filter by Task
Machine Translation 13 Speech Recognition 7 Cross-Lingual Transfer 5 Question Answering 5 Abstractive Text Summarization 4 Domain Adaptation 4 Information Retrieval 4 Language Identification 4 Language Modelling 4 Word Embeddings 4 Cross-Lingual NER 3 Data Augmentation 3 Named Entity Recognition 3 Natural Language Inference 3 Sentence Embeddings 3 Text Generation 3 Text Summarization 3 Automatic Post-Editing 2 Cross-Lingual Natural Language Inference 2 Cross-Lingual POS Tagging 2 Discourse Segmentation 2 K-complex detection 2 Language Acquisition 2 Natural Language Understanding 2 Open-Domain Question Answering 2 Paraphrase Identification 2 Part-Of-Speech Tagging 2 Reading Comprehension 2 Relation Classification 2 Relation Extraction 2 Sentence Embedding 2 Sequence-to-sequence Language Modeling 2 Sleep Stage Detection 2 Slot Filling 2 Speech-to-Text Translation 2 Spindle Detection 2 Text Categorization 2 Text Classification 2 Token Classification 2 Accented Speech Recognition 1 Argument Mining 1 Audio Source Separation 1 Automatic Sleep Stage Classification 1 COVID-19 Diagnosis 1 Chinese Sentence Pair Classification 1 Computed Tomography (CT) 1 Constituency Parsing 1 Contour Detection 1 Coreference Resolution 1 Croatian Text Diacritization 1 Cross-Lingual ASR 1 Cross-Lingual Abstractive Summarization 1 Cross-Lingual Bitext Mining 1 Cross-Lingual Document Classification 1 Cross-Lingual Paraphrase Identification 1 Cross-Lingual Sentiment Classification 1 Cross-lingual zero-shot dependency parsing 1 Czech Text Diacritization 1 Dependency Parsing 1 Dialogue Generation 1 Discourse Parsing 1 Document Classification 1 Document Embedding 1 Document Layout Analysis 1 Entity Alignment 1 Entity Embeddings 1 Fact Verification 1 Fake News Detection 1 French Text Diacritization 1 Handwriting Recognition 1 Hungarian Text Diacritization 1 Implicit Discourse Relation Classification 1 Instance Segmentation 1 Interpretable Machine Learning 1 Irish Text Diacritization 1 Keyword Spotting 1 Latvian Text Diacritization 1 Line Segment Detection 1 Misinformation 1 Multilingual Named Entity Recognition 1 Multilingual text classification 1 Multimodal Lexical Translation 1 Multimodal Machine Translation 1 Multimodal Text Prediction 1 Paraphrase Generation 1 Polish Text Diacritization 1 Romanian Text Diacritization 1 Semantic Role Labeling 1 Semantic Segmentation 1 Sentiment Analysis 1 Sleep Staging 1 Slovak Text Diacritization 1 Spanish Text Diacritization 1 Speech Synthesis 1 Speech-to-Speech Translation 1 Spoken Language Understanding 1 Spoken language identification 1 Stance Detection 1 Text Style Transfer 1 Translation 1 Turkish Text Diacritization 1 Unsupervised Machine Translation 1 Video Captioning 1 Vietnamese Text Diacritization 1 W-R-L-D Sleep Staging 1 W-R-N Sleep Staging 1 Word Alignment 1 Word Sense Disambiguation 1 connective detection 1
Filter by Language (clear)
French English 1245 Chinese 167 German 115 Russian 76 Spanish 74 Japanese 56 Arabic 53 Italian 53 Portuguese 48 Turkish 41 Korean 36 Hindi 35 Dutch 31 Vietnamese 29 Persian 27 Polish 25 Czech 24 Finnish 24 Tamil 24 Romanian 23 Bengali 21 Indonesian 21 Multilingual 19 Telugu 19 Urdu 18 Thai 17 Basque 15 Estonian 15 Malayalam 15 Mandarin Chinese 15 Marathi 15 Swedish 15 Hungarian 14 Bulgarian 13 Kannada 13 Danish 12 Gujarati 12 Catalan 11 Hebrew 11 Norwegian 11 Ukrainian 11 Greek 10 Latvian 10 Punjabi 10 Slovak 10 Slovenian 10 Amharic 9 Croatian 9 Kazakh 9 Serbian 9 Swahili 9 Welsh 9 Albanian 8 Armenian 8 Assamese 8 Breton 8 Lithuanian 8 Sinhala 8 Georgian 7 Mongolian 7 Esperanto 6 Icelandic 6 Irish 6 Kurdish 6 Macedonian 6 Maltese 6 Oriya (macrolanguage) 6 Sanskrit 6 Yoruba 6 Afrikaans 5 American Sign Language 5 Azerbaijani 5 Galician 5 Igbo 5 Latin 5 Scottish Gaelic 5 Sindhi 5 Tagalog 5 Uzbek 5 Belarusian 4 Bosnian 4 Burmese 4 Chechen 4 Haitian 4 Hausa 4 Javanese 4 Malagasy 4 Nepali (macrolanguage) 4 Serbo-Croatian 4 Somali 4 Standard Arabic 4 Sundanese 4 Tatar 4 Upper Sorbian 4 Wolof 4 Aragonese 3 Bambara 3 Bavarian 3 Bishnupriya 3 Central Khmer 3 Chuvash 3 Dhivehi 3 Egyptian Arabic 3 Erzya 3 Filipino 3 Guarani 3 Iranian Persian 3 Kinyarwanda 3 Kirghiz 3 Lao 3 Malay (individual language) 3 Norwegian Nynorsk 3 Oromo 3 Quechua 3 Romansh 3 Russia Buriat 3 South Azerbaijani 3 Swiss German 3 Tibetan 3 Uighur 3 Yiddish 3 Asturian 2 Avaric 2 Bashkir 2 Cebuano 2 Central Bikol 2 Central Kurdish 2 Cherokee 2 Church Slavic 2 Cornish 2 Dimli (individual language) 2 Eastern Mari 2 Faroese 2 Fon 2 Fulah 2 Ganda 2 Goan Konkani 2 Gothic 2 Ido 2 Iloko 2 Interlingue 2 Inuktitut 2 Jejueo 2 Kabyle 2 Kalmyk 2 Karachay-Balkar 2 Komi 2 Komi-Permyak 2 Lezghian 2 Limburgan 2 Lingala 2 Livvi 2 Lojban 2 Lombard 2 Low German 2 Lower Sorbian 2 Luxembourgish 2 Maithili 2 Manipuri 2 Manx 2 Mazanderani 2 Minangkabau 2 Mingrelian 2 Mirandese 2 Modern Greek 2 Moksha 2 Naxi 2 Neapolitan 2 Newari 2 Nigerian Pidgin 2 Northern Frisian 2 Northern Luri 2 Northern Sami 2 Occitan (post 1500) 2 Odia 2 Ossetian 2 Pampanga 2 Piemontese 2 Pushto 2 Sardinian 2 Sichuan Yi 2 Sicilian 2 Swati 2 Tai 2 Tajik 2 Tigrinya 2 Tswana 2 Turkish Sign Language 2 Turkmen 2 Tuvinian 2 Venetian 2 Volapük 2 Walloon 2 Waray (Philippines) 2 Western Frisian 2 Western Mari 2 Western Panjabi 2 Wu Chinese 2 Xhosa 2 Yakut 2 Yue Chinese 2 Abkhazian 1 Achinese 1 Adyghe 1 Afar 1 Akan 1 Akkadian 1 Akuntsu 1 Ancient Greek 1 Apurinã 1 Argentine Sign Language 1 Arpitan 1 Assyrian Neo-Aramaic 1 Aymara 1 Bangladeshi Sign Language 1 Banjar 1 Bhojpuri 1 Bislama 1 Bodo (India) 1 Buginese 1 Central Pashto 1 Chamorro 1 Chavacano 1 Cheyenne 1 Choctaw 1 Chukot 1 Coptic 1 Corsican 1 Cree 1 Creek 1 Crimean Tatar 1 Dzongkha 1 Ewe 1 Extremaduran 1 Fiji Hindi 1 Fijian 1 Friulian 1 Gagauz 1 Gan Chinese 1 Geez 1 German Sign Language 1 Gilaki 1 Greek Sign Language 1 Gulf Arabic 1 Hakha Chin 1 Hakka Chinese 1 Hawaiian 1 Herero 1 Hiri Motu 1 Interlingua (International Auxiliary Language Association) 1 Inupiaq 1 Jamaican Creole English 1 Kabardian 1 Kalaallisut 1 Kanuri 1 Kara-Kalpak 1 Karelian 1 Kashmiri 1 Kashubian 1 Khunsari 1 Kikuyu 1 Komi-Zyrian 1 Kongo 1 Kuanyama 1 Kölsch 1 Ladino 1 Lak 1 Latgalian 1 Ligurian 1 Literary Chinese 1 Luo (Cameroon) 1 Luo (Kenya and Tanzania) 1 Malay (macrolanguage) 1 Maori 1 Marshallese 1 Mbyá Guaraní 1 Min Dong Chinese 1 Modern Greek (1453-) 1 Moroccan Arabic 1 Mundurukú 1 Narom 1 Nauru 1 Navajo 1 Nayini 1 Ndonga 1 Nepali (individual language) 1 Northern Kurdish 1 Norwegian Bokmål 1 Novial 1 Nyanja 1 Official Aramaic (700-300 BCE) 1 Old English (ca. 450-1100) 1 Old French 1 Old Russian 1 Old Turkish 1 Pali 1 Pangasinan 1 Papiamento 1 Pedi 1 Pennsylvania German 1 Pfaelzisch 1 Picard 1 Pitcairn-Norfolk 1 Pontic 1 Portuguse 1 Rajasthani 1 Rundi 1 Rusyn 1 Samoan 1 Sango 1 Saterfriesisch 1 Scots 1 Shona 1 Silesian 1 Skolt Sami 1 Soi 1 South Levantine Arabic 1 Southern Sotho 1 Sranan Tongo 1 Swahili (macrolanguage) 1 Swedish Sign Language 1 Swiss-German Sign Language 1 Tahitian 1 Tetum 1 Tok Pisin 1 Tonga (Tonga Islands) 1 Tosk Albanian 1 Tsonga 1 Tulu 1 Tumbuka 1 Tunisian Arabic 1 Tupinambá 1 Twi 1 Udmurt 1 Venda 1 Veps 1 Vlaams 1 Vlax Romani 1 Votic 1 Warlpiri 1 Zeeuws 1 Zhuang 1 Zulu 1 Northern Huishui Hmong 0 Santali 0

84 dataset results for French