Datasets

8,572 machine learning datasets
Filter by Task
Machine Translation 17 Question Answering 11 Speech Recognition 7 Text Classification 7 Text Summarization 6 Cross-Lingual Transfer 5 Information Retrieval 5 Named Entity Recognition (NER) 5 Text Generation 5 Abstractive Text Summarization 4 Domain Adaptation 4 Language Identification 4 Language Modelling 4 Natural Language Inference 4 Part-Of-Speech Tagging 4 Relation Extraction 4 Token Classification 4 Word Embeddings 4 Cross-Lingual NER 3 Data Augmentation 3 Entity Alignment 3 Fake News Detection 3 Natural Language Understanding 3 Reading Comprehension 3 Relation Classification 3 Retrieval 3 Sentence Embeddings 3 Translation 3 2D Semantic Segmentation 2 Automatic Post-Editing 2 Cross-Lingual Natural Language Inference 2 Cross-Lingual POS Tagging 2 Dialogue Generation 2 Discourse Segmentation 2 Document Summarization 2 FLUE 2 Fact Verification 2 Generative Question Answering 2 Handwritten Text Recognition 2 Image Classification 2 Instance Segmentation 2 K-complex detection 2 Knowledge Base Question Answering 2 Knowledge Graphs 2 Language Acquisition 2 Misinformation 2 Multi-modal Entity Alignment 2 Multilingual NLP 2 Multilingual Named Entity Recognition 2 Multilingual text classification 2 Open-Domain Question Answering 2 Paraphrase Identification 2 Sentence Embedding 2 Sentiment Analysis 2 Sequence-to-sequence Language Modeling 2 Sleep Stage Detection 2 Slot Filling 2 Speech-to-Text Translation 2 Spindle Detection 2 Spoken Language Understanding 2 Text Categorization 2 Text Retrieval 2 2D Object Detection 1 Accented Speech Recognition 1 Argument Mining 1 Arithmetic Reasoning 1 Audio Source Separation 1 Automatic Sleep Stage Classification 1 Automatic Speech Recognition 1 Automatic Speech Recognition (ASR) 1 Binary text classification 1 COVID-19 Diagnosis 1 Chemical Reaction Prediction 1 Chinese Reading Comprehension 1 Chinese Sentence Pair Classification 1 Citation Recommendation 1 Classification 1 Computed Tomography (CT) 1 Connective Detection 1 Constituency Parsing 1 Contour Detection 1 Coreference Resolution 1 Croatian Text Diacritization 1 Cross-Lingual ASR 1 Cross-Lingual Abstractive Summarization 1 Cross-Lingual Bitext Mining 1 Cross-Lingual Document Classification 1 Cross-Lingual Paraphrase Identification 1 Cross-Lingual Question Answering 1 Cross-Lingual Sentiment Classification 1 Cross-lingual zero-shot dependency parsing 1 Czech Text Diacritization 1 Dependency Parsing 1 Dialect Identification 1 Discourse Parsing 1 Document Classification 1 Document Embedding 1 Document Layout Analysis 1 Document Translation 1 Entity Embeddings 1 Entity Linking 1 Event Extraction 1 Few-Shot Audio Classification 1 Few-shot NER 1 French Text Diacritization 1 Handwriting Recognition 1 Hate Speech Detection 1 Humanitarian 1 Hungarian Text Diacritization 1 Implicit Discourse Relation Classification 1 Interpretable Machine Learning 1 Irish Text Diacritization 1 Key Information Extraction 1 Keyword Spotting 1 Knowledge Graph Completion 1 LABELED_DEPENDENCIES 1 LEMMA 1 Latvian Text Diacritization 1 Line Segment Detection 1 Long Form Question Answering 1 MORPH 1 Machine Reading Comprehension 1 Math Word Problem Solving 1 Max-Shot Cross-Lingual Image-to-Text Retrieval 1 Max-Shot Cross-Lingual Text-to-Image Retrieval 1 Max-Shot Cross-Lingual Visual Natural Language Inference 1 Max-Shot Cross-Lingual Visual Question Answering 1 Max-Shot Cross-Lingual Visual Reasoning 1 Medical Diagnosis 1 Medical Named Entity Recognition 1 Motor Imagery Decoding (left-hand vs right-hand) 1 Multi-task Language Understanding 1 Multilabel Text Classification 1 Multilingual Machine Comprehension in English Hindi 1 Multimodal Lexical Translation 1 Multimodal Machine Translation 1 Multimodal Text Prediction 1 Multiple Choice Question Answering (MCQA) 1 Multiple-choice 1 Multiview Clustering 1 NER 1 Named Entity Recognition 1 Natural Questions 1 Nested Named Entity Recognition 1 News Classification 1 Node Classification 1 POS 1 Paraphrase Generation 1 Polish Text Diacritization 1 Pretrained Multilingual Language Models 1 Question-Answer-Generation 1 Reading Comprehension (Few-Shot) 1 Reading Comprehension (One-Shot) 1 Reading Comprehension (Zero-Shot) 1 Relation Linking 1 Romanian Text Diacritization 1 SENTS 1 Science Question Answering 1 Semantic Role Labeling 1 Semantic Segmentation 1 Sentence-Pair Classification 1 Sign Language Production 1 Sign Language Recognition 1 Sign Language Translation 1 Single-step retrosynthesis 1 Sleep Staging 1 Slovak Text Diacritization 1 Spanish Text Diacritization 1 Speaker Identification 1 Speaker Verification 1 Speech Synthesis 1 Speech-to-Speech Translation 1 Spoken language identification 1 Stance Detection 1 TAG 1 Table Recognition 1 Table Retrieval 1 Temporal Relation Classification 1 Temporal Relation Extraction 1 Temporal Tagging 1 Text Matching 1 Text Pair Classification 1 Text Style Transfer 1 Text-To-SQL 1 Text-To-Speech Synthesis 1 Topic Classification 1 Translation deu-eng 1 Translation eng-deu 1 Turkish Text Diacritization 1 UNLABELED_DEPENDENCIES 1 Unsupervised Machine Translation 1 Urdu Speech Recognition 1 Video Captioning 1 Vietnamese Machine Reading Comprehension 1 Vietnamese Text Diacritization 1 Visual Reasoning 1 W-R-L-D Sleep Staging 1 W-R-N Sleep Staging 1 Word Alignment 1 Word Sense Disambiguation 1 XLM-R 1 Zero-Shot Cross-Lingual Image-to-Text Retrieval 1 Zero-Shot Cross-Lingual Text-to-Image Retrieval 1 Zero-Shot Cross-Lingual Transfer 1 Zero-Shot Cross-Lingual Visual Natural Language Inference 1 Zero-Shot Cross-Lingual Visual Question Answering 1 Zero-Shot Cross-Lingual Visual Reasoning 1 Zero-Shot Machine Translation 1 Zero-shot Cross-lingual Fact-checking 1 speech-recognition 1 text annotation 1
Filter by Language (clear)
French English 2486 Chinese 285 German 161 Russian 108 Spanish 108 Arabic 77 Japanese 77 Italian 75 Portuguese 64 Hindi 59 Korean 53 Turkish 48 Vietnamese 43 Dutch 41 Bengali 40 Persian 40 Polish 37 Tamil 37 Czech 35 Indonesian 32 Danish 29 Finnish 29 Romanian 29 Telugu 27 Urdu 25 Marathi 24 Multilingual 24 Hungarian 23 Malayalam 22 Swedish 21 Thai 21 Estonian 20 Greek 20 Hebrew 20 Basque 19 Bulgarian 19 Gujarati 18 Mandarin Chinese 18 Kannada 17 Punjabi 16 Slovak 16 Ukrainian 16 Norwegian 15 Slovenian 15 Catalan 14 Croatian 14 Latvian 14 Lithuanian 13 Swahili 13 Assamese 12 Kazakh 12 Amharic 11 Serbian 11 Albanian 10 Armenian 10 Iranian Persian 10 Irish 9 Kurdish 9 Maltese 9 Oriya (macrolanguage) 9 Sinhala 9 Welsh 9 American Sign Language 8 Breton 8 Georgian 8 Icelandic 8 Macedonian 8 Mongolian 8 Sanskrit 8 Yoruba 8 Afrikaans 7 Azerbaijani 7 Burmese 7 Esperanto 7 Hausa 7 Igbo 7 Odia 7 Uzbek 7 Galician 6 Malay (individual language) 6 Oromo 6 Sindhi 6 Somali 6 Tagalog 6 Bambara 5 Belarusian 5 Central Kurdish 5 Egyptian Arabic 5 Filipino 5 Guarani 5 Haitian 5 Javanese 5 Latin 5 Malagasy 5 Nepali (macrolanguage) 5 Norwegian Bokmål 5 Norwegian Nynorsk 5 Quechua 5 Scottish Gaelic 5 Serbo-Croatian 5 Standard Arabic 5 Sundanese 5 Tigrinya 5 Wolof 5 Bosnian 4 Central Khmer 4 Chechen 4 Dhivehi 4 Fulah 4 Ganda 4 Iloko 4 Kabyle 4 Kinyarwanda 4 Kirghiz 4 Lao 4 Lingala 4 Nigerian Pidgin 4 South Azerbaijani 4 Tatar 4 Tibetan 4 Upper Sorbian 4 Aragonese 3 Bashkir 3 Bavarian 3 Bishnupriya 3 Cebuano 3 Chuvash 3 Erzya 3 Faroese 3 Fon 3 Goan Konkani 3 Maithili 3 Malay (macrolanguage) 3 Nyanja 3 Romansh 3 Russia Buriat 3 Swati 3 Swiss German 3 Tajik 3 Tsonga 3 Tswana 3 Twi 3 Uighur 3 Waray (Philippines) 3 Western Panjabi 3 Xhosa 3 Yiddish 3 Asturian 2 Avaric 2 Aymara 2 Bangala 2 Bhojpuri 2 Central Bikol 2 Cherokee 2 Church Slavic 2 Cornish 2 Dimli (individual language) 2 Eastern Mari 2 Ewe 2 German Sign Language 2 Gothic 2 Gulf Arabic 2 Ido 2 Interlingue 2 Inuktitut 2 Jejueo 2 Kalaallisut 2 Kalmyk 2 Karachay-Balkar 2 Komi 2 Komi-Permyak 2 Lezghian 2 Limburgan 2 Livvi 2 Lojban 2 Lombard 2 Low German 2 Lower Sorbian 2 Luo (Kenya and Tanzania) 2 Luxembourgish 2 Manipuri 2 Manx 2 Maori 2 Mazanderani 2 Minangkabau 2 Mingrelian 2 Mirandese 2 Modern Greek 2 Moksha 2 Moroccan Arabic 2 Naxi 2 Neapolitan 2 Nepali (individual language) 2 Newari 2 Northern Frisian 2 Northern Kurdish 2 Northern Luri 2 Northern Sami 2 Occitan (post 1500) 2 Ossetian 2 Pampanga 2 Pedi 2 Piemontese 2 Pushto 2 Rundi 2 Sardinian 2 Shona 2 Sichuan Yi 2 Sicilian 2 Southern Sotho 2 Swiss-German Sign Language 2 Tai 2 Tosk Albanian 2 Turkish Sign Language 2 Turkmen 2 Tuvinian 2 Udmurt 2 Venetian 2 Volapük 2 Walloon 2 Western Frisian 2 Western Mari 2 Wu Chinese 2 Yakut 2 Yue Chinese 2 Zulu 2 Abkhazian 1 Achinese 1 Adyghe 1 Afar 1 Akan 1 Akkadian 1 Akuntsu 1 Ancient Greek 1 Ancient Hebrew 1 Apurinã 1 Argentine Sign Language 1 Arpitan 1 Assyrian Neo-Aramaic 1 Bangladeshi Sign Language 1 Banjar 1 Bemba (Zambia) 1 Bislama 1 Bodo (India) 1 Buginese 1 Central Pashto 1 Chamorro 1 Chavacano 1 Cheyenne 1 Choctaw 1 Chukot 1 Congo Swahili 1 Coptic 1 Corsican 1 Cree 1 Creek 1 Crimean Tatar 1 Dogri (macrolanguage) 1 Dzongkha 1 Extremaduran 1 Fiji Hindi 1 Fijian 1 Friulian 1 Gagauz 1 Gan Chinese 1 Geez 1 Gilaki 1 Greek Sign Language 1 Hakha Chin 1 Hakka Chinese 1 Halh Mongolian 1 Hawaiian 1 Herero 1 Hiri Motu 1 Interlingua (International Auxiliary Language Association) 1 Inupiaq 1 Jamaican Creole English 1 Kabardian 1 Kabuverdianu 1 Kachin 1 Kanuri 1 Kara-Kalpak 1 Karelian 1 Kashmiri 1 Kashubian 1 Khunsari 1 Kikuyu 1 Komi-Zyrian 1 Kongo 1 Krio 1 Kuanyama 1 Kölsch 1 Ladino 1 Lak 1 Latgalian 1 Ligurian 1 Literary Chinese 1 Lozi 1 Lunda 1 Luo (Cameroon) 1 Lushai 1 Marshallese 1 Mbyá Guaraní 1 Mesopotamian Arabic 1 Min Dong Chinese 1 Modern Greek (1453-) 1 Mundurukú 1 Najdi Arabic 1 Narom 1 Nauru 1 Navajo 1 Nayini 1 Ndonga 1 Nigerian Fulfulde 1 North Azerbaijani 1 North Levantine Arabic 1 Northern Uzbek 1 Novial 1 Official Aramaic (700-300 BCE) 1 Old English (ca. 450-1100) 1 Old French 1 Old Russian 1 Old Turkish 1 Pali 1 Pangasinan 1 Papiamento 1 Pennsylvania German 1 Pfaelzisch 1 Picard 1 Pitcairn-Norfolk 1 Plateau Malagasy 1 Pontic 1 Rajasthani 1 Rusyn 1 Samoan 1 Sango 1 Saterfriesisch 1 Scots 1 Shan 1 Silesian 1 Skolt Sami 1 Soi 1 South Levantine Arabic 1 Southern Pashto 1 Sranan Tongo 1 Standard Latvian 1 Swahili (macrolanguage) 1 Swedish Sign Language 1 Tahitian 1 Tetum 1 Tok Pisin 1 Tonga (Tonga Islands) 1 Tonga (Zambia) 1 Tulu 1 Tumbuka 1 Tunisian Arabic 1 Tupinambá 1 Venda 1 Veps 1 Vlaams 1 Vlax Romani 1 Votic 1 Warlpiri 1 West Central Oromo 1 Zaza 1 Zeeuws 1 Zhuang 1 Dogri (individual language) 0 Northern Huishui Hmong 0 Portuguse 0 Saidi Arabic 0 Santali 0

137 dataset results for French