Datasets

13,736 machine learning datasets
Filter by Task
Machine Translation 17 Question Answering 11 Speech Recognition 11 Automatic Speech Recognition (ASR) 8 Text Generation 8 Language Modelling 7 Named Entity Recognition (NER) 7 Speaker Verification 7 Text Summarization 7 Automatic Phoneme Recognition 6 Natural Language Inference 6 Relation Extraction 6 Abstractive Text Summarization 5 Bandwidth Extension 5 Cross-Lingual Transfer 5 Information Retrieval 5 Text Classification 5 Text Retrieval 5 Word Embeddings 5 Automatic Speech Recognition 4 Language Identification 4 Part-Of-Speech Tagging 4 Classification 3 Cross-Lingual NER 3 Data Augmentation 3 Domain Adaptation 3 Entity Alignment 3 Entity Linking 3 Fake News Detection 3 Misinformation 3 Relation Classification 3 Retrieval 3 Sentence Embeddings 3 Sentence-Pair Classification 3 Token Classification 3 Translation 3 2 10-shot image generation 2 16k 2 2D Semantic Segmentation 2 Anomaly Detection In Surveillance Videos 2 Automatic Post-Editing 2 Coreference Resolution 2 Cross-Lingual Abstractive Summarization 2 Cross-Lingual Natural Language Inference 2 Cross-Lingual POS Tagging 2 Discourse Segmentation 2 Document Summarization 2 Event Extraction 2 FLUE 2 Fact Verification 2 Generative Question Answering 2 Handwritten Text Recognition 2 Image Classification 2 Instance Segmentation 2 K-complex detection 2 Knowledge Base Question Answering 2 Knowledge Graphs 2 Language Acquisition 2 Multi-modal Entity Alignment 2 Multilingual NLP 2 Multilingual Named Entity Recognition 2 NER 2 Natural Language Understanding 2 Open-Domain Question Answering 2 Paraphrase Identification 2 RTE 2 Reading Comprehension 2 Sentence Embedding 2 Sentiment Analysis 2 Sign Language Production 2 Sign Language Translation 2 Sleep Stage Detection 2 Slot Filling 2 Speech-to-Text Translation 2 Spindle Detection 2 Spoken Language Understanding 2 Text Categorization 2 Video Anomaly Detection 2 1 Image, 2*2 Stitching 1 2D Object Detection 1 2D Panoptic Segmentation 1 3D Face Modelling 1 3DGS 1 Accented Speech Recognition 1 Adversarial Robustness 1 Anomaly Detection 1 Argument Mining 1 Arithmetic Reasoning 1 Audio Classification 1 Audio Signal Processing 1 Audio Source Separation 1 Audio Synthesis 1 Automatic Lyrics Transcription 1 Automatic Sleep Stage Classification 1 Bias Detection 1 Binary Classification 1 Binary text classification 1 COVID-19 Diagnosis 1 Chemical Reaction Prediction 1 Chinese Sentence Pair Classification 1 Citation Recommendation 1 Code Generation 1 Computed Tomography (CT) 1 Connective Detection 1 Constituency Parsing 1 Contour Detection 1 Croatian Text Diacritization 1 Cross-Language Text Summarization 1 Cross-Lingual ASR 1 Cross-Lingual Bitext Mining 1 Cross-Lingual Document Classification 1 Cross-Lingual Entity Linking 1 Cross-Lingual Paraphrase Identification 1 Cross-Lingual Question Answering 1 Cross-Lingual Sentiment Classification 1 Cross-lingual zero-shot dependency parsing 1 Czech Text Diacritization 1 Data Summarization 1 Dependency Parsing 1 Dialect Identification 1 Dialogue Generation 1 Discourse Parsing 1 Document Classification 1 Document Embedding 1 Document Layout Analysis 1 Document Translation 1 Document-level Event Extraction 1 Emotion Classification 1 Emotion Recognition 1 Entity Embeddings 1 Event Argument Extraction 1 Event Detection 1 Few-Shot Audio Classification 1 Few-shot NER 1 Fill Mask 1 French Text Diacritization 1 Gender Bias Detection 1 Gender Classification 1 Gender Prediction 1 Gloss-free Sign Language Translation 1 Handwriting Recognition 1 Hate Speech Detection 1 Humanitarian 1 Hungarian Text Diacritization 1 Image Captioning 1 Image-to-Text Retrieval 1 Implicit Discourse Relation Classification 1 Interpretable Machine Learning 1 Irish Text Diacritization 1 Key Information Extraction 1 Key-value Pair Extraction 1 Keyphrase Extraction 1 Keyphrase Generation 1 Keyword Extraction 1 Keyword Spotting 1 Knowledge Graph Completion 1 LABELED_DEPENDENCIES 1 LEMMA 1 Latvian Text Diacritization 1 License Plate Detection 1 Line Segment Detection 1 Link Prediction 1 Long Form Question Answering 1 MORPH 1 Masked Language Modeling 1 Math Word Problem Solving 1 Mathematical Reasoning 1 Max-Shot Cross-Lingual Image-to-Text Retrieval 1 Max-Shot Cross-Lingual Text-to-Image Retrieval 1 Max-Shot Cross-Lingual Visual Natural Language Inference 1 Max-Shot Cross-Lingual Visual Question Answering 1 Max-Shot Cross-Lingual Visual Reasoning 1 Medical Diagnosis 1 Medical Named Entity Recognition 1 Medical Report Generation 1 Motor Imagery Decoding (left-hand vs right-hand) 1 Multi-task Language Understanding 1 Multilabel Text Classification 1 Multilingual text classification 1 Multimodal Abstractive Text Summarization 1 Multimodal Lexical Translation 1 Multimodal Machine Translation 1 Multimodal Reasoning 1 Multimodal Text Prediction 1 Multiple Choice Question Answering (MCQA) 1 Multiple Instance Learning 1 Multiple-choice 1 Multiview Clustering 1 NMT 1 Named Entity Recognition 1 Nested Named Entity Recognition 1 News Classification 1 News Retrieval 1 News Summarization 1 Node Classification 1 Open-Ended Question Answering 1 POS 1 Paraphrase Generation 1 Polish Text Diacritization 1 Propaganda detection 1 Question-Answer-Generation 1 Relation Linking 1 Resynthesis 1 Robust Speech Recognition 1 Romanian Text Diacritization 1 SENTS 1 Science Question Answering 1 Semantic Communication 1 Semantic Role Labeling 1 Semantic Segmentation 1 Sequence-to-sequence Language Modeling 1 Sign Language Recognition 1 Singing Voice Synthesis 1 Single-step retrosynthesis 1 Sleep Staging 1 Slovak Text Diacritization 1 Spanish Text Diacritization 1 Speaker Identification 1 Speech Denoising 1 Speech Enhancement 1 Speech Separation 1 Speech Synthesis 1 Speech-to-Speech Translation 1 Speech-to-Text 1 Spoken language identification 1 Stance Detection 1 Style Transfer 1 TAG 1 Table Recognition 1 Table Retrieval 1 Tabular Data Generation 1 Temporal Relation Classification 1 Temporal Relation Extraction 1 Temporal Tagging 1 Text Matching 1 Text Pair Classification 1 Text Style Transfer 1 Text-To-SQL 1 Text-To-Speech Synthesis 1 Text2text Generation 1 TinyQA Benchmark++ 1 Topic Classification 1 Translation deu-eng 1 Translation eng-deu 1 Turkish Text Diacritization 1 UIE 1 UNLABELED_DEPENDENCIES 1 Unsupervised Machine Translation 1 Video Action Detection 1 Video Captioning 1 Vietnamese Text Diacritization 1 Visual Question Answering 1 Visual Question Answering (VQA) 1 Visual Reasoning 1 Vocal technique classification 1 Voice Conversion 1 W-R-L-D Sleep Staging 1 W-R-N Sleep Staging 1 Word Alignment 1 Word Sense Disambiguation 1 Zero-Shot Cross-Lingual Image-to-Text Retrieval 1 Zero-Shot Cross-Lingual Text-to-Image Retrieval 1 Zero-Shot Cross-Lingual Transfer 1 Zero-Shot Cross-Lingual Visual Natural Language Inference 1 Zero-Shot Cross-Lingual Visual Question Answering 1 Zero-Shot Cross-Lingual Visual Reasoning 1 Zero-Shot Machine Translation 1 Zero-Shot Multi-Speaker TTS 1 Zero-shot Cross-lingual Fact-checking 1 Zero-shot Text-to-Image Retrieval 1 automatic-speech-translation 1 de-en 1 es-en 1 fr-en 1 speech-recognition 1 text annotation 1
Filter by Language (clear)
French English 4155 Chinese 467 Spanish 237 German 201 Russian 146 Japanese 112 Arabic 109 Italian 100 Portuguese 93 Hindi 82 Vietnamese 71 Korean 68 Bengali 66 Turkish 63 Persian 60 Dutch 51 Tamil 49 Polish 48 Indonesian 45 Czech 41 Danish 35 Finnish 35 Telugu 35 Romanian 34 Urdu 34 Thai 33 Marathi 31 Hungarian 29 Multilingual 29 Swedish 29 Greek 26 Gujarati 26 Hebrew 25 Mandarin Chinese 25 Estonian 24 Malayalam 23 Ukrainian 23 Bulgarian 22 Basque 21 Punjabi 20 Kannada 19 Catalan 18 Croatian 18 Slovak 18 Swahili 18 Lithuanian 17 Latvian 16 Norwegian 16 Serbian 16 Slovenian 16 Kazakh 15 Amharic 14 Iranian Persian 14 Albanian 12 Kurdish 12 Assamese 11 Burmese 11 Sinhala 11 Tagalog 11 Yoruba 11 Armenian 10 Azerbaijani 10 Filipino 10 Irish 10 Macedonian 10 Welsh 10 American Sign Language 9 Georgian 9 Maltese 9 Mongolian 9 Old Spanish 9 Sanskrit 9 Breton 8 Galician 8 Hausa 8 Igbo 8 Odia 8 Oriya (macrolanguage) 8 Esperanto 7 Nepali (individual language) 7 Nepali (macrolanguage) 7 Oromo 7 Somali 7 Uzbek 7 Afrikaans 6 Bambara 6 Belarusian 6 Central Khmer 6 Guarani 6 Icelandic 6 Javanese 6 Malagasy 6 Nigerian Pidgin 6 Serbo-Croatian 6 Standard Arabic 6 Sundanese 6 Tibetan 6 Western Panjabi 6 Wolof 6 Bosnian 5 Central Kurdish 5 Fon 5 Ganda 5 Haitian 5 Latin 5 Malay (individual language) 5 Norwegian Nynorsk 5 Quechua 5 Scottish Gaelic 5 Sindhi 5 Tigrinya 5 Aymara 4 Bangala 4 Bavarian 4 Chechen 4 Dhivehi 4 Egyptian Arabic 4 Ewe 4 Kabyle 4 Lingala 4 Norwegian Bokmål 4 Tatar 4 Tetum 4 Tswana 4 Twi 4 Upper Sorbian 4 Xhosa 4 Aragonese 3 Bashkir 3 Bishnupriya 3 Cebuano 3 Central Pashto 3 Chuvash 3 Erzya 3 Faroese 3 Fulah 3 Goan Konkani 3 Iloko 3 Interlingue 3 Kinyarwanda 3 Kirghiz 3 Lao 3 Luo (Kenya and Tanzania) 3 Maithili 3 Modern Greek 3 Nyanja 3 Occitan (post 1500) 3 Romansh 3 Rundi 3 Russia Buriat 3 Sardinian 3 South Azerbaijani 3 Southern Pashto 3 Swiss German 3 Turkmen 3 Uighur 3 Yiddish 3 Zulu 3 Ancient Greek 2 Argentine Sign Language 2 Asturian 2 Avaric 2 Bangladeshi Sign Language 2 Bhojpuri 2 Central Bikol 2 Cherokee 2 Church Slavic 2 Cornish 2 Corsican 2 Dimli (individual language) 2 Eastern Mari 2 German Sign Language 2 Gothic 2 Gulf Arabic 2 Ido 2 Inuktitut 2 Jamaican Creole English 2 Jejueo 2 Kalaallisut 2 Kalmyk 2 Karachay-Balkar 2 Komi 2 Komi-Permyak 2 Lezghian 2 Limburgan 2 Livvi 2 Lojban 2 Lombard 2 Low German 2 Lower Sorbian 2 Luxembourgish 2 Malay (macrolanguage) 2 Manipuri 2 Manx 2 Maori 2 Mazanderani 2 Minangkabau 2 Mingrelian 2 Mirandese 2 Moksha 2 Mossi 2 Naxi 2 Neapolitan 2 Newari 2 Northern Frisian 2 Northern Kurdish 2 Northern Luri 2 Northern Sami 2 Ossetian 2 Pampanga 2 Piemontese 2 Pushto 2 Shona 2 Sichuan Yi 2 Sicilian 2 Swati 2 Swiss-German Sign Language 2 Tai 2 Tajik 2 Tsonga 2 Turkish Sign Language 2 Tuvinian 2 Udmurt 2 Venda 2 Venetian 2 Volapük 2 Walloon 2 Waray (Philippines) 2 Western Frisian 2 Western Mari 2 Wu Chinese 2 Yakut 2 Yue Chinese 2 Abkhazian 1 Achinese 1 Adyghe 1 Afar 1 Akan 1 Akkadian 1 Akuntsu 1 Ambonese Malay 1 Ancient Hebrew 1 Andaman Creole Hindi 1 Apurinã 1 Arpitan 1 Assyrian Neo-Aramaic 1 Banjar 1 Bemba (Zambia) 1 Bislama 1 Bodo (India) 1 Buginese 1 Chamorro 1 Chavacano 1 Cheyenne 1 Choctaw 1 Chukot 1 Congo Swahili 1 Coptic 1 Cree 1 Creek 1 Crimean Tatar 1 Cusco Quechua 1 Dogri (macrolanguage) 1 Dzongkha 1 Extremaduran 1 Fiji Hindi 1 Fijian 1 French Sign Language 1 Friulian 1 Gagauz 1 Gan Chinese 1 Geez 1 Gilaki 1 Greek Sign Language 1 Hakha Chin 1 Hakka Chinese 1 Halh Mongolian 1 Hawaiian 1 Herero 1 Hiri Motu 1 Interlingua (International Auxiliary Language Association) 1 Inupiaq 1 Kabardian 1 Kanuri 1 Kara-Kalpak 1 Karelian 1 Kashmiri 1 Kashubian 1 Khunsari 1 Kikuyu 1 Komi-Zyrian 1 Kongo 1 Krio 1 Kuanyama 1 Kupang Malay 1 Kölsch 1 Ladino 1 Lak 1 Latgalian 1 Ligurian 1 Literary Chinese 1 Lozi 1 Lunda 1 Luo (Cameroon) 1 Lushai 1 Makasar 1 Malayic Dayak 1 Marshallese 1 Mbyá Guaraní 1 Min Dong Chinese 1 Modern Greek (1453-) 1 Moroccan Arabic 1 Mundurukú 1 Narom 1 Nauru 1 Navajo 1 Nayini 1 Ndonga 1 Northern Pashto 1 Novial 1 Official Aramaic (700-300 BCE) 1 Old English (ca. 450-1100) 1 Old French 1 Old Russian 1 Old Turkish 1 Pali 1 Pangasinan 1 Papiamento 1 Pedi 1 Pennsylvania German 1 Pfaelzisch 1 Picard 1 Pitcairn-Norfolk 1 Pontic 1 Rajasthani 1 Rusyn 1 Samoan 1 Sango 1 Saterfriesisch 1 Scots 1 Silesian 1 Skolt Sami 1 Soi 1 South Levantine Arabic 1 Southern Sotho 1 Sranan Tongo 1 Swahili (macrolanguage) 1 Swedish Sign Language 1 Tahitian 1 Tok Pisin 1 Tonga (Tonga Islands) 1 Tonga (Zambia) 1 Tosk Albanian 1 Tulu 1 Tumbuka 1 Tunisian Arabic 1 Tupinambá 1 Uab Meto 1 Veps 1 Vlaams 1 Vlax Romani 1 Votic 1 Warlpiri 1 Zaza 1 Zeeuws 1 Zhuang 1 Dogri (individual language) 0 Kabuverdianu 0 Kachin 0 Lingua Franca 0 Mesopotamian Arabic 0 Najdi Arabic 0 Nigerian Fulfulde 0 North Azerbaijani 0 North Levantine Arabic 0 Northern Huishui Hmong 0 Northern Uzbek 0 Plateau Malagasy 0 Portuguse 0 Saidi Arabic 0 Santali 0 Shan 0 Standard Latvian 0 Thai Song 0 Tunisian Sign Language 0 West Central Oromo 0

196 dataset results for French