Datasets

9,765 machine learning datasets

4 dataset results for Token Classification AND French