Datasets

8,561 machine learning datasets
Filter by Task
Question Answering 274 Language Modelling 113 Text Generation 112 Text Classification 96 Named Entity Recognition (NER) 81 Reading Comprehension 78 Visual Question Answering (VQA) 75 Text Summarization 68 Natural Language Inference 66 Sentiment Analysis 56 Machine Translation 54 Information Retrieval 53 Natural Language Understanding 52 Relation Extraction 49 Common Sense Reasoning 43 Coreference Resolution 34 Machine Reading Comprehension 33 Abstractive Text Summarization 32 Hate Speech Detection 31 Entity Linking 30 Image Captioning 30 Word Embeddings 29 Data Augmentation 28 Semantic Parsing 27 Classification 25 Code Generation 25 Document Summarization 25 Misinformation 24 Open-Domain Question Answering 23 Video Question Answering 23 Dialogue Generation 21 Fake News Detection 20 Speech Recognition 20 Zero-Shot Learning 20 Knowledge Graphs 19 Part-Of-Speech Tagging 19 Question Generation 19 Stance Detection 19 Video Captioning 19 Video Retrieval 19 Data-to-Text Generation 18 Sequence-to-sequence Language Modeling 18 Visual Reasoning 18 Domain Adaptation 17 Image Retrieval 17 Paraphrase Identification 17 Relation Classification 17 Task-Oriented Dialogue Systems 17 Token Classification 17 Emotion Recognition 16 Multi-Task Learning 16 Retrieval 16 Semantic Textual Similarity 16 Slot Filling 16 Image Classification 15 Intent Detection 15 Recommendation Systems 15 Text Simplification 15 Handwriting Recognition 13 Paraphrase Generation 13 Word Sense Disambiguation 13 Zero-shot Text Search 13 Cross-Lingual Transfer 12 Decision Making 12 Grammatical Error Correction 12 Optical Character Recognition (OCR) 12 Sarcasm Detection 12 Dialogue State Tracking 11 Emotion Classification 11 Event Extraction 11 Fact Verification 11 Few-Shot Learning 11 Language Identification 11 Link Prediction 11 Multi-Document Summarization 11 Sentence Classification 11 Translation 11 Conversational Response Selection 10 Dependency Parsing 10 Emotion Recognition in Conversation 10 Logical Reasoning 10 Multi-Label Classification 10 NER 10 Object Detection 10 Video Understanding 10 Visual Dialog 10 Aspect-Based Sentiment Analysis (ABSA) 9 Automatic Post-Editing 9 Intent Classification 9 Math Word Problem Solving 9 Multi-Label Text Classification 9 Nested Named Entity Recognition 9 News Classification 9 Open Information Extraction 9 Open-Domain Dialog 9 Semantic Segmentation 9 Text-to-Image Generation 9 Vision and Language Navigation 9 Abusive Language 8 Answer Selection 8 Automatic Speech Recognition (ASR) 8 Document Classification 8 Entity Disambiguation 8 Entity Typing 8 Handwriting generation 8 Handwritten Text Recognition 8 Joint Entity and Relation Extraction 8 Named Entity Recognition 8 Passage Retrieval 8 Referring Expression Segmentation 8 Scene Text Recognition 8 Semantic Role Labeling 8 Semantic Similarity 8 Sentence Embeddings 8 Speech Synthesis 8 Text-To-Speech Synthesis 8 Code Search 7 Cross-Modal Retrieval 7 Dialogue Understanding 7 Extreme Summarization 7 Fact Checking 7 Image Generation 7 Knowledge Base Question Answering 7 Mathematical Question Answering 7 Mathematical Reasoning 7 Node Classification 7 Opinion Mining 7 Response Generation 7 Scene Text Detection 7 Speech Emotion Recognition 7 Spoken Language Understanding 7 Temporal Tagging 7 Text-To-SQL 7 Abstractive Dialogue Summarization 6 Ad-hoc video search 6 Chinese Reading Comprehension 6 Community Question Answering 6 Conversational Question Answering 6 Explanation Generation 6 Fairness 6 Image-to-Text Retrieval 6 Medical Named Entity Recognition 6 Medical Visual Question Answering 6 Open Intent Discovery 6 Phrase Grounding 6 Sign Language Translation 6 Table-to-Text Generation 6 Topic Classification 6 Toxic Comment Classification 6 Vision-Language Navigation 6 Zero-Shot Video Retrieval 6 AMR Parsing 5 AMR-to-Text Generation 5 Automated Theorem Proving 5 Chinese Named Entity Recognition 5 Citation Recommendation 5 Cross-Lingual Question Answering 5 Dialogue Act Classification 5 Discourse Parsing 5 Event Detection 5 Extractive Text Summarization 5 Goal-Oriented Dialog 5 Language Acquisition 5 Learning-To-Rank 5 Linguistic Acceptability 5 Medical Relation Extraction 5 Meeting Summarization 5 Moment Retrieval 5 Multimodal Emotion Recognition 5 Multimodal Sentiment Analysis 5 Multiple Choice Question Answering (MCQA) 5 Multiple-choice 5 SSTOD 5 Scientific Document Summarization 5 Self-Supervised Learning 5 Sign Language Recognition 5 Stochastic Optimization 5 Topic Models 5 Twitter Sentiment Analysis 5 Visual Commonsense Reasoning 5 Visual Navigation 5 Visual Question Answering 5 Abuse Detection 4 Action Recognition 4 Anomaly Detection 4 Argument Mining 4 Audio to Text Retrieval 4 Biomedical Information Retrieval 4 Chart Question Answering 4 Chatbot 4 Code Completion 4 Code Summarization 4 Code Translation 4 Composed Image Retrieval (CoIR) 4 Constituency Parsing 4 Dense Video Captioning 4 Dialogue Evaluation 4 Discourse Segmentation 4 Document Ranking 4 Entity Resolution 4 Ethics 4 Few-Shot Relation Classification 4 Genre classification 4 Instruction Following 4 KG-to-Text Generation 4 Lipreading 4 Low Resource Named Entity Recognition 4 Medical Concept Normalization 4 Memorization 4 Multi-class Classification 4 Natural Language Visual Grounding 4 Natural Questions 4 Nested Mention Recognition 4 Paper generation 4 Program Repair 4 Reading Comprehension (Few-Shot) 4 Reading Comprehension (One-Shot) 4 Reading Comprehension (Zero-Shot) 4 Referring Expression Comprehension 4 Relational Reasoning 4 Sentence Embedding 4 Spelling Correction 4 Story Generation 4 Style Transfer 4 Systematic Generalization 4 Table Detection 4 Temporal Relation Classification 4 Term Extraction 4 Text Matching 4 Text Style Transfer 4 Text to Audio Retrieval 4 Text to Video Retrieval 4 Translation deu-eng 4 Translation eng-deu 4 Video Generation 4 Visual Relationship Detection 4 Weakly-Supervised Named Entity Recognition 4 Zero-Shot Cross-Lingual Transfer 4 Zero-Shot Video Question Answer 4 2D Object Detection 3 Active Learning 3 Answer Generation 3 Arithmetic Reasoning 3 Aspect Term Extraction and Sentiment Classification 3 Aspect-Category-Opinion-Sentiment Quadruple Extraction 3 Audio-Visual Speech Recognition 3 Automatic Speech Recognition 3 Binary Relation Extraction 3 Chinese Word Segmentation 3 Chunking 3 Citation Intent Classification 3 Citation Prediction 3 Code Documentation Generation 3 Code Repair 3 Conditional Text Generation 3 Continual Learning 3 Cross Document Coreference Resolution 3 Cross-Lingual NER 3 Definition Extraction 3 Depression Detection 3 Dialect Identification 3 Entity Alignment 3 Entity Retrieval 3 Event Coreference Resolution 3 Explainable artificial intelligence 3 FG-1-PG-1 3 Few-Shot Image Classification 3 Few-Shot NLI 3 Few-shot NER 3 Formal Logic 3 Game of Sudoku 3 Gender Bias Detection 3 Generative Question Answering 3 Goal-Oriented Dialogue Systems 3 Grammatical Error Detection 3 Graph Classification 3 Graph Embedding 3 Humor Detection 3 Instance Segmentation 3 Intent Discovery 3 Joint Event and Temporal Relation Extraction 3 Key Information Extraction 3 Keyword Extraction 3 Lemmatization 3 Lexical Entailment 3 Lip Reading 3 Long-range modeling 3 Low-Resource Neural Machine Translation 3 Masked Language Modeling 3 Meme Classification 3 Meta-Learning 3 Multi Label Text Classification 3 Multi-Domain Recommender Systems 3 Multi-Hop Reading Comprehension 3 Multi-hop Question Answering 3 Multi-modal Dialogue Generation 3 Multi-task Language Understanding 3 Multilingual NLP 3 Multimodal Deep Learning 3 Multimodal Intent Recognition 3 Multimodal Machine Translation 3 Multiple Instance Learning 3 Natural Language Moment Retrieval 3 News Generation 3 News Summarization 3 Out of Distribution (OOD) Detection 3 Person Re-Identification 3 Person Search 3 Product Recommendation 3 Program Synthesis 3 RTE 3 Recognizing Emotion Cause in Conversations 3 Referring Expression 3 Representation Learning 3 Review Generation 3 Semantic Image-Text Similarity 3 Sentiment Classification 3 Source Code Summarization 3 Speaker Diarization 3 Speech Separation 3 Stance Classification 3 Structured Prediction 3 Temporal Action Localization 3 Temporal Relation Extraction 3 Text Categorization 3 Text Clustering 3 Text Retrieval 3 Text Segmentation 3 Text based Person Retrieval 3 Text-to-Code Generation 3 Transfer Learning 3 Transliteration 3 Unconstrained Lip-synchronization 3 Unsupervised Extractive Summarization 3 Unsupervised Machine Translation 3 Unsupervised Text Classification 3 Video Description 3 Video-Text Retrieval 3 Visual Entailment 3 Visual Grounding 3 Visual Speech Recognition 3 Weakly Supervised Classification 3 Word Alignment 3 Zero-Shot Composed Image Retrieval (ZS-CIR) 3 text similarity 3 text2text-generation 3 2D Semantic Segmentation 2 3D Anomaly Detection 2 AbbreviationDetection 2 Adversarial Attack 2 Adversarial Robustness 2 Aggression Identification 2 Arabic Sentiment Analysis 2 Argument Retrieval 2 Art Analysis 2 Aspect Category Detection 2 Aspect Category Polarity 2 Aspect Extraction 2 Astronomy 2 Audio captioning 2 Autonomous Driving 2 Bayesian Inference 2 Bias Detection 2 Binary text classification 2 Causal Inference 2 Chinese Sentence Pair Classification 2 Claim Extraction with Stance Classification (CESC) 2 Claim Verification 2 Click-Through Rate Prediction 2 Cloze (multi-choices) (Few-Shot) 2 Cloze (multi-choices) (One-Shot) 2 Cloze (multi-choices) (Zero-Shot) 2 CoLA 2 Code Classification 2 Code Comment Generation 2 Commonsense Knowledge Base Construction 2 Continual Pretraining 2 Conversational Response Generation 2 Conversational Search 2 Cross-Lingual Abstractive Summarization 2 Cross-Lingual Document Classification 2 Cross-Lingual Natural Language Inference 2 Cross-Lingual POS Tagging 2 Cross-Lingual Paraphrase Identification 2 Curved Text Detection 2 Dark Humor Detection 2 Deception Detection 2 Defect Detection 2 Dialog Act Classification 2 Dialog Relation Extraction 2 Distractor Generation 2 Document Layout Analysis 2 Document Text Classification 2 Document Translation 2 Domain Generalization 2 Duplicate-Question Retrieval 2 Dynamic Link Prediction 2 Elementary Mathematics 2 Embeddings Evaluation 2 Emotion Recognition in Context 2 Emotional Dialogue Acts 2 End-To-End Dialogue Modelling 2 Event Argument Extraction 2 Extractive Document Summarization 2 FLUE 2 Fact-based Text Editing 2 Factual Visual Question Answering 2 Feature Engineering 2 Fill Mask 2 Gender Prediction 2 Gesture Generation 2 Graph Generation 2 Handwriting Verification 2 Handwritten Digit Recognition 2 Image Manipulation 2 Imitation Learning 2 Implicit Discourse Relation Classification 2 Intent Recognition 2 Interpretable Machine Learning 2 Irony Identification 2 Irregular Text Recognition 2 Keyphrase Extraction 2 Knowledge Graph Embeddings 2 Layout-to-Image Generation 2 Lip to Speech Synthesis 2 Logical Reasoning Question Answering 2 Long Form Question Answering 2 MRPC 2 Max-Shot Cross-Lingual Visual Reasoning 2 Medical Code Prediction 2 Medical Diagnosis 2 Medical Report Generation 2 Model Compression 2 Morphological Analysis 2 Morphological Tagging 2 Mortality Prediction 2 Motion Synthesis 2 Multi-domain Dialogue State Tracking 2 Multilingual text classification 2 Multimodal Abstractive Text Summarization 2 Multimodal Text Prediction 2 Multiview Contextual Commonsense Inference 2 Music Generation 2 Native Language Identification 2 Natural Language Inference (Few-Shot) 2 Negation Detection 2 Network Embedding 2 Neural Architecture Search 2 New Product Sales Forecasting 2 News Annotation 2 Node Clustering 2 Object Recognition 2 Out-of-Distribution Detection 2 Paper generation (Conclusion-to-title) 2 Paper generation (Title-to-abstract) 2 Paper generation (abstract-to-conclusion) 2 Paraphrase Identification within Bi-Encoder 2 Partially Relevant Video Retrieval 2 Passage Re-Ranking 2 Phrase Ranking 2 Phrase Tagging 2 Point Processes 2 Prosody Prediction 2 QNLI 2 Quantization 2 Query-Based Extractive Summarization 2 Recipe Generation 2 Relation Linking 2 Rumour Detection 2 SQL Parsing 2 STS 2 Satire Detection 2 Scene Graph Detection 2 Scene Graph Generation 2 Science Question Answering 2 Scientific Concept Extraction 2 Scientific Results Extraction 2 Semantic Textual Similarity within Bi-Encoder 2 Semi Supervised Learning for Image Captioning 2 Semi-Supervised Text Classification 2 Sentence Fusion 2 Sentence-Embedding 2 Sequential Recommendation 2 Sign Language Production 2 Speaker Identification 2 Speaker Verification 2 Speech-to-Speech Translation 2 Speech-to-Text Translation 2 Spoken Dialogue Systems 2 Spoken language identification 2 Stock Market Prediction 2 Stock Prediction 2 Story Completion 2 Table-based Fact Verification 2 Talking Face Generation 2 Temporal Information Extraction 2 dialog state tracking 2 regression 2 slot-filling 2
Filter by Language
English 1299 Chinese 190 German 119 French 106 Spanish 85 Russian 84 Italian 54 Japanese 52 Arabic 51 Portuguese 51 Hindi 46 Korean 39 Turkish 36 Dutch 32 Czech 30 Vietnamese 29 Persian 28 Polish 28 Bengali 26 Danish 26 Tamil 26 Indonesian 25 Finnish 23 Romanian 23 Marathi 21 Multilingual 20 Telugu 20 Hungarian 19 Estonian 17 Swedish 17 Urdu 17 Greek 16 Thai 16 Bulgarian 15 Gujarati 15 Hebrew 15 Malayalam 15 Basque 13 Punjabi 13 Slovak 13 Swahili 13 Croatian 12 Ukrainian 12 Latvian 11 Norwegian 11 Slovenian 11 Amharic 10 Catalan 10 Kazakh 10 Lithuanian 10 Kannada 9 Mandarin Chinese 9 Serbian 9 Albanian 8 Armenian 8 Assamese 8 Irish 7 Oriya (macrolanguage) 7 Sanskrit 7 Sinhala 7 Welsh 7 Yoruba 7 Burmese 6 Hausa 6 Icelandic 6 Igbo 6 Macedonian 6 Maltese 6 Mongolian 6 Afrikaans 5 Azerbaijani 5 Georgian 5 Iranian Persian 5 Kurdish 5 Malay (individual language) 5 Norwegian Bokmål 5 Oromo 5 Sindhi 5 Somali 5 Uzbek 5 American Sign Language 4 Bambara 4 Belarusian 4 Breton 4 Egyptian Arabic 4 Filipino 4 Galician 4 Guarani 4 Haitian 4 Latin 4 Malagasy 4 Nigerian Pidgin 4 Norwegian Nynorsk 4 Odia 4 Scottish Gaelic 4 Tagalog 4 Tigrinya 4 Wolof 4 Central Khmer 3 Chechen 3 Esperanto 3 Fulah 3 Ganda 3 Iloko 3 Javanese 3 Kirghiz 3 Lao 3 Lingala 3 Nepali (macrolanguage) 3 Quechua 3 Serbo-Croatian 3 South Azerbaijani 3 Standard Arabic 3 Sundanese 3 Upper Sorbian 3 Aragonese 2 Bangala 2 Bashkir 2 Bavarian 2 Bhojpuri 2 Bishnupriya 2 Bosnian 2 Cebuano 2 Central Kurdish 2 Dhivehi 2 Erzya 2 Faroese 2 Goan Konkani 2 Jejueo 2 Kabyle 2 Kinyarwanda 2 Luo (Kenya and Tanzania) 2 Maithili 2 Malay (macrolanguage) 2 Modern Greek 2 Moroccan Arabic 2 Nepali (individual language) 2 Nyanja 2 Romansh 2 Russia Buriat 2 Swati 2 Tajik 2 Tatar 2 Tibetan 2 Tsonga 2 Tswana 2 Uighur 2 Waray (Philippines) 2 Western Panjabi 2 Xhosa 2 Yiddish 2 Yue Chinese 2 Akkadian 1 Akuntsu 1 Ancient Greek 1 Ancient Hebrew 1 Apurinã 1 Assyrian Neo-Aramaic 1 Asturian 1 Avaric 1 Aymara 1 Bemba (Zambia) 1 Central Bikol 1 Central Pashto 1 Chavacano 1 Chukot 1 Church Slavic 1 Chuvash 1 Congo Swahili 1 Coptic 1 Cornish 1 Dimli (individual language) 1 Dogri (macrolanguage) 1 Eastern Mari 1 Ewe 1 Fon 1 Geez 1 German Sign Language 1 Gothic 1 Gulf Arabic 1 Halh Mongolian 1 Ido 1 Interlingue 1 Inuktitut 1 Kabuverdianu 1 Kachin 1 Kalaallisut 1 Kalmyk 1 Karachay-Balkar 1 Karelian 1 Khunsari 1 Komi 1 Komi-Permyak 1 Komi-Zyrian 1 Krio 1 Lezghian 1 Limburgan 1 Literary Chinese 1 Livvi 1 Lojban 1 Lombard 1 Low German 1 Lower Sorbian 1 Lozi 1 Lunda 1 Luo (Cameroon) 1 Lushai 1 Luxembourgish 1 Manipuri 1 Manx 1 Maori 1 Mazanderani 1 Mbyá Guaraní 1 Mesopotamian Arabic 1 Minangkabau 1 Mingrelian 1 Mirandese 1 Moksha 1 Mundurukú 1 Najdi Arabic 1 Nayini 1 Neapolitan 1 Newari 1 Nigerian Fulfulde 1 North Azerbaijani 1 North Levantine Arabic 1 Northern Frisian 1 Northern Kurdish 1 Northern Luri 1 Northern Sami 1 Northern Uzbek 1 Occitan (post 1500) 1 Old French 1 Old Russian 1 Old Turkish 1 Ossetian 1 Pampanga 1 Pedi 1 Piemontese 1 Plateau Malagasy 1 Pushto 1 Rundi 1 Sardinian 1 Shan 1 Shona 1 Sicilian 1 Skolt Sami 1 Soi 1 South Levantine Arabic 1 Southern Pashto 1 Southern Sotho 1 Standard Latvian 1 Swedish Sign Language 1 Swiss German 1 Swiss-German Sign Language 1 Tonga (Zambia) 1 Tosk Albanian 1 Tupinambá 1 Turkmen 1 Tuvinian 1 Twi 1 Venetian 1 Volapük 1 Walloon 1 Warlpiri 1 West Central Oromo 1 Western Frisian 1 Western Mari 1 Wu Chinese 1 Yakut 1 Zulu 1 Abkhazian 0 Achinese 0 Adyghe 0 Afar 0 Akan 0 Argentine Sign Language 0 Arpitan 0 Bangladeshi Sign Language 0 Banjar 0 Bislama 0 Bodo (India) 0 Buginese 0 Chamorro 0 Cherokee 0 Cheyenne 0 Choctaw 0 Corsican 0 Cree 0 Creek 0 Crimean Tatar 0 Dogri (individual language) 0 Dzongkha 0 Extremaduran 0 Fiji Hindi 0 Fijian 0 Friulian 0 Gagauz 0 Gan Chinese 0 Gilaki 0 Greek Sign Language 0 Hakha Chin 0 Hakka Chinese 0 Hawaiian 0 Herero 0 Hiri Motu 0 Interlingua (International Auxiliary Language Association) 0 Inupiaq 0 Jamaican Creole English 0 Kabardian 0 Kanuri 0 Kara-Kalpak 0 Kashmiri 0 Kashubian 0 Kikuyu 0 Kongo 0 Kuanyama 0 Kölsch 0 Ladino 0 Lak 0 Latgalian 0 Ligurian 0 Marshallese 0 Min Dong Chinese 0 Modern Greek (1453-) 0 Narom 0 Nauru 0 Navajo 0 Naxi 0 Ndonga 0 Northern Huishui Hmong 0 Novial 0 Official Aramaic (700-300 BCE) 0 Old English (ca. 450-1100) 0 Pali 0 Pangasinan 0 Papiamento 0 Pennsylvania German 0 Pfaelzisch 0 Picard 0 Pitcairn-Norfolk 0 Pontic 0 Portuguse 0 Rajasthani 0 Rusyn 0 Saidi Arabic 0 Samoan 0 Sango 0 Santali 0 Saterfriesisch 0 Scots 0 Sichuan Yi 0 Silesian 0 Sranan Tongo 0 Swahili (macrolanguage) 0 Tahitian 0 Tai 0 Tetum 0 Tok Pisin 0 Tonga (Tonga Islands) 0 Tulu 0 Tumbuka 0 Tunisian Arabic 0 Turkish Sign Language 0 Udmurt 0 Venda 0 Veps 0 Vlaams 0 Vlax Romani 0 Votic 0 Zaza 0 Zeeuws 0 Zhuang 0

2313 dataset results for Texts