Datasets

7,707 machine learning datasets
Filter by Task
Question Answering 256 Language Modelling 109 Text Generation 101 Text Classification 87 Named Entity Recognition 79 Reading Comprehension 76 Visual Question Answering 67 Text Summarization 63 Natural Language Inference 62 Sentiment Analysis 54 Information Retrieval 52 Machine Translation 52 Natural Language Understanding 49 Relation Extraction 44 Common Sense Reasoning 36 Coreference Resolution 32 Machine Reading Comprehension 32 Word Embeddings 29 Abstractive Text Summarization 28 Data Augmentation 28 Entity Linking 28 Image Captioning 26 Semantic Parsing 26 Document Summarization 24 Hate Speech Detection 24 Misinformation 23 Dialogue Generation 22 Open-Domain Question Answering 21 Code Generation 20 Fake News Detection 19 Video Question Answering 19 Data-to-Text Generation 18 Part-Of-Speech Tagging 18 Question Generation 18 Sequence-to-sequence Language Modeling 18 Zero-Shot Learning 18 Domain Adaptation 17 Knowledge Graphs 17 Speech Recognition 17 Token Classification 17 Classification 16 Emotion Recognition 16 Semantic Textual Similarity 16 Stance Detection 16 Task-Oriented Dialogue Systems 16 Video Retrieval 16 Visual Reasoning 16 Multi-Task Learning 15 Paraphrase Identification 15 Relation Classification 15 Video Captioning 15 Image Classification 14 Slot Filling 14 Handwriting Recognition 13 Recommendation Systems 13 Zero-shot Text Search 13 Cross-Lingual Transfer 12 Decision Making 12 Image Retrieval 12 Word Sense Disambiguation 12 Event Extraction 11 Grammatical Error Correction 11 Link Prediction 11 Multi-Document Summarization 11 Paraphrase Generation 11 Sarcasm Detection 11 Text Simplification 11 Translation 11 Conversational Response Selection 10 Dependency Parsing 10 Emotion Classification 10 Emotion Recognition in Conversation 10 Fact Verification 10 Few-Shot Learning 10 Intent Detection 10 Language Identification 10 Multi-Label Classification 10 Optical Character Recognition 10 Sentence Classification 10 Video Understanding 10 Dialogue State Tracking 9 Multi-Label Text Classification 9 Nested Named Entity Recognition 9 Object Detection 9 Open-Domain Dialog 9 Semantic Segmentation 9 Vision and Language Navigation 9 Visual Dialog 9 Abusive Language 8 Answer Selection 8 Aspect-Based Sentiment Analysis 8 Automatic Post-Editing 8 Document Classification 8 Entity Disambiguation 8 Entity Typing 8 Handwriting generation 8 Intent Classification 8 Referring Expression Segmentation 8 Semantic Similarity 8 Sentence Embeddings 8 Text-To-Speech Synthesis 8 Code Search 7 Cross-Modal Retrieval 7 Extreme Summarization 7 Joint Entity and Relation Extraction 7 Knowledge Base Question Answering 7 Math Word Problem Solving 7 NER 7 Node Classification 7 Open Information Extraction 7 Passage Retrieval 7 Retrieval 7 Semantic Role Labeling 7 Speech Synthesis 7 Spoken Language Understanding 7 Temporal Tagging 7 Text-To-Sql 7 Ad-hoc video search 6 Community Question Answering 6 Conversational Question Answering 6 Dialogue Understanding 6 Fact Checking 6 Fairness 6 Handwritten Text Recognition 6 Image Generation 6 Mathematical Question Answering 6 Medical Named Entity Recognition 6 News Classification 6 Opinion Mining 6 Phrase Grounding 6 Scene Text Detection 6 Table-to-Text Generation 6 Topic Classification 6 Vision-Language Navigation 6 AMR Parsing 5 AMR-to-Text Generation 5 Automatic Speech Recognition 5 Chinese Reading Comprehension 5 Cross-Lingual Question Answering 5 Discourse Parsing 5 Event Detection 5 Extractive Text Summarization 5 Goal-Oriented Dialog 5 Learning-To-Rank 5 Mathematical Reasoning 5 Medical Relation Extraction 5 Multimodal Sentiment Analysis 5 Open Intent Discovery 5 Response Generation 5 SSTOD 5 Scene Text Recognition 5 Scientific Document Summarization 5 Self-Supervised Learning 5 Stochastic Optimization 5 Text-to-Image Generation 5 Twitter Sentiment Analysis 5 Visual Navigation 5 Zero-Shot Video Retrieval 5 Abuse Detection 4 Anomaly Detection 4 Argument Mining 4 Audio to Text Retrieval 4 Automated Theorem Proving 4 Biomedical Information Retrieval 4 Chart Question Answering 4 Chinese Named Entity Recognition 4 Citation Recommendation 4 Code Summarization 4 Constituency Parsing 4 Dialogue Act Classification 4 Dialogue Evaluation 4 Discourse Segmentation 4 Document Ranking 4 Entity Resolution 4 Explanation Generation 4 Few-Shot Relation Classification 4 Image-to-Text Retrieval 4 KG-to-Text Generation 4 Language Acquisition 4 Lipreading 4 Low Resource Named Entity Recognition 4 Medical Concept Normalization 4 Moment Retrieval 4 Multi-class Classification 4 Natural Language Visual Grounding 4 Nested Mention Recognition 4 Paper generation 4 Pretrained Language Models 4 Relational Reasoning 4 Sentence Embedding 4 Sign Language Recognition 4 Sign Language Translation 4 Speech Emotion Recognition 4 Spelling Correction 4 Story Generation 4 Style Transfer 4 Systematic Generalization 4 Table Detection 4 Term Extraction 4 Text Matching 4 Text Style Transfer 4 Text to Audio Retrieval 4 Topic Models 4 Translation deu-eng 4 Translation eng-deu 4 Visual Commonsense Reasoning 4 Visual Relationship Detection 4 Weakly-Supervised Named Entity Recognition 4 Zero-Shot Cross-Lingual Transfer 4 Abstractive Dialogue Summarization 3 Action Recognition 3 Active Learning 3 Aspect Term Extraction and Sentiment Classification 3 Binary Relation Extraction 3 Chatbot 3 Chunking 3 Code Documentation Generation 3 Continual Learning 3 Cross Document Coreference Resolution 3 Definition Extraction 3 Dense Video Captioning 3 Dialect Identification 3 Entity Retrieval 3 Event Coreference Resolution 3 Explainable artificial intelligence 3 FG-1-PG-1 3 Few-Shot NLI 3 Few-shot NER 3 Gender Bias Detection 3 Generative Question Answering 3 Genre classification 3 Goal-Oriented Dialogue Systems 3 Grammatical Error Detection 3 Graph Classification 3 Graph Embedding 3 Humor Detection 3 Instance Segmentation 3 Keyword Extraction 3 Lexical Entailment 3 Linguistic Acceptability 3 Lip Reading 3 Long-range modeling 3 Low-Resource Neural Machine Translation 3 Masked Language Modeling 3 Memorization 3 Meta-Learning 3 Multi Label Text Classification 3 Multi-Hop Reading Comprehension 3 Multi-modal Dialogue Generation 3 Multimodal Emotion Recognition 3 Multimodal Machine Translation 3 Multiple Choice Question Answering (MCQA) 3 Multiple Instance Learning 3 Natural Language Moment Retrieval 3 News Summarization 3 Person Re-Identification 3 Person Search 3 Product Recommendation 3 Program Repair 3 RTE 3 Reading Comprehension (Few-Shot) 3 Reading Comprehension (One-Shot) 3 Reading Comprehension (Zero-Shot) 3 Recognizing Emotion Cause in Conversations 3 Referring Expression Comprehension 3 Review Generation 3 Semantic Image-Text Similarity 3 Source Code Summarization 3 Stance Classification 3 Structured Prediction 3 Text Categorization 3 Text Clustering 3 Text based Person Retrieval 3 Text-to-Code Generation 3 Toxic Comment Classification 3 Transfer Learning 3 Transliteration 3 Unconstrained Lip-synchronization 3 Unsupervised Machine Translation 3 Unsupervised Text Classification 3 Video Description 3 Video Generation 3 Video-Text Retrieval 3 Visual Entailment 3 Visual Speech Recognition 3 Weakly Supervised Classification 3 Word Alignment 3 text similarity 3 2D Semantic Segmentation 2 3D Action Recognition 2 AbbreviationDetection 2 Adversarial Attack 2 Answer Generation 2 Arabic Sentiment Analysis 2 Argument Retrieval 2 Arithmetic Reasoning 2 Art Analysis 2 Aspect Category Detection 2 Aspect Category Polarity 2 Aspect Extraction 2 Aspect-Category-Opinion-Sentiment Quadruple Extraction 2 Astronomy 2 Audio captioning 2 Audio-Visual Speech Recognition 2 Autonomous Driving 2 Bayesian Inference 2 Bias Detection 2 Chinese Sentence Pair Classification 2 Chinese Word Segmentation 2 Citation Intent Classification 2 Citation Prediction 2 Click-Through Rate Prediction 2 Cloze (multi-choices) (Few-Shot) 2 Cloze (multi-choices) (One-Shot) 2 Cloze (multi-choices) (Zero-Shot) 2 CoLA 2 Code Comment Generation 2 Code Completion 2 Code Repair 2 Code Translation 2 Commonsense Knowledge Base Construction 2 Conditional Text Generation 2 Conversational Response Generation 2 Cross-Lingual Document Classification 2 Cross-Lingual NER 2 Cross-Lingual Natural Language Inference 2 Cross-Lingual POS Tagging 2 Cross-Lingual Paraphrase Identification 2 Dark Humor Detection 2 Deception Detection 2 Defect Detection 2 Depression Detection 2 Dialog Act Classification 2 Dialog Relation Extraction 2 Distractor Generation 2 Document Text Classification 2 Document Translation 2 Domain Generalization 2 Duplicate-Question Retrieval 2 Dynamic Link Prediction 2 Elementary Mathematics 2 Emotion Recognition in Context 2 Emotional Dialogue Acts 2 End-To-End Dialogue Modelling 2 Entity Alignment 2 Event Argument Extraction 2 Extractive Document Summarization 2 FLUE 2 Fact-based Text Editing 2 Factual Visual Question Answering 2 Feature Engineering 2 Few-Shot Image Classification 2 Fill Mask 2 Gender Prediction 2 Gesture Generation 2 Graph Generation 2 Handwriting Verification 2 Handwritten Digit Recognition 2 Highlight Detection 2 Image Manipulation 2 Imitation Learning 2 Intent Recognition 2 Interpretable Machine Learning 2 Irony Identification 2 Joint Event and Temporal Relation Extraction 2 Keyphrase Extraction 2 Knowledge Graph Embeddings 2 Lemmatization 2 Lip to Speech Synthesis 2 Logical Reasoning Question Answering 2 Long Form Question Answering 2 MRPC 2 Max-Shot Cross-Lingual Visual Reasoning 2 Meeting Summarization 2 Meme Classification 2 Morphological Analysis 2 Mortality Prediction 2 Motion Synthesis 2 Multi-domain Dialogue State Tracking 2 Multi-hop Question Answering 2 Multilingual NLP 2 Multimodal Deep Learning 2 Multiple-choice 2 Multiview Contextual Commonsense Inference 2 Music Generation 2 Native Language Identification 2 Natural Language Inference (Few-Shot) 2 Natural Questions 2 Navigate 2 Negation Detection 2 Network Embedding 2 Neural Architecture Search 2 New Product Sales Forecasting 2 News Annotation 2 News Generation 2 Node Clustering 2 Object Recognition 2 Out-of-Distribution Detection 2 Paper generation (Conclusion-to-title) 2 Paper generation (Title-to-abstract) 2 Paper generation (abstract-to-conclusion) 2 Paraphrase Identification within Bi-Encoder 2 Partially Relevant Video Retrieval 2 Passage Re-Ranking 2 Phrase Ranking 2 Phrase Tagging 2 Point Processes 2 Program Synthesis 2 Prosody Prediction 2 QNLI 2 Quantization 2 Query-Based Extractive Summarization 2 Recipe Generation 2 Referring Expression 2 Representation Learning 2 SQL Parsing 2 STS 2 Scene Graph Detection 2 Scene Graph Generation 2 Science Question Answering 2 Scientific Concept Extraction 2 Scientific Results Extraction 2 Semantic Textual Similarity within Bi-Encoder 2 Semi Supervised Learning for Image Captioning 2 Semi-Supervised Text Classification 2 Sentence Fusion 2 Sentence-Embedding 2 Sequential Recommendation 2 Sign Language Production 2 Speaker Diarization 2 Speaker Identification 2 Speaker Verification 2 Speech-to-Text Translation 2 Spoken Dialogue Systems 2 Spoken language identification 2 Stock Market Prediction 2 Stock Prediction 2 Supervised Video Summarization 2 Table-based Fact Verification 2 Talking Face Generation 2 Temporal Action Localization 2 Temporal Information Extraction 2 Temporal Relation Classification 2 Temporal Relation Extraction 2 Text Reranking 2 Text Retrieval 2 Text Segmentation 2 Time Series Forecasting 2 Timex normalization 2 Tweet Retrieval 2 Twitter Event Detection 2 Unsupervised Domain Adaptation 2 Unsupervised Extractive Summarization 2 Unsupervised KG-to-Text Generation 2 Unsupervised semantic parsing 2 ValNov 2 Variational Inference 2 Video Summarization 2 Visual Grounding 2 Visual Keyword Spotting 2 Visual Storytelling 2 Weather Forecasting 2 Zero-Shot Cross-Lingual Visual Reasoning 2 Zero-shot Relation Classification 2 Zero-shot Relation Triplet Extraction 2 Zero-shot Slot Filling 2 dialog state tracking 2 regression 2 text-based games 2 text-classification 2 2D object detection 1 3D Anomaly Detection 1 3D Human Reconstruction 1 3D Human Shape Estimation 1 3D Reconstruction 1 3D Shape Reconstruction 1 3D dense captioning 1 4-ary Relation Extraction 1 Abstract Algebra 1 Accented Speech Recognition 1 Action Anticipation 1 Action Classification 1 Action Detection 1 Action Quality Assessment 1 Action Recognition In Videos 1 Action Segmentation 1 Action Understanding 1 Actionable Phrase Detection 1 Ad-Hoc Information Retrieval 1 Adversarial Robustness 1 Aesthetic Image Captioning 1 Aesthetics Quality Assessment 1 Age And Gender Classification 1 Aggression Identification 1 Anachronisms 1 Analogical Similarity 1 Analytic Entailment 1 Anatomy 1 Anchor link prediction 1
Filter by Language
English 1137 Chinese 164 German 106 French 93 Spanish 78 Russian 67 Italian 50 Portuguese 48 Japanese 46 Arabic 43 Hindi 40 Korean 36 Turkish 32 Dutch 29 Czech 28 Polish 25 Danish 24 Persian 24 Tamil 24 Vietnamese 23 Bengali 22 Indonesian 21 Romanian 21 Finnish 20 Multilingual 19 Marathi 18 Telugu 17 Estonian 16 Hungarian 15 Greek 14 Gujarati 14 Malayalam 14 Swedish 14 Bulgarian 13 Hebrew 13 Thai 13 Urdu 13 Punjabi 12 Croatian 11 Slovak 11 Basque 10 Latvian 10 Slovenian 10 Swahili 10 Amharic 9 Kannada 9 Kazakh 9 Lithuanian 9 Norwegian 9 Ukrainian 9 Catalan 8 Mandarin Chinese 8 Serbian 8 Albanian 7 Armenian 7 Assamese 7 Irish 7 Oriya (macrolanguage) 7 Welsh 7 Sanskrit 6 Sinhala 6 Yoruba 6 Icelandic 5 Macedonian 5 Maltese 5 Mongolian 5 Afrikaans 4 Azerbaijani 4 Belarusian 4 Breton 4 Burmese 4 Georgian 4 Hausa 4 Igbo 4 Kurdish 4 Latin 4 Scottish Gaelic 4 Sindhi 4 Uzbek 4 American Sign Language 3 Chechen 3 Egyptian Arabic 3 Filipino 3 Galician 3 Haitian 3 Iranian Persian 3 Malagasy 3 Malay (individual language) 3 Nigerian Pidgin 3 Norwegian Nynorsk 3 Somali 3 Tagalog 3 Upper Sorbian 3 Wolof 3 Aragonese 2 Bambara 2 Bangala 2 Bashkir 2 Bavarian 2 Bishnupriya 2 Bosnian 2 Central Khmer 2 Erzya 2 Esperanto 2 Faroese 2 Guarani 2 Javanese 2 Jejueo 2 Kirghiz 2 Lao 2 Modern Greek 2 Nepali (macrolanguage) 2 Norwegian Bokmål 2 Odia 2 Oromo 2 Quechua 2 Romansh 2 Russia Buriat 2 Serbo-Croatian 2 South Azerbaijani 2 Standard Arabic 2 Sundanese 2 Tatar 2 Uighur 2 Western Panjabi 2 Yiddish 2 Yue Chinese 2 Akkadian 1 Akuntsu 1 Ancient Greek 1 Ancient Hebrew 1 Apurinã 1 Assyrian Neo-Aramaic 1 Asturian 1 Avaric 1 Bhojpuri 1 Cebuano 1 Central Bikol 1 Central Kurdish 1 Central Pashto 1 Chavacano 1 Chukot 1 Church Slavic 1 Chuvash 1 Congo Swahili 1 Coptic 1 Cornish 1 Dhivehi 1 Dimli (individual language) 1 Eastern Mari 1 Fon 1 Fulah 1 Ganda 1 Geez 1 German Sign Language 1 Goan Konkani 1 Gothic 1 Ido 1 Iloko 1 Interlingue 1 Inuktitut 1 Kalmyk 1 Karachay-Balkar 1 Karelian 1 Khunsari 1 Kinyarwanda 1 Komi 1 Komi-Permyak 1 Komi-Zyrian 1 Lezghian 1 Limburgan 1 Lingala 1 Literary Chinese 1 Livvi 1 Lojban 1 Lombard 1 Low German 1 Lower Sorbian 1 Luo (Cameroon) 1 Luo (Kenya and Tanzania) 1 Luxembourgish 1 Maithili 1 Malay (macrolanguage) 1 Manipuri 1 Manx 1 Mazanderani 1 Mbyá Guaraní 1 Minangkabau 1 Mingrelian 1 Mirandese 1 Moksha 1 Moroccan Arabic 1 Mundurukú 1 Nayini 1 Neapolitan 1 Nepali (individual language) 1 Newari 1 Northern Frisian 1 Northern Kurdish 1 Northern Luri 1 Northern Sami 1 Occitan (post 1500) 1 Old French 1 Old Russian 1 Old Turkish 1 Ossetian 1 Pampanga 1 Piemontese 1 Pushto 1 Sardinian 1 Sicilian 1 Skolt Sami 1 Soi 1 South Levantine Arabic 1 Swati 1 Swedish Sign Language 1 Swiss German 1 Swiss-German Sign Language 1 Tajik 1 Tibetan 1 Tigrinya 1 Tswana 1 Tupinambá 1 Turkmen 1 Tuvinian 1 Venetian 1 Volapük 1 Walloon 1 Waray (Philippines) 1 Warlpiri 1 Western Frisian 1 Western Mari 1 Wu Chinese 1 Xhosa 1 Yakut 1 Abkhazian 0 Achinese 0 Adyghe 0 Afar 0 Akan 0 Argentine Sign Language 0 Arpitan 0 Aymara 0 Bangladeshi Sign Language 0 Banjar 0 Bislama 0 Bodo (India) 0 Buginese 0 Chamorro 0 Cherokee 0 Cheyenne 0 Choctaw 0 Corsican 0 Cree 0 Creek 0 Crimean Tatar 0 Dzongkha 0 Ewe 0 Extremaduran 0 Fiji Hindi 0 Fijian 0 Friulian 0 Gagauz 0 Gan Chinese 0 Gilaki 0 Greek Sign Language 0 Gulf Arabic 0 Hakha Chin 0 Hakka Chinese 0 Hawaiian 0 Herero 0 Hiri Motu 0 Interlingua (International Auxiliary Language Association) 0 Inupiaq 0 Jamaican Creole English 0 Kabardian 0 Kabyle 0 Kalaallisut 0 Kanuri 0 Kara-Kalpak 0 Kashmiri 0 Kashubian 0 Kikuyu 0 Kongo 0 Kuanyama 0 Kölsch 0 Ladino 0 Lak 0 Latgalian 0 Ligurian 0 Maori 0 Marshallese 0 Min Dong Chinese 0 Modern Greek (1453-) 0 Narom 0 Nauru 0 Navajo 0 Naxi 0 Ndonga 0 Northern Huishui Hmong 0 Novial 0 Nyanja 0 Official Aramaic (700-300 BCE) 0 Old English (ca. 450-1100) 0 Pali 0 Pangasinan 0 Papiamento 0 Pedi 0 Pennsylvania German 0 Pfaelzisch 0 Picard 0 Pitcairn-Norfolk 0 Pontic 0 Portuguse 0 Rajasthani 0 Rundi 0 Rusyn 0 Saidi Arabic 0 Samoan 0 Sango 0 Santali 0 Saterfriesisch 0 Scots 0 Shona 0 Sichuan Yi 0 Silesian 0 Southern Sotho 0 Sranan Tongo 0 Swahili (macrolanguage) 0 Tahitian 0 Tai 0 Tetum 0 Tok Pisin 0 Tonga (Tonga Islands) 0 Tosk Albanian 0 Tsonga 0 Tulu 0 Tumbuka 0 Tunisian Arabic 0 Turkish Sign Language 0 Twi 0 Udmurt 0 Venda 0 Veps 0 Vlaams 0 Vlax Romani 0 Votic 0 Zaza 0 Zeeuws 0 Zhuang 0 Zulu 0

2062 dataset results for Texts