Datasets

6,712 machine learning datasets
Filter by Task
Question Answering 228 Language Modelling 99 Text Generation 80 Text Classification 72 Reading Comprehension 71 Named Entity Recognition 68 Visual Question Answering 60 Natural Language Inference 57 Text Summarization 47 Sentiment Analysis 46 Machine Translation 43 Information Retrieval 42 Natural Language Understanding 42 Relation Extraction 37 Common Sense Reasoning 32 Coreference Resolution 29 Machine Reading Comprehension 29 Word Embeddings 28 Data Augmentation 27 Abstractive Text Summarization 25 Image Captioning 25 Semantic Parsing 25 Entity Linking 24 Document Summarization 22 Misinformation 22 Fake News Detection 19 Code Generation 18 Dialogue Generation 18 Hate Speech Detection 18 Part-Of-Speech Tagging 18 Video Question Answering 17 Data-to-Text Generation 16 Knowledge Graphs 16 Open-Domain Question Answering 16 Token Classification 16 Paraphrase Identification 15 Semantic Textual Similarity 15 Stance Detection 15 Task-Oriented Dialogue Systems 15 Video Captioning 15 Video Retrieval 15 Domain Adaptation 14 Question Generation 14 Sequence-to-sequence Language Modeling 14 Visual Reasoning 14 Handwriting Recognition 13 Image Classification 13 Multi-Task Learning 13 Recommendation Systems 13 Relation Classification 13 Slot Filling 13 Cross-Lingual Transfer 12 Decision Making 12 Speech Recognition 12 Word Sense Disambiguation 12 Multi-Document Summarization 11 Zero-Shot Learning 11 Conversational Response Selection 10 Dependency Parsing 10 Emotion Recognition 10 Fact Verification 10 Language Identification 10 Link Prediction 10 Paraphrase Generation 10 Sarcasm Detection 10 Intent Detection 9 Optical Character Recognition 9 Vision and Language Navigation 9 Visual Dialog 9 Aspect-Based Sentiment Analysis 8 Automatic Post-Editing 8 Dialogue State Tracking 8 Document Classification 8 Emotion Recognition in Conversation 8 Few-Shot Learning 8 Handwriting generation 8 Image Retrieval 8 Multi-Label Classification 8 Multi-Label Text Classification 8 Nested Named Entity Recognition 8 Object Detection 8 Referring Expression Segmentation 8 Semantic Similarity 8 Sentence Embeddings 8 Text Simplification 8 Cross-Modal Retrieval 7 Emotion Classification 7 Entity Disambiguation 7 Entity Typing 7 Event Extraction 7 Grammatical Error Correction 7 Math Word Problem Solving 7 Node Classification 7 Open Information Extraction 7 Open-Domain Dialog 7 Sentence Classification 7 Speech Synthesis 7 Text-To-Speech Synthesis 7 Video Understanding 7 Abusive Language 6 Ad-hoc video search 6 Answer Selection 6 Code Search 6 Community Question Answering 6 Dialogue Understanding 6 Extreme Summarization 6 Intent Classification 6 Knowledge Base Question Answering 6 Mathematical Question Answering 6 Scene Text Detection 6 Semantic Role Labeling 6 Semantic Segmentation 6 Spoken Language Understanding 6 Table-to-Text Generation 6 Translation 6 AMR Parsing 5 AMR-to-Text Generation 5 Chinese Reading Comprehension 5 Conversational Question Answering 5 Cross-Lingual Question Answering 5 Extractive Text Summarization 5 Goal-Oriented Dialog 5 Handwritten Text Recognition 5 Joint Entity and Relation Extraction 5 Learning-To-Rank 5 Medical Named Entity Recognition 5 Medical Relation Extraction 5 Multimodal Sentiment Analysis 5 NER 5 Open Intent Discovery 5 Opinion Mining 5 Passage Retrieval 5 SSTOD 5 Scene Text Recognition 5 Scientific Document Summarization 5 Stochastic Optimization 5 Text-To-Sql 5 Vision-Language Navigation 5 Visual Navigation 5 Action Recognition 4 Anomaly Detection 4 Automated Theorem Proving 4 Chart Question Answering 4 Chinese Named Entity Recognition 4 Citation Recommendation 4 Code Summarization 4 Constituency Parsing 4 Discourse Parsing 4 Discourse Segmentation 4 Document Ranking 4 Fairness 4 Few-Shot Relation Classification 4 Image Generation 4 Language Acquisition 4 Lipreading 4 Low Resource Named Entity Recognition 4 Natural Language Visual Grounding 4 Nested Mention Recognition 4 News Classification 4 Phrase Grounding 4 Relational Reasoning 4 Self-Supervised Learning 4 Sentence Embedding 4 Systematic Generalization 4 Table Detection 4 Text-Image Retrieval 4 Topic Models 4 Translation deu-eng 4 Translation eng-deu 4 Weakly-Supervised Named Entity Recognition 4 Zero-Shot Cross-Lingual Transfer 4 Abuse Detection 3 Active Learning 3 Aspect Term Extraction and Sentiment Classification 3 Biomedical Information Retrieval 3 Chatbot 3 Classification 3 Code Documentation Generation 3 Dense Video Captioning 3 Dialogue Act Classification 3 Entity Retrieval 3 Explainable artificial intelligence 3 Fact Checking 3 Few-Shot NLI 3 Few-shot NER 3 Goal-Oriented Dialogue Systems 3 Graph Classification 3 Graph Embedding 3 Image-to-Text Retrieval 3 KG-to-Text Generation 3 Lexical Entailment 3 Lip Reading 3 Long-range modeling 3 Low-Resource Neural Machine Translation 3 Moment Retrieval 3 Multi Label Text Classification 3 Multimodal Emotion Recognition 3 Multimodal Machine Translation 3 Multiple Instance Learning 3 Natural Language Moment Retrieval 3 Paper generation 3 Person Re-Identification 3 Person Search 3 Product Recommendation 3 Program Repair 3 Reading Comprehension (Few-Shot) 3 Reading Comprehension (One-Shot) 3 Reading Comprehension (Zero-Shot) 3 Recognizing Emotion Cause in Conversations 3 Referring Expression Comprehension 3 Sign Language Recognition 3 Sign Language Translation 3 Source Code Summarization 3 Stance Classification 3 Story Generation 3 Structured Prediction 3 Style Transfer 3 Term Extraction 3 Text Categorization 3 Text Matching 3 Text Style Transfer 3 Text based Person Retrieval 3 Text-to-Code Generation 3 Text-to-Image Retrieval 3 Topic Classification 3 Unconstrained Lip-synchronization 3 Unsupervised Machine Translation 3 Video Description 3 Visual Commonsense Reasoning 3 Visual Relationship Detection 3 Visual Speech Recognition 3 Weakly Supervised Classification 3 Word Alignment 3 text similarity 3 AbbreviationDetection 2 Action Classification 2 Arabic Sentiment Analysis 2 Argument Mining 2 Art Analysis 2 Aspect Category Detection 2 Aspect Category Polarity 2 Aspect Extraction 2 Aspect-Category-Opinion-Sentiment Quadruple Extraction 2 Audio to Text Retrieval 2 Audio-Visual Speech Recognition 2 Bayesian Inference 2 Chinese Sentence Pair Classification 2 Chinese Word Segmentation 2 Chunking 2 Citation Intent Classification 2 Citation Prediction 2 Cloze (multi-choices) (Few-Shot) 2 Cloze (multi-choices) (One-Shot) 2 Cloze (multi-choices) (Zero-Shot) 2 Code Comment Generation 2 Code Completion 2 Code Repair 2 Code Translation 2 Commonsense Knowledge Base Construction 2 Continual Learning 2 Conversational Response Generation 2 Cross Document Coreference Resolution 2 Cross-Lingual Document Classification 2 Cross-Lingual NER 2 Cross-Lingual Natural Language Inference 2 Cross-Lingual POS Tagging 2 Cross-Lingual Paraphrase Identification 2 Deception Detection 2 Depression Detection 2 Dialect Identification 2 Dialog Act Classification 2 Dialog Relation Extraction 2 Dialogue Evaluation 2 Distractor Generation 2 Document Text Classification 2 Duplicate-Question Retrieval 2 Dynamic Link Prediction 2 Emotional Dialogue Acts 2 End-To-End Dialogue Modelling 2 Entity Resolution 2 Event Argument Extraction 2 Event Coreference Resolution 2 Extractive Document Summarization 2 Fact-based Text Editing 2 Factual Visual Question Answering 2 Feature Engineering 2 Few-Shot Image Classification 2 Gender Bias Detection 2 Gender Prediction 2 Generative Question Answering 2 Genre classification 2 Grammatical Error Detection 2 Graph Generation 2 Handwriting Verification 2 Handwritten Digit Recognition 2 Highlight Detection 2 Humor Detection 2 Image Manipulation 2 Imitation Learning 2 Interpretable Machine Learning 2 Keyphrase Extraction 2 Keyword Extraction 2 Knowledge Graph Embeddings 2 Lemmatization 2 Linguistic Acceptability 2 Lip to Speech Synthesis 2 Logical Reasoning Question Answering 2 Masked Language Modeling 2 Max-Shot Cross-Lingual Visual Reasoning 2 Meeting Summarization 2 Meta-Learning 2 Morphological Analysis 2 Multi-Hop Reading Comprehension 2 Multi-class Classification 2 Multi-domain Dialogue State Tracking 2 Multi-hop Question Answering 2 Multi-modal Dialogue Generation 2 Native Language Identification 2 Negation Detection 2 Network Embedding 2 Neural Architecture Search 2 New Product Sales Forecasting 2 News Annotation 2 News Generation 2 Node Clustering 2 Object Recognition 2 Out-of-Distribution Detection 2 Paper generation (Conclusion-to-title) 2 Paper generation (Title-to-abstract) 2 Paper generation (abstract-to-conclusion) 2 Paraphrase Identification within Bi-Encoder 2 Passage Re-Ranking 2 Phrase Ranking 2 Phrase Tagging 2 Point Processes 2 Pretrained Language Models 2 Program Synthesis 2 Prosody Prediction 2 Quantization 2 Recipe Generation 2 Representation Learning 2 Response Generation 2 SQL Parsing 2 Scene Graph Detection 2 Scene Graph Generation 2 Scientific Concept Extraction 2 Scientific Results Extraction 2 Semantic Image-Text Similarity 2 Semantic Textual Similarity within Bi-Encoder 2 Semi-Supervised Text Classification 2 Sentence Fusion 2 Sign Language Production 2 Speaker Verification 2 Speech Emotion Recognition 2 Speech-to-Text Translation 2 Spelling Correction 2 Spoken Dialogue Systems 2 Stock Market Prediction 2 Stock Prediction 2 Supervised Video Summarization 2 Table-based Fact Verification 2 Talking Face Generation 2 Temporal Action Localization 2 Temporal Information Extraction 2 Text Clustering 2 Text Segmentation 2 Text to Audio Retrieval 2 Text-to-Image Generation 2 Time Series Forecasting 2 Timex normalization 2 Transliteration 2 Tweet Retrieval 2 Twitter Sentiment Analysis 2 Unsupervised Extractive Summarization 2 Unsupervised KG-to-Text Generation 2 Unsupervised semantic parsing 2 Variational Inference 2 Video Summarization 2 Video-Text Retrieval 2 Visual Entailment 2 Visual Keyword Spotting 2 Visual Storytelling 2 Weather Forecasting 2 Zero-Shot Cross-Lingual Visual Reasoning 2 Zero-shot Relation Classification 2 Zero-shot Relation Triplet Extraction 2 2D Semantic Segmentation 1 3D Action Recognition 1 3D Anomaly Detection 1 3D Shape Reconstruction 1 3D dense captioning 1 4-ary Relation Extraction 1 Abstractive Dialogue Summarization 1 Accented Speech Recognition 1 Action Anticipation 1 Action Detection 1 Action Quality Assessment 1 Action Recognition In Videos 1 Action Segmentation 1 Action Understanding 1 Adversarial Attack 1 Adversarial Robustness 1 Aesthetics Quality Assessment 1 Age And Gender Classification 1 Aggression Identification 1 Anchor link prediction 1 Annotated Code Search 1 Answer Generation 1 Arabic Text Diacritization 1 Argument Pair Extraction (APE) 1 Argument Retrieval 1 Arithmetic Reasoning 1 Aspect Sentiment Triplet Extraction 1 Aspect-oriented Opinion Extraction 1 Audio Super-Resolution 1 Author Attribution 1 Authorship Verification 1 AutoML 1 Automated Essay Scoring 1 Automatic Speech Recognition 1 Autonomous Driving 1 Behavioural cloning 1 Bias Detection 1 Bidirectional Relationship Classification 1 Binary Relation Extraction 1 Blackout Poetry Generation 1 Breast Tumour Classification 1 Bridging Anaphora Resolution 1 COVID-19 Diagnosis 1 COVID-19 Tracking 1 Causal Discovery 1 Causal Emotion Entailment 1 Causal Identification 1 Chemical Indexing 1 Claim Extraction with Stance Classification (CESC) 1 Claim Verification 1 Claim-Evidence Pair Extraction (CEPE) 1 Click-Through Rate Prediction 1 Clinical Assertion Status Detection 1 Clinical Concept Extraction 1 Clinical Note Phenotyping 1 Clone Detection 1 Cloze Test 1 Clustering Algorithms Evaluation 1 Code Classification 1 CodeSearchNet - Java 1 Combinatorial Optimization 1 Common Sense Reasoning (Few-Shot) 1 Common Sense Reasoning (One-Shot) 1 Common Sense Reasoning (Zero-Shot) 1 Community Detection 1 Complex Word Identification 1 Component Classification 1 Compositional Zero-Shot Learning 1 Computational Phenotyping 1 Computed Tomography (CT) 1 Concept-To-Text Generation 1 Conditional Text Generation 1 Constituency Grammar Induction 1 Context Query Reformulation 1 Context-specific Spam Detection 1 Contextual Embedding for Source Code 1 Continuous Control 1 Conversation Disentanglement 1 Conversational Search 1 Counterfactual Explanation 1 Croatian Text Diacritization 1 Cross-Document Language Modeling 1 Cross-Domain Named Entity Recognition 1 Cross-Lingual Abstractive Summarization 1 Cross-Lingual Bitext Mining 1 Cross-Lingual Entity Linking 1 Cross-Lingual Semantic Textual Similarity 1 Cross-Lingual Sentiment Classification 1 Cross-Modal Person Re-Identification 1 Cross-lingual zero-shot dependency parsing 1 Curved Text Detection 1 Czech Text Diacritization 1 De-identification 1 Deblurring 1 Decipherment 1 Defect Detection 1 Definition Extraction 1 Dialogue Management 1 Dialogue Rewriting 1 Disaster Response 1 Distant Speech Recognition 1 Document Embedding 1 Document Layout Analysis 1 Document Translation 1 Document-level Event Extraction 1 Domain Generalization 1 Drug–drug Interaction Extraction 1 Email Thread Summarization 1 Embeddings Evaluation 1 Emotion Recognition in Context 1 audio-visual learning 1 connective detection 1 dialogue summary 1
Filter by Language
English 971 Chinese 136 German 99 French 78 Spanish 66 Russian 61 Italian 44 Japanese 42 Arabic 40 Portuguese 40 Hindi 31 Turkish 31 Korean 29 Dutch 26 Czech 23 Danish 22 Persian 21 Vietnamese 21 Polish 20 Tamil 20 Bengali 19 Indonesian 19 Finnish 18 Multilingual 18 Romanian 18 Marathi 15 Telugu 15 Estonian 13 Hebrew 13 Thai 13 Urdu 13 Gujarati 12 Malayalam 12 Swedish 12 Greek 11 Hungarian 11 Bulgarian 10 Punjabi 10 Swahili 10 Basque 9 Kazakh 9 Norwegian 9 Ukrainian 9 Amharic 8 Croatian 8 Serbian 8 Slovak 8 Albanian 7 Armenian 7 Catalan 7 Kannada 7 Latvian 7 Mandarin Chinese 7 Slovenian 7 Welsh 7 Irish 6 Lithuanian 6 Oriya (macrolanguage) 6 Sinhala 6 Assamese 5 Icelandic 5 Macedonian 5 Mongolian 5 Sanskrit 5 Yoruba 5 Azerbaijani 4 Belarusian 4 Breton 4 Burmese 4 Georgian 4 Igbo 4 Kurdish 4 Latin 4 Maltese 4 Scottish Gaelic 4 Sindhi 4 Afrikaans 3 American Sign Language 3 Chechen 3 Filipino 3 Galician 3 Haitian 3 Hausa 3 Malagasy 3 Malay (individual language) 3 Norwegian Nynorsk 3 Somali 3 Tagalog 3 Upper Sorbian 3 Uzbek 3 Wolof 3 Aragonese 2 Bambara 2 Bashkir 2 Bavarian 2 Bishnupriya 2 Bosnian 2 Central Khmer 2 Egyptian Arabic 2 Erzya 2 Esperanto 2 Faroese 2 Guarani 2 Iranian Persian 2 Javanese 2 Jejueo 2 Kirghiz 2 Lao 2 Modern Greek 2 Nepali (macrolanguage) 2 Nigerian Pidgin 2 Norwegian Bokmål 2 Oromo 2 Quechua 2 Romansh 2 Russia Buriat 2 Serbo-Croatian 2 South Azerbaijani 2 Standard Arabic 2 Sundanese 2 Tatar 2 Uighur 2 Western Panjabi 2 Yiddish 2 Yue Chinese 2 Akkadian 1 Akuntsu 1 Ancient Greek 1 Ancient Hebrew 1 Apurinã 1 Assyrian Neo-Aramaic 1 Asturian 1 Avaric 1 Bangala 1 Bhojpuri 1 Cebuano 1 Central Bikol 1 Central Kurdish 1 Central Pashto 1 Chavacano 1 Chukot 1 Church Slavic 1 Chuvash 1 Congo Swahili 1 Coptic 1 Cornish 1 Dhivehi 1 Dimli (individual language) 1 Eastern Mari 1 Fon 1 Fulah 1 Ganda 1 Geez 1 Goan Konkani 1 Gothic 1 Ido 1 Iloko 1 Interlingue 1 Inuktitut 1 Kalmyk 1 Karachay-Balkar 1 Karelian 1 Khunsari 1 Kinyarwanda 1 Komi 1 Komi-Permyak 1 Komi-Zyrian 1 Lezghian 1 Limburgan 1 Lingala 1 Literary Chinese 1 Livvi 1 Lojban 1 Lombard 1 Low German 1 Lower Sorbian 1 Luo (Cameroon) 1 Luo (Kenya and Tanzania) 1 Luxembourgish 1 Maithili 1 Malay (macrolanguage) 1 Manipuri 1 Manx 1 Mazanderani 1 Mbyá Guaraní 1 Minangkabau 1 Mingrelian 1 Mirandese 1 Moksha 1 Moroccan Arabic 1 Mundurukú 1 Nayini 1 Neapolitan 1 Nepali (individual language) 1 Newari 1 Northern Frisian 1 Northern Kurdish 1 Northern Luri 1 Northern Sami 1 Occitan (post 1500) 1 Odia 1 Old French 1 Old Russian 1 Old Turkish 1 Ossetian 1 Pampanga 1 Piemontese 1 Portuguse 1 Pushto 1 Sardinian 1 Sicilian 1 Skolt Sami 1 Soi 1 South Levantine Arabic 1 Swati 1 Swedish Sign Language 1 Swiss German 1 Tajik 1 Tibetan 1 Tigrinya 1 Tswana 1 Tupinambá 1 Turkmen 1 Tuvinian 1 Venetian 1 Volapük 1 Walloon 1 Waray (Philippines) 1 Warlpiri 1 Western Frisian 1 Western Mari 1 Wu Chinese 1 Xhosa 1 Yakut 1 Abkhazian 0 Achinese 0 Adyghe 0 Afar 0 Akan 0 Argentine Sign Language 0 Arpitan 0 Aymara 0 Bangladeshi Sign Language 0 Banjar 0 Bislama 0 Bodo (India) 0 Buginese 0 Chamorro 0 Cherokee 0 Cheyenne 0 Choctaw 0 Corsican 0 Cree 0 Creek 0 Crimean Tatar 0 Dzongkha 0 Ewe 0 Extremaduran 0 Fiji Hindi 0 Fijian 0 Friulian 0 Gagauz 0 Gan Chinese 0 German Sign Language 0 Gilaki 0 Greek Sign Language 0 Gulf Arabic 0 Hakha Chin 0 Hakka Chinese 0 Hawaiian 0 Herero 0 Hiri Motu 0 Interlingua (International Auxiliary Language Association) 0 Inupiaq 0 Jamaican Creole English 0 Kabardian 0 Kabyle 0 Kalaallisut 0 Kanuri 0 Kara-Kalpak 0 Kashmiri 0 Kashubian 0 Kikuyu 0 Kongo 0 Kuanyama 0 Kölsch 0 Ladino 0 Lak 0 Latgalian 0 Ligurian 0 Maori 0 Marshallese 0 Min Dong Chinese 0 Modern Greek (1453-) 0 Narom 0 Nauru 0 Navajo 0 Naxi 0 Ndonga 0 Northern Huishui Hmong 0 Novial 0 Nyanja 0 Official Aramaic (700-300 BCE) 0 Old English (ca. 450-1100) 0 Pali 0 Pangasinan 0 Papiamento 0 Pedi 0 Pennsylvania German 0 Pfaelzisch 0 Picard 0 Pitcairn-Norfolk 0 Pontic 0 Rajasthani 0 Rundi 0 Rusyn 0 Saidi Arabic 0 Samoan 0 Sango 0 Santali 0 Saterfriesisch 0 Scots 0 Shona 0 Sichuan Yi 0 Silesian 0 Southern Sotho 0 Sranan Tongo 0 Swahili (macrolanguage) 0 Swiss-German Sign Language 0 Tahitian 0 Tai 0 Tetum 0 Tok Pisin 0 Tonga (Tonga Islands) 0 Tosk Albanian 0 Tsonga 0 Tulu 0 Tumbuka 0 Tunisian Arabic 0 Turkish Sign Language 0 Twi 0 Udmurt 0 Venda 0 Veps 0 Vlaams 0 Vlax Romani 0 Votic 0 Zeeuws 0 Zhuang 0 Zulu 0

1772 dataset results for Texts