Datasets

3,464 Machine Learning Datasets
Filter by Task
Question Answering 170 Language Modelling 89 Reading Comprehension 67 Text Generation 48 Visual Question Answering 46 Natural Language Inference 40 Machine Translation 34 Named Entity Recognition 34 Text Summarization 34 Sentiment Analysis 31 Information Retrieval 28 Natural Language Understanding 27 Text Classification 27 Word Embeddings 26 Coreference Resolution 22 Relation Extraction 22 Common Sense Reasoning 21 Machine Reading Comprehension 21 Data Augmentation 20 Abstractive Text Summarization 19 Image Captioning 17 Semantic Parsing 17 Document Summarization 16 Video Question Answering 16 Data-to-Text Generation 13 Knowledge Graphs 13 Open-Domain Question Answering 13 Video Captioning 13 Decision Making 12 Domain Adaptation 12 Entity Linking 11 Multi-Task Learning 11 Semantic Textual Similarity 11 Cross-Lingual Transfer 10 Paraphrase Identification 10 Recommendation Systems 10 Video Retrieval 10 Multi-Document Summarization 9 Question Generation 9 Visual Reasoning 9 Word Sense Disambiguation 9 Automatic Post-Editing 8 Intent Detection 8 Language Identification 8 Misinformation 8 Sentence Embeddings 8 Slot Filling 8 Speech Recognition 8 Visual Dialog 8 Dialogue State Tracking 7 Fake News Detection 7 Part-Of-Speech Tagging 7 Task-Oriented Dialogue Systems 7 Vision and Language Navigation 7 Emotion Recognition 6 Entity Disambiguation 6 Link Prediction 6 Multi-Label Classification 6 Object Detection 6 Stance Detection 6 Video Understanding 6 Answer Selection 5 Chinese Reading Comprehension 5 Dependency Parsing 5 Dialogue Generation 5 Emotion Recognition in Conversation 5 Goal-Oriented Dialog 5 Grammatical Error Correction 5 Image Classification 5 Image Retrieval 5 Learning-To-Rank 5 Open Information Extraction 5 Opinion Mining 5 Paraphrase Generation 5 Relation Classification 5 Scene Text Detection 5 Semantic Segmentation 5 Sentence Classification 5 Sentence Embedding 5 Speech Synthesis 5 Stochastic Optimization 5 Text Simplification 5 Visual Navigation 5 Abusive Language 4 Action Recognition 4 Conversational Response Selection 4 Cross-Modal Retrieval 4 Document Classification 4 Emotion Classification 4 Entity Typing 4 Hate Speech Detection 4 Image Generation 4 Multimodal Machine Translation 4 Optical Character Recognition 4 Referring Expression Segmentation 4 Sarcasm Detection 4 Self-Supervised Learning 4 Semantic Role Labeling 4 Semantic Similarity 4 Spoken Language Understanding 4 Text-To-Sql 4 Weakly-Supervised Named Entity Recognition 4 Zero-Shot Learning 4 Chatbot 3 Code Generation 3 Community Question Answering 3 Dense Video Captioning 3 Document Ranking 3 Extractive Text Summarization 3 Fact Verification 3 Fairness 3 Handwriting Recognition 3 Joint Entity and Relation Extraction 3 Math Word Problem Solving 3 Medical Named Entity Recognition 3 Meta-Learning 3 Multimodal Sentiment Analysis 3 Multiple Instance Learning 3 Nested Named Entity Recognition 3 Phrase Grounding 3 Structured Prediction 3 Style Transfer 3 Text Style Transfer 3 Text-Image Retrieval 3 Vision-Language Navigation 3 Word Alignment 3 2 Abuse Detection 2 Active Learning 2 Anomaly Detection 2 Arabic Sentiment Analysis 2 Art Analysis 2 Aspect-Based Sentiment Analysis 2 Automated Theorem Proving 2 Chinese Named Entity Recognition 2 Chinese Sentence Pair Classification 2 Chunking 2 Constituency Parsing 2 Cross-Lingual Bitext Mining 2 Cross-Lingual Document Classification 2 Cross-Lingual Natural Language Inference 2 Dialog Relation Extraction 2 Distractor Generation 2 Event Extraction 2 Feature Engineering 2 Gender Prediction 2 Genre classification 2 Goal-Oriented Dialogue Systems 2 Grammatical Error Detection 2 Graph Embedding 2 Humor Detection 2 Imitation Learning 2 Intent Classification 2 Keyword Extraction 2 Knowledge Graph Embeddings 2 Language Acquisition 2 Lexical Entailment 2 Linguistic Acceptability 2 Lipreading 2 Logical Reasoning Question Answering 2 Low-Resource Neural Machine Translation 2 Meeting Summarization 2 Morphological Analysis 2 Multi-Label Text Classification 2 Negation Detection 2 Nested Mention Recognition 2 Network Embedding 2 Neural Architecture Search 2 News Generation 2 Node Classification 2 Person Search 2 Point Processes 2 Recipe Generation 2 Relational Reasoning 2 SQL Parsing 2 Scene Graph Detection 2 Scene Graph Generation 2 Scene Text 2 Scene Text Recognition 2 Scientific Document Summarization 2 Scientific Results Extraction 2 Sentence Fusion 2 Source Code Summarization 2 Spoken Dialogue Systems 2 Systematic Generalization 2 Table Detection 2 Table-based Fact Verification 2 Text Categorization 2 Text Matching 2 Text-To-Speech Synthesis 2 Tokenization 2 Transliteration 2 Unconstrained Lip-synchronization 2 Unsupervised Machine Translation 2 Variational Inference 2 Video Description 2 Visual Storytelling 2 Weakly Supervised Classification 2 3D Object Classification 1 Accented Speech Recognition 1 Action Classification 1 Action Quality Assessment 1 Action Recognition In Videos 1 Action Understanding 1 Adversarial Attack 1 Arabic Text Diacritization 1 Argument Mining 1 Audio Super-Resolution 1 Autonomous Driving 1 COVID-19 Diagnosis 1 Causal Emotion Entailment 1 Chinese Word Segmentation 1 Citation Intent Classification 1 Citation Recommendation 1 Click-Through Rate Prediction 1 Code Documentation Generation 1 Code Search 1 Combinatorial Optimization 1 Community Detection 1 Complex Word Identification 1 Component Classification 1 Computed Tomography (CT) 1 Concept-To-Text Generation 1 Constituency Grammar Induction 1 Continual Learning 1 Continuous Control 1 Conversation Disentanglement 1 Cross Document Coreference Resolution 1 Cross-Document Language Modeling 1 Cross-Domain Named Entity Recognition 1 Cross-Lingual Abstractive Summarization 1 Cross-Lingual Semantic Textual Similarity 1 Cross-Lingual Sentiment Classification 1 Cross-lingual zero-shot dependency parsing 1 Curved Text Detection 1 Deception Detection 1 Decipherment 1 Definition Extraction 1 Depression Detection 1 Dialogue Management 1 Dialogue Understanding 1 Discourse Parsing 1 Distant Speech Recognition 1 Document Layout Analysis 1 Domain Generalization 1 Drug–drug Interaction Extraction 1 End-To-End Dialogue Modelling 1 Entity Embeddings 1 Entity Extraction using GAN 1 Epidemiology 1 Event Coreference Resolution 1 Extreme Summarization 1 Face Sketch Synthesis 1 Feature Importance 1 Federated Learning 1 Few-Shot Relation Classification 1 Fine-Grained Opinion Analysis 1 Fine-Grained Visual Categorization 1 Fine-Grained Visual Recognition 1 Fine-grained Action Recognition 1 Food Recognition 1 Generative Question Answering 1 Graph Generation 1 Graph Similarity 1 Graph-to-Sequence 1 Hand Gesture Recognition 1 Handwritten Chinese Text Recognition 1 Human Pose Forecasting 1 Human robot interaction 1 Hypernym Discovery 1 Image Comprehension 1 Image Forensics 1 Image Inpainting 1 Image Manipulation 1 Incremental Learning 1 Interpretable Machine Learning 1 Intrusion Detection 1 KB-to-Language Generation 1 Knowledge Base Population 1 Latent Variable Models 1 Layout-to-Image Generation 1 Lexical Simplification 1 Lip Reading 1 Lip to Speech Synthesis 1 Low Resource Named Entity Recognition 1 Material Classification 1 Material Recognition 1 Mathematical Proofs 1 Mathematical Question Answering 1 Medical Relation Extraction 1 Medical Report Generation 1 Meme Classification 1 Memex Question Answering 1 Method name prediction 1 Metric Learning 1 Missing Elements 1 Motion Capture 1 Multi-Domain Sentiment Classification 1 Multi-Hop Reading Comprehension 1 Multi-domain Dialogue State Tracking 1 Multimodal Abstractive Text Summarization 1 Multiview Learning 1 Music Information Retrieval 1 Music Source Separation 1 Myocardial infarction detection 1 NLP based Person Retrival 1 Native Language Identification 1 Natural Language Visual Grounding 1 NetHack Score 1 Node Clustering 1 Object Recognition 1 Open Knowledge Graph Embedding 1 Open-Domain Dialog 1 Outlier Detection 1 Parallel Corpus Mining 1 Passage Re-Ranking 1 Person Re-Identification 1 Predicate Classification 1 Predicate Detection 1 Program Repair 1 Program Synthesis 1 Program induction 1 Prosody Prediction 1 Quantization 1 Reasoning Chain Explanations 1 Recognizing Emotion Cause in Conversations 1 Referring Expression Comprehension 1 Region Proposal 1 SQL-to-Text 1 Scene Generation 1 Scene Graph Classification 1 Scene Understanding 1 Scene-Aware Dialogue 1 Semantic Role Labeling (predicted predicates) 1 Semi-Supervised Text Classification 1 Sentence Compression 1 Sentence Embeddings For Biomedical Texts 1 Sentence Similarity 1 Sign Language Recognition 1 Sign Language Translation 1 Sleep spindles detection 1 Speaker Diarization 1 Speech Enhancement 1 Speech Separation 1 Speech-to-Gesture Translation 1 Spelling Correction 1 Split and Rephrase 1 Spoken language identification 1 Sql Chatbots 1 Starcraft 1 Starcraft II 1 Stock Market Prediction 1 Stock Prediction 1 Subjectivity Analysis 1 Supervised Video Summarization 1 Table-to-Text Generation 1 Talking Face Generation 1 Talking Head Generation 1 Temporal Action Localization 1 Temporal Action Proposal Generation 1 Temporal Information Extraction 1 Temporal Localization 1 Text Infilling 1 Text based Person Retrieval 1 Text-to-Image Generation 1 Time Series Classification 1 Timex normalization 1 Topic Models 1 Toxic Comment Classification 1 Tweet-Reply Sentiment Analysis 1 Twitter Sentiment Analysis 1 Unsupervised Dependency Parsing 1 Unsupervised KG-to-text 1 Unsupervised Text Style Transfer 1 Unsupervised semantic parsing 1 Video Story QA 1 Video Summarization 1 Visual Commonsense Reasoning 1 Visual Entailment 1 Visual Relationship Detection 1 dialogue summary 1 graph construction 1 text-based games 1
Filter by Language
English 451 Chinese 64 German 51 French 40 Spanish 37 Russian 26 Arabic 22 Japanese 22 Italian 20 Portuguese 19 Czech 16 Hindi 16 Turkish 16 Korean 15 Finnish 14 Dutch 13 Vietnamese 13 Multilingual 12 Romanian 11 Persian 10 Telugu 10 Marathi 9 Polish 9 Tamil 9 Thai 9 Bengali 8 Estonian 8 Gujarati 7 Indonesian 7 Malayalam 7 Norwegian 7 Armenian 6 Greek 6 Hebrew 6 Kannada 6 Punjabi 6 Swahili 6 Swedish 6 Urdu 6 Amharic 5 Assamese 5 Basque 5 Bulgarian 5 Catalan 5 Danish 5 Hungarian 5 Ukrainian 5 Albanian 4 Breton 4 Croatian 4 Kurdish 4 Lithuanian 4 Oriya (macrolanguage) 4 Serbian 4 Sinhala 4 Slovak 4 Welsh 4 Yoruba 4 Afrikaans 3 Belarusian 3 Galician 3 Haitian 3 Icelandic 3 Igbo 3 Irish 3 Kazakh 3 Latin 3 Latvian 3 Macedonian 3 Malagasy 3 Mandarin Chinese 3 Sanskrit 3 Scottish Gaelic 3 Sindhi 3 Slovenian 3 Tagalog 3 Wolof 3 Aragonese 2 Azerbaijani 2 Bavarian 2 Bishnupriya 2 Bosnian 2 Burmese 2 Central Khmer 2 Chechen 2 Egyptian Arabic 2 Erzya 2 Esperanto 2 Filipino 2 Georgian 2 Guarani 2 Hausa 2 Javanese 2 Jejueo 2 Lao 2 Malay (individual language) 2 Maltese 2 Modern Greek 2 Mongolian 2 Nigerian Pidgin 2 Norwegian Nynorsk 2 Quechua 2 Romansh 2 Russia Buriat 2 Somali 2 South Azerbaijani 2 Standard Arabic 2 Sundanese 2 Tatar 2 Uighur 2 Upper Sorbian 2 Uzbek 2 Yiddish 2 Yue Chinese 2 Akkadian 1 Akuntsu 1 American Sign Language 1 Ancient Greek 1 Apurinã 1 Assyrian Neo-Aramaic 1 Asturian 1 Avaric 1 Bambara 1 Bashkir 1 Bhojpuri 1 Cebuano 1 Central Bikol 1 Central Kurdish 1 Chavacano 1 Chukot 1 Church Slavic 1 Chuvash 1 Coptic 1 Cornish 1 Dhivehi 1 Dimli (individual language) 1 Eastern Mari 1 Faroese 1 Fon 1 Fulah 1 Ganda 1 Goan Konkani 1 Gothic 1 Ido 1 Iloko 1 Interlingue 1 Kalmyk 1 Karachay-Balkar 1 Karelian 1 Khunsari 1 Kinyarwanda 1 Kirghiz 1 Komi 1 Komi-Permyak 1 Komi-Zyrian 1 Lezghian 1 Limburgan 1 Lingala 1 Literary Chinese 1 Livvi 1 Lojban 1 Lombard 1 Low German 1 Lower Sorbian 1 Luo (Cameroon) 1 Luo (Kenya and Tanzania) 1 Luxembourgish 1 Maithili 1 Manx 1 Mazanderani 1 Mbyá Guaraní 1 Minangkabau 1 Mingrelian 1 Mirandese 1 Moksha 1 Moroccan Arabic 1 Mundurukú 1 Nayini 1 Neapolitan 1 Nepali (individual language) 1 Nepali (macrolanguage) 1 Newari 1 Northern Frisian 1 Northern Kurdish 1 Northern Luri 1 Northern Sami 1 Norwegian Bokmål 1 Occitan (post 1500) 1 Old French 1 Old Russian 1 Old Turkish 1 Oromo 1 Ossetian 1 Pampanga 1 Piemontese 1 Portuguse 1 Pushto 1 Sardinian 1 Serbo-Croatian 1 Sicilian 1 Skolt Sami 1 Soi 1 South Levantine Arabic 1 Swati 1 Swedish Sign Language 1 Swiss German 1 Tajik 1 Tibetan 1 Tswana 1 Tupinambá 1 Turkmen 1 Tuvinian 1 Venetian 1 Volapük 1 Walloon 1 Waray (Philippines) 1 Warlpiri 1 Western Frisian 1 Western Mari 1 Western Panjabi 1 Wu Chinese 1 Xhosa 1 Yakut 1 Abkhazian 0 Achinese 0 Adyghe 0 Afar 0 Akan 0 Arpitan 0 Aymara 0 Bangladeshi Sign Language 0 Banjar 0 Bislama 0 Buginese 0 Chamorro 0 Cherokee 0 Cheyenne 0 Choctaw 0 Corsican 0 Cree 0 Creek 0 Crimean Tatar 0 Dzongkha 0 Ewe 0 Extremaduran 0 Fiji Hindi 0 Fijian 0 Friulian 0 Gagauz 0 Gan Chinese 0 German Sign Language 0 Gilaki 0 Greek Sign Language 0 Gulf Arabic 0 Hakha Chin 0 Hakka Chinese 0 Hawaiian 0 Herero 0 Hiri Motu 0 Interlingua (International Auxiliary Language Association) 0 Inuktitut 0 Inupiaq 0 Jamaican Creole English 0 Kabardian 0 Kabyle 0 Kalaallisut 0 Kanuri 0 Kara-Kalpak 0 Kashmiri 0 Kashubian 0 Kikuyu 0 Kongo 0 Kuanyama 0 Kölsch 0 Ladino 0 Lak 0 Latgalian 0 Ligurian 0 Malay (macrolanguage) 0 Maori 0 Marshallese 0 Min Dong Chinese 0 Modern Greek (1453-) 0 Narom 0 Nauru 0 Navajo 0 Ndonga 0 Novial 0 Nyanja 0 Odia 0 Official Aramaic (700-300 BCE) 0 Old English (ca. 450-1100) 0 Pali 0 Pangasinan 0 Papiamento 0 Pedi 0 Pennsylvania German 0 Pfaelzisch 0 Picard 0 Pitcairn-Norfolk 0 Pontic 0 Rundi 0 Rusyn 0 Samoan 0 Sango 0 Saterfriesisch 0 Scots 0 Shona 0 Sichuan Yi 0 Silesian 0 Southern Sotho 0 Sranan Tongo 0 Swahili (macrolanguage) 0 Tahitian 0 Tetum 0 Tigrinya 0 Tok Pisin 0 Tonga (Tonga Islands) 0 Tosk Albanian 0 Tsonga 0 Tulu 0 Tumbuka 0 Tunisian Arabic 0 Turkish Sign Language 0 Twi 0 Udmurt 0 Venda 0 Veps 0 Vlaams 0 Vlax Romani 0 Votic 0 Zeeuws 0 Zhuang 0 Zulu 0

978 dataset results for Texts