Datasets

3,846 machine learning datasets
Filter by Task
Question Answering 182 Language Modelling 91 Reading Comprehension 69 Text Generation 48 Visual Question Answering 47 Natural Language Inference 39 Machine Translation 36 Named Entity Recognition 35 Information Retrieval 34 Text Summarization 34 Text Classification 33 Sentiment Analysis 31 Natural Language Understanding 27 Word Embeddings 26 Machine Reading Comprehension 24 Coreference Resolution 23 Relation Extraction 23 Data Augmentation 20 Abstractive Text Summarization 19 Common Sense Reasoning 19 Image Captioning 19 Semantic Parsing 18 Document Summarization 16 Video Question Answering 16 Data-to-Text Generation 14 Fake News Detection 14 Misinformation 14 Domain Adaptation 13 Knowledge Graphs 13 Open-Domain Question Answering 13 Semantic Textual Similarity 13 Video Captioning 13 Decision Making 12 Entity Linking 12 Cross-Lingual Transfer 11 Multi-Document Summarization 11 Multi-Task Learning 11 Paraphrase Identification 11 Recommendation Systems 10 Video Retrieval 10 Question Generation 9 Speech Recognition 9 Visual Reasoning 9 Word Sense Disambiguation 9 Automatic Post-Editing 8 Intent Detection 8 Language Identification 8 Sentence Embeddings 8 Slot Filling 8 Vision and Language Navigation 8 Visual Dialog 8 Dialogue State Tracking 7 Part-Of-Speech Tagging 7 Referring Expression Segmentation 7 Task-Oriented Dialogue Systems 7 Emotion Recognition 6 Entity Disambiguation 6 Hate Speech Detection 6 Image Retrieval 6 Link Prediction 6 Object Detection 6 Scene Text Detection 6 Stance Detection 6 Video Understanding 6 Answer Selection 5 Chinese Reading Comprehension 5 Cross-Modal Retrieval 5 Dependency Parsing 5 Dialogue Generation 5 Document Classification 5 Emotion Recognition in Conversation 5 Grammatical Error Correction 5 Image Classification 5 Learning-To-Rank 5 Multi-Label Classification 5 Open Information Extraction 5 Opinion Mining 5 Optical Character Recognition 5 Relation Classification 5 Semantic Role Labeling 5 Semantic Segmentation 5 Sentence Classification 5 Sentence Embedding 5 Speech Synthesis 5 Stochastic Optimization 5 Text Simplification 5 Visual Navigation 5 Abusive Language 4 Action Recognition 4 Conversational Response Selection 4 Cross-Lingual Question Answering 4 Emotion Classification 4 Entity Typing 4 Goal-Oriented Dialog 4 Image Generation 4 Joint Entity and Relation Extraction 4 Multimodal Machine Translation 4 Multimodal Sentiment Analysis 4 Paraphrase Generation 4 Sarcasm Detection 4 Scene Text Recognition 4 Self-Supervised Learning 4 Semantic Similarity 4 Spoken Language Understanding 4 Text-To-Speech Synthesis 4 Text-To-Sql 4 Weakly-Supervised Named Entity Recognition 4 Zero-Shot Learning 4 Abuse Detection 3 Biomedical Information Retrieval 3 Chatbot 3 Code Generation 3 Community Question Answering 3 Dense Video Captioning 3 Document Ranking 3 Event Extraction 3 Extractive Text Summarization 3 Fact Verification 3 Fairness 3 Graph Classification 3 Handwriting Recognition 3 Lipreading 3 Math Word Problem Solving 3 Meta-Learning 3 Multiple Instance Learning 3 Nested Named Entity Recognition 3 Passage Retrieval 3 Phrase Grounding 3 Reading Comprehension (Few-Shot) 3 Reading Comprehension (One-Shot) 3 Reading Comprehension (Zero-Shot) 3 Scientific Document Summarization 3 Structured Prediction 3 Style Transfer 3 Text Style Transfer 3 Text-Image Retrieval 3 Vision-Language Navigation 3 Word Alignment 3 Active Learning 2 Anomaly Detection 2 Arabic Sentiment Analysis 2 Art Analysis 2 Aspect-Based Sentiment Analysis 2 Automated Theorem Proving 2 Chinese Named Entity Recognition 2 Chinese Sentence Pair Classification 2 Chunking 2 Citation Prediction 2 Cloze (multi-choices) (Few-Shot) 2 Cloze (multi-choices) (One-Shot) 2 Cloze (multi-choices) (Zero-Shot) 2 Constituency Parsing 2 Cross-Lingual Bitext Mining 2 Cross-Lingual Document Classification 2 Cross-Lingual Natural Language Inference 2 Cross-Lingual Paraphrase Identification 2 Dialog Relation Extraction 2 Distractor Generation 2 Duplicate-Question Retrieval 2 Entity Retrieval 2 Event Coreference Resolution 2 Explainable artificial intelligence 2 Fact Checking 2 Feature Engineering 2 Gender Prediction 2 Genre classification 2 Goal-Oriented Dialogue Systems 2 Grammatical Error Detection 2 Graph Embedding 2 Humor Detection 2 Imitation Learning 2 Intent Classification 2 Keyword Extraction 2 Knowledge Graph Embeddings 2 Language Acquisition 2 Lexical Entailment 2 Linguistic Acceptability 2 Lip Reading 2 Logical Reasoning Question Answering 2 Low-Resource Neural Machine Translation 2 Mathematical Question Answering 2 Meeting Summarization 2 Morphological Analysis 2 Multi-Label Text Classification 2 Native Language Identification 2 Negation Detection 2 Nested Mention Recognition 2 Network Embedding 2 Neural Architecture Search 2 News Generation 2 Node Classification 2 Person Search 2 Point Processes 2 Recipe Generation 2 Referring Expression Comprehension 2 Relational Reasoning 2 SQL Parsing 2 Scene Graph Detection 2 Scene Graph Generation 2 Scene Text 2 Scientific Results Extraction 2 Sentence Fusion 2 Sign Language Translation 2 Source Code Summarization 2 Spoken Dialogue Systems 2 Systematic Generalization 2 Table Detection 2 Table-based Fact Verification 2 Text Categorization 2 Text Matching 2 Tokenization 2 Transliteration 2 Tweet Retrieval 2 Unconstrained Lip-synchronization 2 Unsupervised Machine Translation 2 Variational Inference 2 Video Description 2 Visual Storytelling 2 Weakly Supervised Classification 2 3D Object Classification 1 Accented Speech Recognition 1 Action Classification 1 Action Quality Assessment 1 Action Understanding 1 Adversarial Attack 1 Arabic Text Diacritization 1 Argument Mining 1 Argument Retrieval 1 Audio Super-Resolution 1 Audio-Visual Speech Recognition 1 Autonomous Driving 1 COVID-19 Diagnosis 1 Causal Emotion Entailment 1 Chinese Word Segmentation 1 Citation Intent Classification 1 Citation Recommendation 1 Click-Through Rate Prediction 1 Code Documentation Generation 1 Code Search 1 Combinatorial Optimization 1 Common Sense Reasoning (Few-Shot) 1 Common Sense Reasoning (One-Shot) 1 Common Sense Reasoning (Zero-Shot) 1 Commonsense Knowledge Base Construction 1 Community Detection 1 Complex Word Identification 1 Component Classification 1 Computed Tomography (CT) 1 Concept-To-Text Generation 1 Constituency Grammar Induction 1 Continual Learning 1 Continuous Control 1 Conversation Disentanglement 1 Cross Document Coreference Resolution 1 Cross-Document Language Modeling 1 Cross-Domain Named Entity Recognition 1 Cross-Lingual Abstractive Summarization 1 Cross-Lingual NER 1 Cross-Lingual POS Tagging 1 Cross-Lingual Semantic Textual Similarity 1 Cross-Lingual Sentiment Classification 1 Cross-lingual zero-shot dependency parsing 1 Curved Text Detection 1 Deblurring 1 Deception Detection 1 Decipherment 1 Definition Extraction 1 Depression Detection 1 Dialogue Management 1 Dialogue Understanding 1 Disaster Response 1 Discourse Parsing 1 Distant Speech Recognition 1 Document Layout Analysis 1 Document-level 1 Domain Generalization 1 Drug–drug Interaction Extraction 1 End-To-End Dialogue Modelling 1 Entity Cross-Document Coreference Resolution 1 Entity Embeddings 1 Entity Extraction using GAN 1 Epidemiology 1 Event Cross-Document Coreference Resolution 1 Evidence Selection 1 Extreme Summarization 1 Face Sketch Synthesis 1 Feature Importance 1 Federated Learning 1 Few-Shot Relation Classification 1 Fine-Grained Opinion Analysis 1 Fine-Grained Visual Categorization 1 Fine-Grained Visual Recognition 1 Fine-grained Action Recognition 1 Food Recognition 1 General Classification 1 Generative Question Answering 1 Graph Generation 1 Graph Question Answering 1 Graph Similarity 1 Graph-to-Sequence 1 Hand Gesture Recognition 1 Handwritten Chinese Text Recognition 1 Human Pose Forecasting 1 Human robot interaction 1 Hypernym Discovery 1 Image Comprehension 1 Image Forensics 1 Image Inpainting 1 Image Manipulation 1 Incremental Learning 1 Interpretable Machine Learning 1 Intrusion Detection 1 KB-to-Language Generation 1 Key Information Extraction 1 Knowledge Base Population 1 Knowledge Base Question Answering 1 Latent Variable Models 1 Layout-to-Image Generation 1 Length-of-Stay prediction 1 Lexical Simplification 1 Lip to Speech Synthesis 1 Low Resource Named Entity Recognition 1 Material Classification 1 Material Recognition 1 Mathematical Proofs 1 Medical Diagnosis 1 Medical Procedure 1 Medical Relation Extraction 1 Medical Report Generation 1 Meme Classification 1 Memex Question Answering 1 Method name prediction 1 Metric Learning 1 Missing Elements 1 Mortality Prediction 1 Motion Capture 1 Multi-Choice MRC 1 Multi-Domain Sentiment Classification 1 Multi-Hop Reading Comprehension 1 Multi-domain Dialogue State Tracking 1 Multimodal Abstractive Text Summarization 1 Multiview Learning 1 Music Information Retrieval 1 Music Source Separation 1 Myocardial infarction detection 1 NLP based Person Retrival 1 Natural Language Inference (Few-Shot) 1 Natural Language Inference (One-Shot) 1 Natural Language Inference (Zero-Shot) 1 Natural Language Visual Grounding 1 NetHack Score 1 News Annotation 1 News Classification 1 News Retrieval 1 Node Clustering 1 Object Recognition 1 Open Knowledge Graph Embedding 1 Open-Domain Dialog 1 Outlier Detection 1 Parallel Corpus Mining 1 Passage Re-Ranking 1 Person Re-Identification 1 Predicate Classification 1 Predicate Detection 1 Program Repair 1 Program Synthesis 1 Program induction 1 Prosody Prediction 1 Quantization 1 Re-Ranking 1 Reasoning Chain Explanations 1 Recognizing Emotion Cause in Conversations 1 Region Proposal 1 Rumour Detection 1 SQL-to-Text 1 Salient Object Detection 1 Scene Generation 1 Scene Graph Classification 1 Scene Understanding 1 Scene-Aware Dialogue 1 Semantic Role Labeling (predicted predicates) 1 Semi-Supervised Text Classification 1 Semi-Supervised Video Object Segmentation 1 Sentence Compression 1 Sentence Embeddings For Biomedical Texts 1 Sentence Similarity 1 Sentence-level Cat1 1 Sentence-level Cat2 1 Sentence-level Cat3 1 Short Text Clustering 1 Sign Language Production 1 Sign Language Recognition 1 Sleep spindles detection 1 Span-Extraction MRC 1 Speaker Diarization 1 Speech Enhancement 1 Speech Separation 1 Speech-to-Gesture Translation 1 Speech-to-Text Translation 1 Spelling Correction 1 Split and Rephrase 1 Spoken language identification 1 Sql Chatbots 1 Starcraft 1 Starcraft II 1 Stock Market Prediction 1 Stock Prediction 1 Subjectivity Analysis 1 Supervised Video Summarization 1 Table-to-Text Generation 1 Talking Face Generation 1 Talking Head Generation 1 Temporal Action Localization 1 Temporal Action Proposal Generation 1 Temporal Information Extraction 1 Temporal Localization 1 Text Infilling 1 Text based Person Retrieval 1 Text-to-Image Generation 1 Time Series Classification 1 Timex normalization 1 Topic Models 1 Toxic Comment Classification 1 Tweet-Reply Sentiment Analysis 1 Twitter Sentiment Analysis 1 Unsupervised Dependency Parsing 1 Unsupervised KG-to-text 1 Unsupervised Text Style Transfer 1 Unsupervised Video Object Segmentation 1 Unsupervised semantic parsing 1 Video Story QA 1 Video Summarization 1 Visual Commonsense Reasoning 1 Visual Entailment 1 Visual Relationship Detection 1 Visual Speech Recognition 1 Wildly Unsupervised Domain Adaptation 1 Zero-Shot Cross-Lingual Transfer 1 dialogue summary 1 graph construction 1 severity prediction 1 text-based games 1
Filter by Language
English 523 Chinese 68 German 64 French 46 Spanish 41 Russian 30 Japanese 26 Arabic 24 Italian 23 Portuguese 22 Czech 18 Turkish 18 Hindi 17 Korean 16 Finnish 15 Dutch 14 Persian 13 Vietnamese 13 Multilingual 12 Romanian 12 Telugu 11 Estonian 10 Polish 10 Tamil 10 Bengali 9 Indonesian 9 Marathi 9 Thai 9 Malayalam 8 Gujarati 7 Norwegian 7 Swedish 7 Urdu 7 Armenian 6 Basque 6 Bulgarian 6 Catalan 6 Greek 6 Hebrew 6 Hungarian 6 Kannada 6 Punjabi 6 Swahili 6 Ukrainian 6 Albanian 5 Amharic 5 Assamese 5 Croatian 5 Danish 5 Kazakh 5 Mandarin Chinese 5 Slovak 5 Slovenian 5 Welsh 5 Breton 4 Kurdish 4 Latvian 4 Lithuanian 4 Macedonian 4 Oriya (macrolanguage) 4 Serbian 4 Sinhala 4 Yoruba 4 Afrikaans 3 Belarusian 3 Chechen 3 Galician 3 Georgian 3 Haitian 3 Icelandic 3 Igbo 3 Irish 3 Latin 3 Malagasy 3 Mongolian 3 Sanskrit 3 Scottish Gaelic 3 Sindhi 3 Tagalog 3 Wolof 3 American Sign Language 2 Aragonese 2 Azerbaijani 2 Bavarian 2 Bishnupriya 2 Bosnian 2 Burmese 2 Central Khmer 2 Egyptian Arabic 2 Erzya 2 Esperanto 2 Filipino 2 Guarani 2 Hausa 2 Javanese 2 Jejueo 2 Lao 2 Malay (individual language) 2 Maltese 2 Modern Greek 2 Nigerian Pidgin 2 Norwegian Nynorsk 2 Quechua 2 Romansh 2 Russia Buriat 2 Serbo-Croatian 2 Somali 2 South Azerbaijani 2 Standard Arabic 2 Sundanese 2 Tatar 2 Uighur 2 Upper Sorbian 2 Uzbek 2 Yiddish 2 Yue Chinese 2 Akkadian 1 Akuntsu 1 Ancient Greek 1 Apurinã 1 Assyrian Neo-Aramaic 1 Asturian 1 Avaric 1 Bambara 1 Bashkir 1 Bhojpuri 1 Cebuano 1 Central Bikol 1 Central Kurdish 1 Chavacano 1 Chukot 1 Church Slavic 1 Chuvash 1 Coptic 1 Cornish 1 Dhivehi 1 Dimli (individual language) 1 Eastern Mari 1 Faroese 1 Fon 1 Fulah 1 Ganda 1 Goan Konkani 1 Gothic 1 Ido 1 Iloko 1 Interlingue 1 Iranian Persian 1 Kalmyk 1 Karachay-Balkar 1 Karelian 1 Khunsari 1 Kinyarwanda 1 Kirghiz 1 Komi 1 Komi-Permyak 1 Komi-Zyrian 1 Lezghian 1 Limburgan 1 Lingala 1 Literary Chinese 1 Livvi 1 Lojban 1 Lombard 1 Low German 1 Lower Sorbian 1 Luo (Cameroon) 1 Luo (Kenya and Tanzania) 1 Luxembourgish 1 Maithili 1 Manx 1 Mazanderani 1 Mbyá Guaraní 1 Minangkabau 1 Mingrelian 1 Mirandese 1 Moksha 1 Moroccan Arabic 1 Mundurukú 1 Nayini 1 Neapolitan 1 Nepali (individual language) 1 Nepali (macrolanguage) 1 Newari 1 Northern Frisian 1 Northern Kurdish 1 Northern Luri 1 Northern Sami 1 Norwegian Bokmål 1 Occitan (post 1500) 1 Old French 1 Old Russian 1 Old Turkish 1 Oromo 1 Ossetian 1 Pampanga 1 Piemontese 1 Portuguse 1 Pushto 1 Sardinian 1 Sicilian 1 Skolt Sami 1 Soi 1 South Levantine Arabic 1 Swati 1 Swedish Sign Language 1 Swiss German 1 Tajik 1 Tibetan 1 Tswana 1 Tupinambá 1 Turkmen 1 Tuvinian 1 Venetian 1 Volapük 1 Walloon 1 Waray (Philippines) 1 Warlpiri 1 Western Frisian 1 Western Mari 1 Western Panjabi 1 Wu Chinese 1 Xhosa 1 Yakut 1 Abkhazian 0 Achinese 0 Adyghe 0 Afar 0 Akan 0 Arpitan 0 Aymara 0 Bangladeshi Sign Language 0 Banjar 0 Bislama 0 Buginese 0 Chamorro 0 Cherokee 0 Cheyenne 0 Choctaw 0 Corsican 0 Cree 0 Creek 0 Crimean Tatar 0 Dzongkha 0 Ewe 0 Extremaduran 0 Fiji Hindi 0 Fijian 0 Friulian 0 Gagauz 0 Gan Chinese 0 German Sign Language 0 Gilaki 0 Greek Sign Language 0 Gulf Arabic 0 Hakha Chin 0 Hakka Chinese 0 Hawaiian 0 Herero 0 Hiri Motu 0 Interlingua (International Auxiliary Language Association) 0 Inuktitut 0 Inupiaq 0 Jamaican Creole English 0 Kabardian 0 Kabyle 0 Kalaallisut 0 Kanuri 0 Kara-Kalpak 0 Kashmiri 0 Kashubian 0 Kikuyu 0 Kongo 0 Kuanyama 0 Kölsch 0 Ladino 0 Lak 0 Latgalian 0 Ligurian 0 Malay (macrolanguage) 0 Maori 0 Marshallese 0 Min Dong Chinese 0 Modern Greek (1453-) 0 Narom 0 Nauru 0 Navajo 0 Ndonga 0 Novial 0 Nyanja 0 Odia 0 Official Aramaic (700-300 BCE) 0 Old English (ca. 450-1100) 0 Pali 0 Pangasinan 0 Papiamento 0 Pedi 0 Pennsylvania German 0 Pfaelzisch 0 Picard 0 Pitcairn-Norfolk 0 Pontic 0 Rundi 0 Rusyn 0 Samoan 0 Sango 0 Saterfriesisch 0 Scots 0 Shona 0 Sichuan Yi 0 Silesian 0 Southern Sotho 0 Sranan Tongo 0 Swahili (macrolanguage) 0 Swiss-German Sign Language 0 Tahitian 0 Tetum 0 Tigrinya 0 Tok Pisin 0 Tonga (Tonga Islands) 0 Tosk Albanian 0 Tsonga 0 Tulu 0 Tumbuka 0 Tunisian Arabic 0 Turkish Sign Language 0 Twi 0 Udmurt 0 Venda 0 Veps 0 Vlaams 0 Vlax Romani 0 Votic 0 Zeeuws 0 Zhuang 0 Zulu 0

1078 dataset results for Texts