Datasets

9,496 machine learning datasets

56 dataset results for Machine Translation AND Texts