Datasets

9,499 machine learning datasets
Filter by Task (clear)
Sentiment Analysis Audio Classification 28 Speech Recognition 28 Music Information Retrieval 21 Sound Event Detection 18 Music Generation 16 Automatic Speech Recognition (ASR) 14 Speech Emotion Recognition 12 Information Retrieval 11 Speech Enhancement 11 Acoustic Scene Classification 9 Few-Shot Audio Classification 9 Speech Separation 9 Audio Source Separation 8 Audio Tagging 8 Data Augmentation 8 Emotion Recognition 8 Multi-Task Learning 8 Scene Classification 8 Text-To-Speech Synthesis 8 Automatic Speech Recognition 7 Language Modelling 7 Music Source Separation 7 Sound Event Localization and Detection 7 Environmental Sound Classification 6 Music Transcription 6 Speech Synthesis 6 Action Recognition 5 Audio Generation 5 Audio-Visual Speech Recognition 5 Emotion Recognition in Conversation 5 Multimodal Emotion Recognition 5 Spoken Language Understanding 5 Video Understanding 5 Anomaly Detection 4 Audio to Text Retrieval 4 Emotion Classification 4 Gesture Generation 4 Keyword Spotting 4 Lipreading 4 Multi-Label Classification 4 Music Modeling 4 Question Answering 4 Text to Audio Retrieval 4 Video Retrieval 4 Visual Speech Recognition 4 Action Quality Assessment 3 Classification 3 Direction of Arrival Estimation 3 Distant Speech Recognition 3 Facial Expression Recognition (FER) 3 Genre classification 3 Image Classification 3 Instrument Recognition 3 Intent Detection 3 Language Identification 3 Lip Reading 3 Multi-modal Classification 3 Multi-task Audio Source Seperation 3 Multimodal Deep Learning 3 Multimodal Sentiment Analysis 3 Music Classification 3 Music Tagging 3 Object Recognition 3 Quantization 3 Recommendation Systems 3 Self-Supervised Learning 3 Semantic Segmentation 3 Slot Filling 3 Speaker Diarization 3 Speaker Recognition 3 Speaker Verification 3 Speech Denoising 3 Speech Extraction 3 Spoken language identification 3 Talking Face Generation 3 Unconstrained Lip-synchronization 3 Video Emotion Recognition 3 Voice Conversion 3 3D Face Animation 2 3D Object Classification 2 Abstractive Text Summarization 2 Acoustic echo cancellation 2 Activity Recognition 2 Arousal Estimation 2 Audio Emotion Recognition 2 Audio Signal Processing 2 Audio Super-Resolution 2 Audio captioning 2 Audio-Visual Synchronization 2 Automatic Phoneme Recognition 2 Bird Audio Detection 2 Contrastive Learning 2 Cross-Modal Retrieval 2 DeepFake Detection 2 Depression Detection 2 Facial Emotion Recognition 2 Intent Discovery 2 Lip to Speech Synthesis 2 Multiview Learning 2 Music Auto-Tagging 2 Music Recommendation 2 Online Beat Tracking 2 Online Downbeat Tracking 2 Open Intent Discovery 2 Open Set Learning 2 Out of Distribution (OOD) Detection 2 Resynthesis 2 Robust Speech Recognition 2 Scene Understanding 2 Skills Assessment 2 Skills Evaluation 2 Sound Classification 2 Speaker Identification 2 Speaker Separation 2 Speech Dereverberation 2 Speech-to-Text Translation 2 Style Transfer 2 Talking Head Generation 2 Target Sound Extraction 2 Text-to-Music Generation 2 Unsupervised Anomaly Detection 2 Valence Estimation 2 Video Captioning 2 Video Classification 2 Video Summarization 2 Visual Keyword Spotting 2 Visual Question Answering (VQA) 2 Voice Cloning 2 Zero-Shot Learning 2 Zero-shot Audio Captioning 2 Zero-shot Audio Classification 2 Zero-shot Text to Audio Retrieval 2 audio-visual learning 2 2D Object Detection 1 3D Face Reconstruction 1 3D Facial Expression Recognition 1 3D Human Reconstruction 1 3D Object Detection 1 3D Object Recognition 1 3D Point Cloud Reconstruction 1 Accented Speech Recognition 1 Action Parsing 1 Action Recognition In Videos 1 Action Understanding 1 Active Learning 1 Active Speaker Localization 1 Activity Detection 1 Activity Prediction 1 Anomaly Detection In Surveillance Videos 1 Anxiety Detection 1 Audio Effects Modeling 1 Audio Fingerprint 1 Audio Multiple Target Classification 1 Audio Synthesis 1 Audio-visual Question Answering 1 Audio/Video to Text Retrieval 1 Automatic Lyrics Transcription 1 Automatic Sleep Stage Classification 1 Bandwidth Extension 1 Cadenza 1 - Task 1 - Headphone 1 Cadenza 1 - Task 2 - In Car 1 Caller Detection 1 Chord Recognition 1 Common Sense Reasoning 1 Conversational Response Generation 1 Cross-Lingual ASR 1 Cross-Lingual POS Tagging 1 Cross-Lingual Transfer 1 Cross-lingual zero-shot dependency parsing 1 Dense Video Captioning 1 Dependency Parsing 1 Dialog Act Classification 1 Dialogue Act Classification 1 Dialogue Evaluation 1 Dialogue Generation 1 Dimensionality Reduction 1 Directional Hearing 1 Domain Adaptation 1 Dominance Estimation 1 Drum Transcription 1 ECG Classification 1 Emotional Dialogue Acts 1 Environment Sound Classification 1 Face Clustering 1 Face Detection 1 Fact Checking 1 Federated Learning 1 Few-Shot Learning 1 Fill Mask 1 Fine-Grained Visual Categorization 1 Fine-Grained Visual Recognition 1 Fine-grained Action Recognition 1 Gait Recognition 1 Gunshot Detection 1 Headline Generation 1 Human Interaction Recognition 1 Human Pose Forecasting 1 Humor Detection 1 Image Captioning 1 Image Generation 1 Image Generation from Scene Graphs 1 Image Manipulation 1 Image Retrieval 1 Image-to-Text Retrieval 1 Intent Classification 1 Knowledge Graphs 1 LABELED_DEPENDENCIES 1 LEMMA 1 Learning with noisy labels 1 Link Prediction 1 MORPH 1 Matrix Completion 1 Meeting Summarization 1 Melody Extraction 1 Metric Learning 1 Mobile Security 1 Motion Synthesis 1 Multi-Label Learning 1 Multi-Source Unsupervised Domain Adaptation 1 Multimodal Abstractive Text Summarization 1 Multimodal Activity Recognition 1 Multimodal Reasoning 1 Multimodal Sleep Stage Detection 1 Multiview Detection 1 Music Captioning 1 Music Emotion Recognition 1 Music Genre Classification 1 Music Performance Rendering 1 Music Style Transfer 1 Named Entity Recognition (NER) 1 Natural Language Inference (Few-Shot) 1 Neural Architecture Search 1 Object Categorization 1 Occluded Face Detection 1 Open-Domain Dialog 1 Opinion Mining 1 Optical Flow Estimation 1 POS 1 Part-Of-Speech Tagging 1 Personality Recognition in Conversation 1 Personality Trait Recognition 1 Personalized and Emotional Conversation 1 Physical Commonsense Reasoning 1 Pitch Classification 1 Pose Estimation 1 Prediction Intervals 1 Question Generation 1 Real-time Directional Hearing 1 Retrieval 1 Robot Manipulation 1 SENTS 1 SQL Parsing 1 Sarcasm Detection 1 Scene Graph Detection 1 Scene Recognition 1 Scene-Aware Dialogue 1 Seizure Detection 1 Self-Driving Cars 1 Self-Supervised Audio Classification 1 Semantic Parsing 1 Sentence Embedding 1 Sequential Image Classification 1 Sequential skip prediction 1 Shooter Localization 1 Singer Identification 1 Sleep Stage Detection 1 Speech Intent Classification 1 Speech Synthesis - Assamese 1 Speech Synthesis - Bengali 1 Speech Synthesis - Bodo 1 Speech Synthesis - Gujarati 1 Speech Synthesis - Hindi 1 Speech Synthesis - Kannada 1 Speech Synthesis - Malayalam 1 Speech Synthesis - Manipuri 1 Speech Synthesis - Marathi 1 Speech Synthesis - Rajasthani 1 Speech Synthesis - Tamil 1 Speech Synthesis - Telugu 1 Speech-to-Gesture Translation 1 Speech-to-Speech Translation 1 Supervised Video Summarization 1 Synthetic Speech Detection 1 TAG 1 Task-Oriented Dialogue Systems 1 Temporal Forgery Localization 1 Text Classification 1 Text Generation 1 Text Segmentation 1 Text Summarization 1 Text to Audio/Video Retrieval 1 Time Offset Calibration 1 Time Series Alignment 1 Time Series Analysis 1 Time Series Averaging 1 Time Series Classification 1 Time Series Clustering 1 Token Classification 1 Translation 1 UNLABELED_DEPENDENCIES 1 Unsupervised Video Summarization 1 Urdu Speech Recognition 1 Video Emotion Detection 1 Video Object Segmentation 1 Video Reconstruction 1 Video Synchronization 1 Video-Text Retrieval 1 Voice Anti-spoofing 1 Voice Query Recognition 1 Wikipedia Summarization 1 Word Embeddings 1 Word Translation 1 Zero-Shot Environment Sound Classification 1 Zero-Shot Video Question Answer 1 Zero-Shot Video Retrieval 1 audio-visual event localization 1 speech-recognition 1 video narration captioning 1
Filter by Language
Chinese 1 Abkhazian 0 Achinese 0 Adyghe 0 Afar 0 Afrikaans 0 Akan 0 Akkadian 0 Akuntsu 0 Albanian 0 American Sign Language 0 Amharic 0 Ancient Greek 0 Ancient Hebrew 0 Apurinã 0 Arabic 0 Aragonese 0 Argentine Sign Language 0 Armenian 0 Arpitan 0 Assamese 0 Assyrian Neo-Aramaic 0 Asturian 0 Avaric 0 Aymara 0 Azerbaijani 0 Bambara 0 Bangala 0 Bangladeshi Sign Language 0 Banjar 0 Bashkir 0 Basque 0 Bavarian 0 Belarusian 0 Bemba (Zambia) 0 Bengali 0 Bhojpuri 0 Bishnupriya 0 Bislama 0 Bodo (India) 0 Bosnian 0 Breton 0 Buginese 0 Bulgarian 0 Burmese 0 Catalan 0 Cebuano 0 Central Bikol 0 Central Khmer 0 Central Kurdish 0 Central Pashto 0 Chamorro 0 Chavacano 0 Chechen 0 Cherokee 0 Cheyenne 0 Choctaw 0 Chukot 0 Church Slavic 0 Chuvash 0 Congo Swahili 0 Coptic 0 Cornish 0 Corsican 0 Cree 0 Creek 0 Crimean Tatar 0 Croatian 0 Czech 0 Danish 0 Dhivehi 0 Dimli (individual language) 0 Dogri (individual language) 0 Dogri (macrolanguage) 0 Dutch 0 Dzongkha 0 Eastern Mari 0 Egyptian Arabic 0 English 0 Erzya 0 Esperanto 0 Estonian 0 Ewe 0 Extremaduran 0 Faroese 0 Fiji Hindi 0 Fijian 0 Filipino 0 Finnish 0 Fon 0 French 0 Friulian 0 Fulah 0 Gagauz 0 Galician 0 Gan Chinese 0 Ganda 0 Geez 0 Georgian 0 German 0 German Sign Language 0 Gilaki 0 Goan Konkani 0 Gothic 0 Greek 0 Greek Sign Language 0 Guarani 0 Gujarati 0 Gulf Arabic 0 Haitian 0 Hakha Chin 0 Hakka Chinese 0 Halh Mongolian 0 Hausa 0 Hawaiian 0 Hebrew 0 Herero 0 Hindi 0 Hiri Motu 0 Hungarian 0 Icelandic 0 Ido 0 Igbo 0 Iloko 0 Indonesian 0 Interlingua (International Auxiliary Language Association) 0 Interlingue 0 Inuktitut 0 Inupiaq 0 Iranian Persian 0 Irish 0 Italian 0 Jamaican Creole English 0 Japanese 0 Javanese 0 Jejueo 0 Kabardian 0 Kabuverdianu 0 Kabyle 0 Kachin 0 Kalaallisut 0 Kalmyk 0 Kannada 0 Kanuri 0 Kara-Kalpak 0 Karachay-Balkar 0 Karelian 0 Kashmiri 0 Kashubian 0 Kazakh 0 Khunsari 0 Kikuyu 0 Kinyarwanda 0 Kirghiz 0 Komi 0 Komi-Permyak 0 Komi-Zyrian 0 Kongo 0 Korean 0 Krio 0 Kuanyama 0 Kurdish 0 Kölsch 0 Ladino 0 Lak 0 Lao 0 Latgalian 0 Latin 0 Latvian 0 Lezghian 0 Ligurian 0 Limburgan 0 Lingala 0 Literary Chinese 0 Lithuanian 0 Livvi 0 Lojban 0 Lombard 0 Low German 0 Lower Sorbian 0 Lozi 0 Lunda 0 Luo (Cameroon) 0 Luo (Kenya and Tanzania) 0 Lushai 0 Luxembourgish 0 Macedonian 0 Maithili 0 Malagasy 0 Malay (individual language) 0 Malay (macrolanguage) 0 Malayalam 0 Maltese 0 Mandarin Chinese 0 Manipuri 0 Manx 0 Maori 0 Marathi 0 Marshallese 0 Mazanderani 0 Mbyá Guaraní 0 Mesopotamian Arabic 0 Min Dong Chinese 0 Minangkabau 0 Mingrelian 0 Mirandese 0 Modern Greek 0 Modern Greek (1453-) 0 Moksha 0 Mongolian 0 Moroccan Arabic 0 Multilingual 0 Mundurukú 0 Najdi Arabic 0 Narom 0 Nauru 0 Navajo 0 Naxi 0 Nayini 0 Ndonga 0 Neapolitan 0 Nepali (individual language) 0 Nepali (macrolanguage) 0 Newari 0 Nigerian Fulfulde 0 Nigerian Pidgin 0 North Azerbaijani 0 North Levantine Arabic 0 Northern Frisian 0 Northern Huishui Hmong 0 Northern Kurdish 0 Northern Luri 0 Northern Sami 0 Northern Uzbek 0 Norwegian 0 Norwegian Bokmål 0 Norwegian Nynorsk 0 Novial 0 Nyanja 0 Occitan (post 1500) 0 Odia 0 Official Aramaic (700-300 BCE) 0 Old English (ca. 450-1100) 0 Old French 0 Old Russian 0 Old Turkish 0 Oriya (macrolanguage) 0 Oromo 0 Ossetian 0 Pali 0 Pampanga 0 Pangasinan 0 Papiamento 0 Pedi 0 Pennsylvania German 0 Persian 0 Pfaelzisch 0 Picard 0 Piemontese 0 Pitcairn-Norfolk 0 Plateau Malagasy 0 Polish 0 Pontic 0 Portuguese 0 Portuguse 0 Punjabi 0 Pushto 0 Quechua 0 Rajasthani 0 Romanian 0 Romansh 0 Rundi 0 Russia Buriat 0 Russian 0 Rusyn 0 Saidi Arabic 0 Samoan 0 Sango 0 Sanskrit 0 Santali 0 Sardinian 0 Saterfriesisch 0 Scots 0 Scottish Gaelic 0 Serbian 0 Serbo-Croatian 0 Shan 0 Shona 0 Sichuan Yi 0 Sicilian 0 Silesian 0 Sindhi 0 Sinhala 0 Skolt Sami 0 Slovak 0 Slovenian 0 Soi 0 Somali 0 South Azerbaijani 0 South Levantine Arabic 0 Southern Pashto 0 Southern Sotho 0 Spanish 0 Sranan Tongo 0 Standard Arabic 0 Standard Latvian 0 Sundanese 0 Swahili 0 Swahili (macrolanguage) 0 Swati 0 Swedish 0 Swedish Sign Language 0 Swiss German 0 Swiss-German Sign Language 0 Tagalog 0 Tahitian 0 Tai 0 Tajik 0 Tamil 0 Tatar 0 Telugu 0 Tetum 0 Thai 0 Tibetan 0 Tigrinya 0 Tok Pisin 0 Tonga (Tonga Islands) 0 Tonga (Zambia) 0 Tosk Albanian 0 Tsonga 0 Tswana 0 Tulu 0 Tumbuka 0 Tunisian Arabic 0 Tupinambá 0 Turkish 0 Turkish Sign Language 0 Turkmen 0 Tuvinian 0 Twi 0 Udmurt 0 Uighur 0 Ukrainian 0 Upper Sorbian 0 Urdu 0 Uzbek 0 Venda 0 Venetian 0 Veps 0 Vietnamese 0 Vlaams 0 Vlax Romani 0 Volapük 0 Votic 0 Walloon 0 Waray (Philippines) 0 Warlpiri 0 Welsh 0 West Central Oromo 0 Western Frisian 0 Western Mari 0 Western Panjabi 0 Wolof 0 Wu Chinese 0 Xhosa 0 Yakut 0 Yiddish 0 Yoruba 0 Yue Chinese 0 Zaza 0 Zeeuws 0 Zhuang 0 Zulu 0