Datasets

5,420 machine learning datasets
Filter by Task
Music Information Retrieval 16, Sound Event Detection 16, Speech Recognition 16, Audio Classification 14, Music Generation 13, Information Retrieval 10, Acoustic Scene Classification 8, Data Augmentation 8, Multi-Task Learning 8, Audio Source Separation 7, Language Modelling 7, Scene Classification 7, Music Source Separation 6, Speech Enhancement 6, Audio Tagging 5, Action Recognition 4, Audio Generation 4, Environmental Sound Classification 4, Music Modeling 4, Video Understanding 4, Action Quality Assessment 3, Anomaly Detection 3, Direction of Arrival Estimation 3, Emotion Recognition 3, Emotion Recognition in Conversation 3, Genre classification 3, Keyword Spotting 3, Lipreading 3, Multi-Label Classification 3, Music Transcription 3, Quantization 3, Question Answering 3, Recommendation Systems 3, Sound Event Localization and Detection 3, Speaker Verification 3, Speech Synthesis 3, Visual Speech Recognition 3, Acoustic echo cancellation 2, Activity Recognition 2, Audio Super-Resolution 2, Audio to Text Retrieval 2, Contrastive Learning 2, Distant Speech Recognition 2, Emotion Classification 2, Image Classification 2, Instrument Recognition 2, Intent Detection 2, Language Identification 2, Lip Reading 2, Multi-task Audio Source Separation 2, Multimodal Deep Learning 2, Multimodal Sentiment Analysis 2, Multiview Learning 2, Music Auto-Tagging 2, Music Classification 2, Music Tagging 2, Open Intent Discovery 2, Open Set Learning 2, Resynthesis 2, Self-Supervised Learning 2, Semantic Segmentation 2, Skills Assessment 2, Skills Evaluation 2, Slot Filling 2, Speaker Recognition 2, Speech Emotion Recognition 2, Speech-to-Text Translation 2, Spoken Language Understanding 2, Spoken language identification 2, Style Transfer 2, Talking Head Generation 2, Text to Audio Retrieval 2, Unconstrained Lip-synchronization 2, Unsupervised Anomaly Detection 2, Video Classification 2, Video Retrieval 2, Visual Keyword Spotting 2, Zero-Shot Environment Sound Classification 2, 3D Point Cloud Reconstruction 1, Abstractive Text Summarization 1, Accented Speech Recognition 1, Action Parsing 1, Action Recognition In Videos 1, Action Understanding 1, Active Learning 1, Activity Detection 1, Activity Prediction 1, Anomaly Detection In Surveillance Videos 1, Anxiety Detection 1, Audio Effects Modeling 1, Audio Fingerprint 1, Audio-Visual Speech Recognition 1, Audio-Visual Synchronization 1, Audio/Video to Text Retrieval 1, Automatic Sleep Stage Classification 1, Bird Audio Detection 1, Chord Recognition 1, Cross-Lingual ASR 1, Cross-Lingual POS Tagging 1, Cross-Lingual Transfer 1, Cross-lingual zero-shot dependency parsing 1, Curriculum Learning 1, DeepFake Detection 1, Dense Video Captioning 1, Dependency Parsing 1, Depression Detection 1, Dimensionality Reduction 1, Domain Adaptation 1, Drum Transcription 1, ECG Classification 1, Face Detection 1, Facial Emotion Recognition 1, Federated Learning 1, Fine-Grained Visual Categorization 1, Fine-Grained Visual Recognition 1, Fine-grained Action Recognition 1, Gait Recognition 1, Human Pose Forecasting 1, Humor Detection 1, Image Captioning 1, Image Generation 1, Image Manipulation 1, Knowledge Graphs 1, Lip to Speech Synthesis 1, Matrix Completion 1, Melody Extraction 1, Metric Learning 1, Mobile Security 1, Multimodal Abstractive Text Summarization 1, Multimodal Activity Recognition 1, Multimodal Emotion Recognition 1, Multimodal Sleep Stage Detection 1, Multiview Detection 1, Music Emotion Recognition 1, Music Genre Classification 1, Music Style Transfer 1, Named Entity Recognition 1, Neural Architecture Search 1, Object Recognition 1, Online Beat Tracking 1, Online Downbeat Tracking 1, Opinion Mining 1, Optical Flow Estimation 1, Part-Of-Speech Tagging 1, Prediction Intervals 1, SQL Parsing 1, Scene Graph Detection 1, Scene Recognition 1, Scene Understanding 1, Scene-Aware Dialogue 1, Seizure Detection 1, Self-Driving Cars 1, Self-Supervised Audio Classification 1, Semantic Parsing 1, Sentence Embedding 1, Sentiment Analysis 1, Sleep Stage Detection 1, Speaker Identification 1, Speaker Separation 1, Speech Dereverberation 1, Speech Quality 1, Speech Separation 1, Speech-to-Gesture Translation 1, Speech-to-Speech Translation 1, Talking Face Generation 1, Task-Oriented Dialogue Systems 1, Text Summarization 1, Text to Audio/Video Retrieval 1, Text-To-Speech Synthesis 1, Time Series 1, Time Series Analysis 1, Time Series Averaging 1, Time Series Classification 1, Time Series Clustering 1, Translation 1, Video Captioning 1, Video Emotion Recognition 1, Video Reconstruction 1, Voice Query Recognition 1, Word Embeddings 1, Zero-Shot Learning 1, audio-visual learning 1
Filter by Language
English 54, Chinese 12, French 10, German 9, Japanese 8, Spanish 8, Arabic 7, Italian 7, Portuguese 7, Russian 7, Persian 6, Dutch 5, Hindi 5, Tamil 5, Turkish 5, Catalan 4, Estonian 4, Indonesian 4, Latvian 4, Slovenian 4, Swedish 4, Welsh 4, Czech 3, Korean 3, Mongolian 3, Romanian 3, Vietnamese 3, Assamese 2, Basque 2, Bengali 2, Breton 2, Bulgarian 2, Finnish 2, Hungarian 2, Irish 2, Kazakh 2, Lithuanian 2, Maltese 2, Marathi 2, Multilingual 2, Odia 2, Polish 2, Slovak 2, Telugu 2, Thai 2, Ukrainian 2, Afrikaans 1, Akkadian 1, Akuntsu 1, Albanian 1, Amharic 1, Ancient Greek 1, Apurinã 1, Armenian 1, Assyrian Neo-Aramaic 1, Bambara 1, Belarusian 1, Bhojpuri 1, Bodo (India) 1, Chukot 1, Church Slavic 1, Chuvash 1, Coptic 1, Croatian 1, Danish 1, Dhivehi 1, Erzya 1, Esperanto 1, Faroese 1, Fon 1, Galician 1, Georgian 1, Gothic 1, Greek 1, Gujarati 1, Hakha Chin 1, Hebrew 1, Icelandic 1, Kabyle 1, Kannada 1, Karelian 1, Khunsari 1, Kinyarwanda 1, Komi-Permyak 1, Komi-Zyrian 1, Latin 1, Literary Chinese 1, Livvi 1, Malayalam 1, Mandarin Chinese 1, Manipuri 1, Manx 1, Mbyá Guaraní 1, Modern Greek 1, Moksha 1, Mundurukú 1, Nayini 1, Nigerian Pidgin 1, Northern Kurdish 1, Northern Sami 1, Norwegian 1, Old French 1, Old Russian 1, Old Turkish 1, Punjabi 1, Rajasthani 1, Russia Buriat 1, Sanskrit 1, Scottish Gaelic 1, Serbian 1, Skolt Sami 1, Soi 1, South Levantine Arabic 1, Swedish Sign Language 1, Swiss German 1, Tagalog 1, Tatar 1, Tupinambá 1, Uighur 1, Upper Sorbian 1, Urdu 1, Uzbek 1, Votic 1, Warlpiri 1, Wolof 1, Yoruba 1, Yue Chinese 1

253 dataset results for Audio