Datasets

10,259 machine learning datasets
Filter by Task
Machine Translation 7 Question Answering 6 Natural Language Understanding 4 Speech Recognition 4 Cross-Lingual NER 3 Cross-Lingual Transfer 3 Language Modelling 3 Sentiment Analysis 3 Cross-Lingual POS Tagging 2 Language Identification 2 Misinformation 2 Multilingual text classification 2 Named Entity Recognition (NER) 2 Open-Domain Question Answering 2 Part-Of-Speech Tagging 2 Reading Comprehension 2 Text Summarization 2 Token Classification 2 Word Embeddings 2 Aspect-Based Sentiment Analysis (ABSA) 1 Audio Classification 1 Audio Emotion Recognition 1 Automatic Speech Recognition (ASR) 1 Chinese Reading Comprehension 1 Classification 1 Croatian Text Diacritization 1 Cross-Lingual ASR 1 Cross-Lingual Natural Language Inference 1 Cross-lingual zero-shot dependency parsing 1 Czech Text Diacritization 1 Data Augmentation 1 Dependency Parsing 1 Document Classification 1 Document Summarization 1 Document Translation 1 Emotion Classification 1 Emotion Recognition 1 Entity Alignment 1 Fact Verification 1 Fake News Detection 1 Few-Shot Audio Classification 1 Few-shot NER 1 French Text Diacritization 1 Hungarian Text Diacritization 1 Image Classification 1 Irish Text Diacritization 1 Keyphrase Extraction 1 Keyphrase Generation 1 LABELED_DEPENDENCIES 1 LEMMA 1 Latvian Text Diacritization 1 MORPH 1 Machine Reading Comprehension 1 Multilingual Machine Comprehension in English Hindi 1 Multilingual NLP 1 Multilingual Named Entity Recognition 1 Multiple-choice 1 NER 1 NMT 1 Natural Language Inference 1 Natural Questions 1 News Retrieval 1 Node Classification 1 POS 1 Paper generation 1 Polish Text Diacritization 1 Pretrained Multilingual Language Models 1 Punctuation Restoration 1 Reading Comprehension (Few-Shot) 1 Reading Comprehension (One-Shot) 1 Reading Comprehension (Zero-Shot) 1 Relation Classification 1 Relation Extraction 1 Romanian Text Diacritization 1 SENTS 1 Semantic Composition 1 Sentence Embeddings 1 Slovak Text Diacritization 1 Spanish Text Diacritization 1 Speaker Identification 1 Speaker Verification 1 Speech Emotion Recognition 1 Speech Synthesis 1 TAG 1 Text Classification 1 Text Generation 1 Text Retrieval 1 Translation 1 Turkish Text Diacritization 1 UNLABELED_DEPENDENCIES 1 Unfairness Detection 1 Urdu Speech Recognition 1 Vietnamese Machine Reading Comprehension 1 Vietnamese Text Diacritization 1 XLM-R 1 Zero-shot Cross-lingual Fact-checking 1 text annotation 1
Filter by Language (clear)
Polish English 3090 Chinese 342 German 174 French 161 Spanish 132 Russian 116 Arabic 90 Japanese 89 Italian 84 Portuguese 82 Hindi 72 Vietnamese 62 Korean 59 Turkish 53 Bengali 49 Persian 48 Dutch 45 Tamil 43 Czech 38 Indonesian 37 Danish 34 Finnish 33 Romanian 33 Telugu 31 Multilingual 29 Urdu 29 Swedish 28 Hungarian 26 Thai 26 Marathi 25 Greek 24 Estonian 23 Gujarati 23 Hebrew 22 Malayalam 22 Mandarin Chinese 22 Bulgarian 21 Basque 19 Kannada 18 Catalan 17 Latvian 17 Punjabi 17 Slovak 17 Swahili 17 Slovenian 16 Ukrainian 16 Croatian 15 Lithuanian 15 Kazakh 14 Norwegian 14 Serbian 14 Amharic 13 Assamese 12 Iranian Persian 12 Kurdish 12 Albanian 11 Irish 10 Maltese 10 Welsh 10 Yoruba 10 Armenian 9 Burmese 9 Hausa 9 Igbo 9 Macedonian 9 Mongolian 9 Oriya (macrolanguage) 9 American Sign Language 8 Georgian 8 Odia 8 Sanskrit 8 Sinhala 8 Tagalog 8 Azerbaijani 7 Bambara 7 Breton 7 Filipino 7 Icelandic 7 Malagasy 7 Nepali (macrolanguage) 7 Oromo 7 Serbo-Croatian 7 Somali 7 Uzbek 7 Wolof 7 Afrikaans 6 Central Khmer 6 Central Kurdish 6 Esperanto 6 Galician 6 Ganda 6 Guarani 6 Haitian 6 Nigerian Pidgin 6 Sindhi 6 Tigrinya 6 Western Panjabi 6 Belarusian 5 Egyptian Arabic 5 Fon 5 Javanese 5 Latin 5 Lingala 5 Malay (individual language) 5 Norwegian Bokmål 5 Norwegian Nynorsk 5 Quechua 5 Scottish Gaelic 5 Standard Arabic 5 Sundanese 5 Tibetan 5 Tswana 5 Aymara 4 Bangala 4 Bosnian 4 Cebuano 4 Chechen 4 Dhivehi 4 Ewe 4 Fulah 4 Iloko 4 Kabyle 4 Kinyarwanda 4 Kirghiz 4 Lao 4 Luo (Kenya and Tanzania) 4 Nyanja 4 South Azerbaijani 4 Tatar 4 Twi 4 Upper Sorbian 4 Xhosa 4 Zulu 4 Aragonese 3 Bashkir 3 Bavarian 3 Bishnupriya 3 Chuvash 3 Erzya 3 Faroese 3 Goan Konkani 3 Maithili 3 Malay (macrolanguage) 3 Romansh 3 Rundi 3 Russia Buriat 3 Shona 3 Swati 3 Swiss German 3 Tajik 3 Tetum 3 Tsonga 3 Uighur 3 Waray (Philippines) 3 Yiddish 3 Argentine Sign Language 2 Asturian 2 Avaric 2 Bangladeshi Sign Language 2 Bhojpuri 2 Central Bikol 2 Cherokee 2 Church Slavic 2 Cornish 2 Dimli (individual language) 2 Eastern Mari 2 German Sign Language 2 Gothic 2 Gulf Arabic 2 Ido 2 Interlingue 2 Inuktitut 2 Jejueo 2 Kalaallisut 2 Kalmyk 2 Karachay-Balkar 2 Komi 2 Komi-Permyak 2 Lezghian 2 Limburgan 2 Livvi 2 Lojban 2 Lombard 2 Low German 2 Lower Sorbian 2 Luxembourgish 2 Manipuri 2 Manx 2 Maori 2 Mazanderani 2 Minangkabau 2 Mingrelian 2 Mirandese 2 Modern Greek 2 Moksha 2 Moroccan Arabic 2 Mossi 2 Naxi 2 Neapolitan 2 Nepali (individual language) 2 Newari 2 Northern Frisian 2 Northern Kurdish 2 Northern Luri 2 Northern Sami 2 Occitan (post 1500) 2 Ossetian 2 Pampanga 2 Pedi 2 Piemontese 2 Pushto 2 Sardinian 2 Sichuan Yi 2 Sicilian 2 Southern Sotho 2 Swiss-German Sign Language 2 Tai 2 Tosk Albanian 2 Turkish Sign Language 2 Turkmen 2 Tuvinian 2 Udmurt 2 Venetian 2 Volapük 2 Walloon 2 Western Frisian 2 Western Mari 2 Wu Chinese 2 Yakut 2 Yue Chinese 2 Abkhazian 1 Achinese 1 Adyghe 1 Afar 1 Akan 1 Akkadian 1 Akuntsu 1 Ambonese Malay 1 Ancient Greek 1 Ancient Hebrew 1 Apurinã 1 Arpitan 1 Assyrian Neo-Aramaic 1 Banjar 1 Bemba (Zambia) 1 Bislama 1 Bodo (India) 1 Buginese 1 Central Pashto 1 Chamorro 1 Chavacano 1 Cheyenne 1 Choctaw 1 Chukot 1 Congo Swahili 1 Coptic 1 Corsican 1 Cree 1 Creek 1 Crimean Tatar 1 Dogri (macrolanguage) 1 Dzongkha 1 Extremaduran 1 Fiji Hindi 1 Fijian 1 French Sign Language 1 Friulian 1 Gagauz 1 Gan Chinese 1 Geez 1 Gilaki 1 Greek Sign Language 1 Hakha Chin 1 Hakka Chinese 1 Halh Mongolian 1 Hawaiian 1 Herero 1 Hiri Motu 1 Interlingua (International Auxiliary Language Association) 1 Inupiaq 1 Jamaican Creole English 1 Kabardian 1 Kabuverdianu 1 Kachin 1 Kanuri 1 Kara-Kalpak 1 Karelian 1 Kashmiri 1 Kashubian 1 Khunsari 1 Kikuyu 1 Komi-Zyrian 1 Kongo 1 Krio 1 Kuanyama 1 Kupang Malay 1 Kölsch 1 Ladino 1 Lak 1 Latgalian 1 Ligurian 1 Literary Chinese 1 Lozi 1 Lunda 1 Luo (Cameroon) 1 Lushai 1 Makasar 1 Malayic Dayak 1 Marshallese 1 Mbyá Guaraní 1 Mesopotamian Arabic 1 Min Dong Chinese 1 Modern Greek (1453-) 1 Mundurukú 1 Najdi Arabic 1 Narom 1 Nauru 1 Navajo 1 Nayini 1 Ndonga 1 Nigerian Fulfulde 1 North Azerbaijani 1 North Levantine Arabic 1 Northern Uzbek 1 Novial 1 Official Aramaic (700-300 BCE) 1 Old English (ca. 450-1100) 1 Old French 1 Old Russian 1 Old Turkish 1 Pali 1 Pangasinan 1 Papiamento 1 Pennsylvania German 1 Pfaelzisch 1 Picard 1 Pitcairn-Norfolk 1 Plateau Malagasy 1 Pontic 1 Rajasthani 1 Rusyn 1 Samoan 1 Sango 1 Saterfriesisch 1 Scots 1 Shan 1 Silesian 1 Skolt Sami 1 Soi 1 South Levantine Arabic 1 Southern Pashto 1 Sranan Tongo 1 Standard Latvian 1 Swahili (macrolanguage) 1 Swedish Sign Language 1 Tahitian 1 Tok Pisin 1 Tonga (Tonga Islands) 1 Tonga (Zambia) 1 Tulu 1 Tumbuka 1 Tunisian Arabic 1 Tupinambá 1 Uab Meto 1 Venda 1 Veps 1 Vlaams 1 Vlax Romani 1 Votic 1 Warlpiri 1 West Central Oromo 1 Zaza 1 Zeeuws 1 Zhuang 1 Dogri (individual language) 0 Northern Huishui Hmong 0 Portuguse 0 Saidi Arabic 0 Santali 0

40 dataset results for Polish