ART consists of over 20k commonsense narrative contexts and 200k explanations.
7 PAPERS • NO BENCHMARKS YET
WikiArt contains paintings from 195 different artists. The dataset has 42129 images for training and 10628 images for testing.
40 PAPERS • 1 BENCHMARK
People-Art is an object detection dataset which consists of people in 43 different styles. People contained in this dataset are quite different from those in common photographs. There are 42 categories of art styles and movements including Naturalism, Cubism, Socialist Realism, Impressionism, and Suprematism.
6 PAPERS • 2 BENCHMARKS
SemArt is a multi-modal dataset for semantic art understanding. It is a collection of fine-art painting images in which each image is associated with a number of attributes and a textual artistic comment, such as those that appear in art catalogues or museum collections. It contains 21,384 samples that provide artistic comments along with fine-art paintings and their attributes for studying semantic art understanding.
9 PAPERS • NO BENCHMARKS YET
ArtEmis is a large-scale dataset aimed at providing a detailed understanding of the interplay between visual content, its emotional effect, and explanations for the latter in language. This dataset focuses on visual art (e.g., paintings, artistic photographs) as it is a prime example of imagery created to elicit emotional responses from its viewers. ArtEmis contains 439K emotion attributions and explanations from humans, on 81K artworks from WikiArt. Paper: ArtEmis: Affective Language for Visual Art
10 PAPERS • NO BENCHMARKS YET
A new dataset of facade images from Paris following the Art Deco style.
1 PAPER • NO BENCHMARKS YET
ArtDL is a novel painting data set for iconography classification composed of images collected from online sources. Most of the paintings are from the Renaissance period and depict scenes or characters of Christian art.
1 PAPER • 1 BENCHMARK
…The IconArt dataset was introduced in the following paper: "Weakly Supervised Object Detection in Artworks", Gonthier et al., ECCV 2018 Workshop Computer Vision for Art Analysis - VISART 2018. https://wsoda.telecom-paristech.fr/ https://zenodo.org/record/4737435
4 PAPERS • 1 BENCHMARK
Presents half a million samples and structured meta-data to encourage further research and societal engagement.
ArtImage is a synthetic dataset of articulated object models from 5 PartNet-Mobility categories, intended for category-level articulated object tasks.
…To fill this gap, we build a large dataset, ClimART, with more than 10 million samples from present, pre-industrial, and future climate conditions, based on the Canadian Earth System Model. ClimART poses several methodological challenges for the ML community, such as multiple out-of-distribution test sets, underlying domain physics, and a trade-off between accuracy and inference speed.
We introduce ArtBench-10, the first class-balanced, high-quality, cleanly annotated, and standardized dataset for benchmarking artwork generation. ArtBench-10 has several advantages over previous artwork datasets. Firstly, it is class-balanced while most previous artwork datasets suffer from long-tailed class distributions. Thirdly, ArtBench-10 is created with standardized data collection, annotation, filtering, and preprocessing procedures.
3 PAPERS • 1 BENCHMARK
The MAMe dataset contains images of high-resolution and variable shape of artworks from 3 different museums: The Metropolitan Museum of Art of New York, The Los Angeles County Museum of Art, and The Cleveland Museum of Art.
2 PAPERS • 1 BENCHMARK
Repository of a generative art dataset by computer artist Andy Lomas.
2 PAPERS • NO BENCHMARKS YET
…It consists of four domains, namely Photo (1,670 images), Art Painting (2,048 images), Cartoon (2,344 images) and Sketch (3,929 images). Each domain contains seven categories.
263 PAPERS • 4 BENCHMARKS
…All paintings are sized 512x512, from the following sources: Princeton University Art Museum (362 paintings), Harvard University Art Museum (101 paintings), Metropolitan Museum of Art (428 paintings), and Smithsonian's Freer Gallery of Art (1,301 paintings).
A dataset for fine-grained art attribute recognition introduced in the 6th FGVC Workshop at CVPR 2019. It is a high-quality artwork image dataset with professional photographs of artworks from The Metropolitan Museum of Art and attribute labels curated or verified by experts.
MetFaces is an image dataset of human faces extracted from works of art. The dataset consists of 1336 high-quality PNG images at 1024×1024 resolution. The images were downloaded via the Metropolitan Museum of Art Collection API, and automatically aligned and cropped using dlib. Various automatic filters were used to prune the set.
29 PAPERS • 2 BENCHMARKS
Mapping of detailed discipline tags to one of three broader disciplines (Arts, Science, Business)
The question-answer (QA) pairs are automatically generated using state-of-the-art question generation methods based on paintings and comments provided in an existing art understanding dataset.
To validate the racial bias of four commercial APIs and four state-of-the-art (SOTA) algorithms.
29 PAPERS • NO BENCHMARKS YET
Orchard is a diagnostic dataset for systematically evaluating hierarchical reasoning in state-of-the-art neural sequence models
Dataset used for the challenge to apply computer vision techniques to art objects (paintings, sculptures, drawings, etc.) from the Rijksmuseum (in Amsterdam, the Netherlands).
0 PAPER • NO BENCHMARKS YET
A new multilingual multi-aspect hate speech analysis dataset, used to test current state-of-the-art multilingual multitask learning approaches.
…The Teller sees an abstract scene containing multiple clip art pieces in a semantically meaningful configuration, while the Drawer tries to reconstruct the scene on an empty canvas using available clip art pieces.
8 PAPERS • NO BENCHMARKS YET
Subsets of BDD100K Dataset that are used in Object Detection Under Rainy Conditions for Autonomous Vehicles: A Review of State-of-the-Art and Emerging Techniques
…There are 397 well-sampled categories to evaluate numerous state-of-the-art algorithms for scene recognition.
11 PAPERS • 2 BENCHMARKS
HellaSwag is a challenge dataset for evaluating commonsense NLI that is especially hard for state-of-the-art models, though its questions are trivial for humans (>95% accuracy).
77 PAPERS • 1 BENCHMARK
Includes 4000 images; 200 from each of 20 categories covering different types of scenes such as Cartoons, Art, Objects, Low resolution images, Indoor, Outdoor, Jumbled, Random, and Line drawings.
48 PAPERS • 1 BENCHMARK
Synscapes is a synthetic dataset for street scene parsing created using photorealistic rendering techniques, and shows state-of-the-art results for training and validation as well as new types of analysis.
28 PAPERS • 1 BENCHMARK
…Features of these videos are extracted with popular state-of-the-art pre-trained models and released for public use. Each video contains audio and visual modalities. Based on the visual information, videos are divided into 24 topics, such as sports, game, arts & entertainment, etc.
111 PAPERS • 2 BENCHMARKS
Glint360K is described as the largest and cleanest face recognition dataset, containing 17,091,657 images of 360,232 individuals. Baseline models trained on Glint360K can easily achieve state-of-the-art performance.
12 PAPERS • NO BENCHMARKS YET
…The stories were generated by two state-of-the-art visual storytelling models, each aligned to 5 human-edited versions.
VocBench is a framework that benchmarks the performance of state-of-the-art neural vocoders.
SlowFlow is an optical flow dataset collected by applying the Slow Flow technique to data from a high-speed camera and analyzing the performance of the state-of-the-art in optical flow under various levels
…It is captured from real surveillance cameras and the person bounding boxes are obtained from a state-of-the-art detection algorithm. The dataset contains 1,717 identities in total.
4 PAPERS • NO BENCHMARKS YET
…It involves two challenging tasks, generative and multi-choice alternative selection, for state-of-the-art NLP models to solve.
1 PAPER • 3 BENCHMARKS
iLur News Texts is a dataset of over 12000 news articles from iLur.am, categorized into 7 classes: sport, politics, weather, economy, accidents, art, society.
The Completion3D benchmark is a dataset for evaluating state-of-the-art 3D Object Point Cloud Completion methods.
26 PAPERS • 1 BENCHMARK
This dataset comprises 1344 expert annotated images of muscle-tendon junctions recorded with 3 ultrasound imaging systems (Aixplorer V6, Esaote MyLab60, Telemed ArtUs), on 2 muscles (Lateral Gastrocnemius
OCTCBVS is a benchmark dataset for testing and evaluating novel and state-of-the-art computer vision algorithms.
ImageNet-R(endition) contains art, cartoons, deviantart, graffiti, embroidery, graphics, origami, paintings, patterns, plastic objects, plush objects, sculptures, sketches, tattoos, toys, and video game
79 PAPERS • 3 BENCHMARKS
…NAO contains 7,934 images and 9,943 objects that are unmodified and representative of real-world scenarios, but cause state-of-the-art detection models to misclassify with high confidence.
…contributed with laborious annotation for driver attention (fixation, saccade, focusing time), accident objects/intervals, as well as the accident categories, and superior performance to state-of-the-arts
…Results show that state-of-the-art neural models perform far worse than the human ceiling. The dataset can also serve as a benchmark for reinvestigating logical AI under the deep learning NLP setting.
22 PAPERS • NO BENCHMARKS YET
…In particular, the data is selected to be difficult for state-of-the-art models, including BERT and RoBERTa.
107 PAPERS • 1 BENCHMARK
…Three state-of-the-art pre-trained image captioning models are used.
…For advancing the state-of-the-art in small objects recognition, and by placing the question of object recognition in the context of scene understanding.