6 dataset results for Text Classification AND Images

MNIST

The MNIST database (Modified National Institute of Standards and Technology database) is a large collection of handwritten digits. It has a training set of 60,000 examples and a test set of 10,000 examples. It is a subset of the larger NIST Special Database 3 (digits written by employees of the United States Census Bureau) and Special Database 1 (digits written by high school students), which contain monochrome images of handwritten digits. The digits have been size-normalized and centered in a fixed-size image. The original black and white (bilevel) images from NIST were size-normalized to fit in a 20x20 pixel box while preserving their aspect ratio. The resulting images contain grey levels as a result of the anti-aliasing technique used by the normalization algorithm. The images were then centered in a 28x28 image by computing the center of mass of the pixels and translating the image so as to position this point at the center of the 28x28 field.
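The centering step described above can be sketched in a few lines of Python. This is a minimal illustration, not the original MNIST preprocessing code: the function name `center_by_mass`, the whole-pixel rounding of the shift, and the choice of (13.5, 13.5) as the center of a 28x28 field are all assumptions, and the anti-aliased resize to the 20x20 box is not shown.

```python
def center_by_mass(glyph, field=28):
    """Place a small grayscale glyph (a list of rows) in a field x field image
    so that its intensity-weighted center of mass lands near the field center.

    Hypothetical helper for illustration only.
    """
    total = sum(sum(row) for row in glyph)
    if total == 0:
        raise ValueError("glyph contains no ink")

    # Intensity-weighted mean row and column of the glyph.
    cy = sum(r * v for r, row in enumerate(glyph) for v in row) / total
    cx = sum(c * v for row in glyph for c, v in enumerate(row)) / total

    # Whole-pixel shift that moves the center of mass to the field center
    # (assumed to be (field - 1) / 2 along each axis).
    center = (field - 1) / 2
    dy = round(center - cy)
    dx = round(center - cx)

    out = [[0] * field for _ in range(field)]
    for r, row in enumerate(glyph):
        for c, v in enumerate(row):
            rr, cc = r + dy, c + dx
            # Pixels shifted outside the field are dropped.
            if v and 0 <= rr < field and 0 <= cc < field:
                out[rr][cc] = v
    return out


# Usage: a 20x20 glyph with ink only in its top-left corner.
glyph = [[0] * 20 for _ in range(20)]
glyph[0][0] = 255
glyph[1][1] = 255
image = center_by_mass(glyph)
```

Because the shift is rounded to whole pixels, the center of mass can end up as much as half a pixel from the exact center. A subpixel translation with interpolation would avoid that, at the cost of resampling the grey levels.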

6,989 PAPERS • 52 BENCHMARKS

MuMiN

MuMiN is a misinformation graph dataset containing rich social media data (tweets, replies, users, images, articles, hashtags), spanning 21 million tweets belonging to 26 thousand Twitter threads, each of which has been semantically linked to 13 thousand fact-checked claims across dozens of topics, events and domains, in 41 different languages, spanning more than a decade.

4 PAPERS • 3 BENCHMARKS

MuMiN-large

This is the large version of the MuMiN dataset.

1 PAPER • 1 BENCHMARK

MuMiN-medium

This is the medium version of the MuMiN dataset.

1 PAPER • 1 BENCHMARK

MuMiN-small

This is the small version of the MuMiN dataset.

1 PAPER • 1 BENCHMARK

Indian Number Plates Dataset | Vehicle Number Plates | English OCR Detection

This dataset is an extremely challenging set of more than 20,000 original number plate images captured and crowdsourced from more than 700 urban and rural areas, where each image is manually reviewed and verified by computer vision professionals at Datacluster Labs.

0 PAPERS • NO BENCHMARKS YET
