8 dataset results for face recog AND German

…The segments are of varying length, between 3 and 10 seconds long, and in each clip the only visible face in the video and audible sound in the soundtrack belong to a single speaking person. In total, the dataset contains roughly 4700 hours of video segments with approximately 150,000 distinct speakers, spanning a wide variety of people, languages and face poses.

36 PAPERS • NO BENCHMARKS YET

SAF

SAF (Short Answer Feedback Dataset)

This dataset can be found on HuggingFace: https://huggingface.co/datasets/Short-Answer-Feedback/saf_communication_networks_english https://huggingface.co/datasets/Short-Answer-Feedback/saf_micro_job_german

3 PAPERS • NO BENCHMARKS YET

WikiANN

WikiANN (PAN-X)

…(1) wikiann · Datasets at Hugging Face. https://huggingface.co/datasets/wikiann. (2) wikiann | TensorFlow Datasets. https://tensorflow.google.cn/datasets/catalog/wikiann. (3) wikiann · Datasets at Hugging Face. https://huggingface.co/datasets/wikiann/viewer/en. (4) WikiAnn Dataset | Papers With Code. https://paperswithcode.com/dataset/wikiann-1.

59 PAPERS • 3 BENCHMARKS

MSNER

MSNER (Multilingual Spoken Named Entity Recognition)

…The test set is available on HuggingFace in BIO format: qmeeus/MSNER @inproceedings{MSNER, author = {Meeus, Quentin and Moens, Marie-Francine and Van hamme, Hugo}, booktitle = {20th Joint ACL-ISO Workshop

1 PAPER • NO BENCHMARKS YET

ParCorFull

ParCorFull (Parallel Corpus Annotated with Full Coreference)

…parallel corpus annotated with full coreference chains that has been created to address an important problem that machine translation and other multilingual natural language processing (NLP) technologies face

11 PAPERS • NO BENCHMARKS YET

GermEval

…You can find more information and explore the dataset on the Hugging Face Datasets page ¹. (1) germeval_14 · Datasets at Hugging Face. https://huggingface.co/datasets/germeval_14. (2) GermEval-2018 Corpus (DE) - Empirical Linguistics and ... - heiDATA. https://heidata.uni-heidelberg.de/dataset.xhtml

2 PAPERS • 1 BENCHMARK

BioVid (BioVid Heat Pain Database)

…The pain stimulation experiment was conducted twice: once with un-occluded face and once with facial EMG sensors.

1 PAPER • NO BENCHMARKS YET

GLips (German Lips)

The German Lipreading dataset consists of 250,000 publicly available videos of the faces of speakers of the Hessian Parliament, which was processed for word-level lip reading using an automatic pipeline

5 PAPERS • NO BENCHMARKS YET