…The segments are of varying length, between 3 and 10 seconds long, and in each clip the only visible face in the video and audible sound in the soundtrack belong to a single speaking person. In total, the dataset contains roughly 4700 hours of video segments with approximately 150,000 distinct speakers, spanning a wide variety of people, languages and face poses.
36 PAPERS • NO BENCHMARKS YET
This dataset can be found on HuggingFace: https://huggingface.co/datasets/Short-Answer-Feedback/saf_communication_networks_english and https://huggingface.co/datasets/Short-Answer-Feedback/saf_micro_job_german
3 PAPERS • NO BENCHMARKS YET
…(1) wikiann · Datasets at Hugging Face. https://huggingface.co/datasets/wikiann. (2) wikiann | TensorFlow Datasets. https://tensorflow.google.cn/datasets/catalog/wikiann. (3) wikiann · Datasets at Hugging Face. https://huggingface.co/datasets/wikiann/viewer/en. (4) WikiAnn Dataset | Papers With Code. https://paperswithcode.com/dataset/wikiann-1.
59 PAPERS • 3 BENCHMARKS
…The test set is available on HuggingFace in BIO format: qmeeus/MSNER @inproceedings{MSNER, author = {Meeus, Quentin and Moens, Marie-Francine and Van hamme, Hugo}, booktitle = {20th Joint ACL-ISO Workshop
1 PAPER • NO BENCHMARKS YET
…parallel corpus annotated with full coreference chains that has been created to address an important problem that machine translation and other multilingual natural language processing (NLP) technologies face
11 PAPERS • NO BENCHMARKS YET
…You can find more information and explore the dataset on the Hugging Face Datasets page (1). (1) germeval_14 · Datasets at Hugging Face. https://huggingface.co/datasets/germeval_14. (2) GermEval-2018 Corpus (DE) - Empirical Linguistics and ... - heiDATA. https://heidata.uni-heidelberg.de/dataset.xhtml
2 PAPERS • 1 BENCHMARK
…The pain stimulation experiment was conducted twice: once with an unoccluded face and once with facial EMG sensors.
The German Lipreading dataset consists of 250,000 publicly available videos of the faces of speakers of the Hessian Parliament, which were processed for word-level lip reading using an automatic pipeline
5 PAPERS • NO BENCHMARKS YET