…NAO contains 7,934 images and 9,943 objects that are unmodified and representative of real-world scenarios, but cause state-of-the-art detection models to misclassify with high confidence.
1 PAPER • 1 BENCHMARK
…contributed with laborious annotation for driver attention (fixation, saccade, focusing time), accident objects/intervals, as well as the accident categories, and superior performance to state-of-the-arts
13 PAPERS • NO BENCHMARKS YET
…It involves two challenging generative and multi-choice alternative selection tasks for the state-of-the-art NLP models to solve.
11 PAPERS • 4 BENCHMARKS
…Results show that state-of-the-art neural models perform far below the human ceiling. The dataset can also serve as a benchmark for reinvestigating logical AI under the deep learning NLP setting.
63 PAPERS • 1 BENCHMARK
…Based on rendered scenes from the open-source Blender movie "Spring", it provides photo-realistic HD datasets with state-of-the-art visual effects and ground truth training data.
20 PAPERS • 3 BENCHMARKS
…In particular, the data is selected to be difficult for state-of-the-art models, including BERT and RoBERTa.
233 PAPERS • 2 BENCHMARKS
Despite recent improvements in open-domain dialogue models, state-of-the-art models are trained and evaluated on short conversations with little context. We find that retrieval-augmented methods and methods with an ability to summarize and recall previous conversations outperform the standard encoder-decoder architectures currently considered state of the art.
2 PAPERS • NO BENCHMARKS YET
V-D4RL provides pixel-based analogues of the popular D4RL benchmarking tasks, derived from the dm_control suite, along with natural extensions of two state-of-the-art online pixel-based continuous control
8 PAPERS • NO BENCHMARKS YET
…For advancing the state-of-the-art in small objects recognition, and by placing the question of object recognition in the context of scene understanding.
10 PAPERS • NO BENCHMARKS YET
…Three state of the art pre-trained image captioning models are used.
The provided dataset consists of high-quality realistic head models and combined EEG/MEG data which can be used for state-of-the-art methods in brain research, such as modern finite element methods (FEM
1 PAPER • NO BENCHMARKS YET
…This dataset is more than 20 times larger than PASCAL3D+ and KITTI, the current state-of-the-art.
17 PAPERS • 14 BENCHMARKS
…dataset that comprises more than 1,400 images from seven image categories relevant to the above research areas, namely Scenes, Advertisements, Visualization and infographics, Objects, Interior design, Art
TDW is a 3D virtual world simulation platform, utilizing state-of-the-art video game engine technology.
…More specifically, there exist 3 distinct benchmark databases; Turath-Standard, Turath-Art, and Turath-UNESCO.
…If trained on FaithDial, state-of-the-art dialogue models are significantly more faithful while also enhancing other dialogue aspects like cooperativeness, creativity and engagement.
12 PAPERS • NO BENCHMARKS YET
…The benchmark facilitates both motion planning researchers who want to compare the performance of a new local planner relative to many other state-of-the-art approaches as well as end users in the mobile
The FIGER dataset is an entity recognition dataset where entities are labelled using a fine-grained system of 112 tags, such as person/doctor, art/written_work and building/hotel.
96 PAPERS • 2 BENCHMARKS
The 2021 Kidney and Kidney Tumor Segmentation Challenge. The state of the art in kidney and kidney tumor segmentation in contrast-enhanced CT imaging: Results of the KiTS19 Challenge.
7 PAPERS • 1 BENCHMARK
…Each image is manually cropped by three expert photographers (graduate students in art whose primary medium is photography) to form three training sets. There are 1,000 photos in the dataset.
3 PAPERS • NO BENCHMARKS YET
…In addition, two state-of-the-art action recognition algorithms are extended to make use of the 3D data, and five new interest point detection strategies that extend to the 3D data are also proposed.
…TUM-VIE includes challenging sequences where state-of-the-art VIO fails or results in large drift. Hence, it can help to push the boundary on event-based visual-inertial algorithms.
…Incorporating state-of-the-art definition generation models, it supports not only Chinese and English, but also Chinese-English cross-lingual queries.
…Furthermore, it integrates state-of-the-art models with standardized and end-to-end pipelines. Overall, OpenGDA provides a user-friendly, scalable and reproducible benchmark
…The average inter-class similarity is sufficiently high to represent a challenge for the current state of the art.
186 PAPERS • 2 BENCHMARKS
…CVSS is derived from the Common Voice speech corpus and the CoVoST 2 speech-to-text translation (ST) corpus, by synthesizing the translation text from CoVoST 2 into speech using state-of-the-art TTS systems
18 PAPERS • NO BENCHMARKS YET
ec-darkpattern is a dataset for dark pattern detection, with baseline detection performance established using state-of-the-art machine learning methods.
…Experiments demonstrate that state-of-the-art models do well when distractors are chosen randomly (~86%), but struggle with carefully chosen distractors (~53%, compared to 90% human accuracy).
SBU-WSD-Corpus consists of 19 Persian documents in different domains such as Sports, Science, Arts, etc.
This dataset contains 304 manual evaluations of class-level software maintainability, drawn from 5 open-source projects: ArgoUML, Art of Illusion, Diary Management, JUnit 4, JSweet.
WeatherBench 2 consists of an open-source evaluation framework, publicly available training, ground truth and baseline data as well as a continuously updated website with the latest metrics and state-of-the-art
5 PAPERS • NO BENCHMARKS YET
…ST models trained with an addition of the corpus obtain new state-of-the-art results on the MuST-C English-German benchmark test set.
The RSNA-ASNR-MICCAI BraTS 2021 challenge utilizes multi-institutional pre-operative baseline multi-parametric magnetic resonance imaging (mpMRI) scans, and focuses on the evaluation of state-of-the-art
…In particular, it focuses on small instances which prove to be challenging for one or more state-of-the-art TSP algorithms.
5 PAPERS • 1 BENCHMARK
…We assess various state-of-the-art baseline techniques, encompassing models for the tasks of semantic segmentation, object detection, and depth estimation.
…It aims to assess the ability of state-of-the-art representation models to reason over cross-lingual lexical-level concept alignment in context for 14 language pairs.
…locust detection to prevent invasion), and art (e.g., recreational art).
106 PAPERS • 2 BENCHMARKS
The detection and localization of highly realistic deepfake audio-visual content are challenging even for the most advanced state-of-the-art methods. The comprehensive benchmark of the proposed dataset utilizing state-of-the-art deepfake detection and localization methods indicates a significant drop in performance compared to previous datasets.
…The four domains are: Art – artistic images in the form of sketches, paintings, ornamentation, etc.; Clipart – collection of clipart images; Product – images of objects without a background and Real-World
921 PAPERS • 11 BENCHMARKS
…Annotations have been gathered on 2 levels of granularity: sentences, and Elementary Discourse Units (EDUs), i.e. sub-sentence clauses produced by a state-of-the-art RST parser. This dataset is intended to
…The point clouds provided are scanned statically with state-of-the-art equipment and contain very fine details.
…A team of behavior experts annotated each video on a frame-by-frame basis for a state-of-the-art study of the neurophysiological mechanisms involved in aggression and courtship in mice.
We address the computer-assisted search for prior art by creating a training dataset for supervised machine learning called PatentMatch.
Falling Things (FAT) is a dataset for advancing the state-of-the-art in object detection and 3D pose estimation in the context of robotics.
6 PAPERS • NO BENCHMARKS YET
…It covers 15 topics, including humanities, entertainment, sports, military, finance, religion, family life, politics, education, digital devices, environment, science, professional development, art and
…Our findings reveal that state-of-the-art pre-trained multi-modal models (e.g., PaLI-X, BLIP2, etc.) face challenges in answering visual information-seeking questions, but fine-tuning on the InfoSeek dataset
15 PAPERS • 2 BENCHMARKS
…Japanese-English) corpus of patent abstracts, extracted from the MAREC patent data, and the data from the NTCIR PatentMT workshop collections, accompanied with relevance judgements for the task of patent prior-art
…pictures of traffic signs with stickers on their surface) that can fool state-of-the-art neural network-based perception systems and clean traffic sign images without any stickers on them.