Reasoning
151 benchmarks • 75 tasks • 224 datasets • 5943 papers with code
Classification
Classification
368 benchmarks
3699 papers with code
Text Classification
170 benchmarks
1276 papers with code
Graph Classification
76 benchmarks
466 papers with code
Medical Image Classification
11 benchmarks
174 papers with code
Audio Classification
28 benchmarks
171 papers with code
See all 24 tasks
Question Answering
Question Answering
256 benchmarks
3839 papers with code
Open-Domain Question Answering
15 benchmarks
230 papers with code
Open-Ended Question Answering
223 papers with code
Knowledge Base Question Answering
10 benchmarks
68 papers with code
Conversational Question Answering
1 benchmark
62 papers with code
See all 20 tasks
Decision Making
Decision Making
1 benchmark
2771 papers with code
Imitation Learning
652 papers with code
Text-to-Image Generation
16 benchmarks
455 papers with code
Deblurring
32 benchmarks
392 papers with code
Conformal Prediction
239 papers with code
Face Detection
17 benchmarks
143 papers with code
Text Simplification
11 benchmarks
127 papers with code
See all 20 tasks
Logical Reasoning
Navigate
594 papers with code
Logical Reasoning
21 benchmarks
289 papers with code
Temporal Sequences
1 benchmark
69 papers with code
Novel Concepts
65 papers with code
StrategyQA
20 papers with code
See all 23 tasks
Natural Language Inference
Natural Language Inference
46 benchmarks
809 papers with code
Answer Generation
2 benchmarks
94 papers with code
Visual Entailment
3 benchmarks
32 papers with code
Cross-Lingual Natural Language Inference
4 benchmarks
17 papers with code
Code Generation
Code Generation
35 benchmarks
606 papers with code
Code Translation
2 benchmarks
50 papers with code
Code Documentation Generation
7 benchmarks
6 papers with code
GitHub issue resolution
4 papers with code
Class-level Code Generation
1 benchmark
3 papers with code
See all 6 tasks
Multi-Label Classification
Multi-Label Classification
37 benchmarks
444 papers with code
Missing Labels
48 papers with code
Extreme Multi-Label Classification
31 papers with code
Hierarchical Multi-label Classification
20 benchmarks
19 papers with code
Medical Code Prediction
7 benchmarks
16 papers with code
General Reinforcement Learning
Offline RL
2 benchmarks
291 papers with code
Model-based Reinforcement Learning
223 papers with code
Mathematical Reasoning
Mathematical Reasoning
28 benchmarks
267 papers with code
Math Word Problem Solving
13 benchmarks
79 papers with code
Formal Logic
1 benchmark
15 papers with code
Geometry Problem Solving
11 papers with code
Abstract Algebra
1 benchmark
6 papers with code
See all 8 tasks
Common Sense Reasoning
Common Sense Reasoning
37 benchmarks
310 papers with code
HellaSwag
17 papers with code
Winogrande
12 papers with code
Physical Commonsense Reasoning
1 benchmark
6 papers with code
Riddle Sense
2 benchmarks
5 papers with code
See all 18 tasks
Visual Reasoning
Visual Reasoning
19 benchmarks
293 papers with code
Visual Commonsense Reasoning
7 benchmarks
33 papers with code
Video Question Answering
Video Question Answering
43 benchmarks
225 papers with code
Zero-Shot Video Question Answer
16 benchmarks
62 papers with code
Few-shot Video Question Answering
1 paper with code
Program Synthesis
Program Synthesis
10 benchmarks
167 papers with code
Program Repair
3 benchmarks
50 papers with code
Type prediction
3 benchmarks
44 papers with code
Value prediction
1 benchmark
18 papers with code
Enumerative Search
5 papers with code
See all 6 tasks
Multi-Label Learning
Multi-Label Learning
1 benchmark
91 papers with code
Missing Labels
48 papers with code
ARC
ARC
137 papers with code
Reconstruction
3D Human Reconstruction
10 benchmarks
58 papers with code
Single-View 3D Reconstruction
12 benchmarks
50 papers with code
4D reconstruction
21 papers with code
Single-Image-Based HDR Reconstruction
1 benchmark
4 papers with code
Reconstruction
28 benchmarks
2 papers with code
Mathematical Proofs
Automated Theorem Proving
9 benchmarks
98 papers with code
Mathematical Proofs
9 benchmarks
26 papers with code
Robot Task Planning
Task Planning
84 papers with code
Robot Task Planning
2 benchmarks
21 papers with code
Arithmetic Reasoning
Arithmetic Reasoning
5 benchmarks
104 papers with code
Program Repair
Program Repair
3 benchmarks
50 papers with code
Fault localization
21 papers with code
Variable misuse
11 papers with code
Exception type
2 papers with code
Function-docstring mismatch
1 paper with code
See all 7 tasks
Math Word Problem Solving
Math Word Problem Solving
13 benchmarks
79 papers with code
Mathematical Question Answering
Math Word Problem Solving
13 benchmarks
79 papers with code
Multimodal Reasoning
Multimodal Reasoning
3 benchmarks
79 papers with code
Video-based Generative Performance Benchmarking
Video-based Generative Performance Benchmarking (Contextual Understanding)
1 benchmark
16 papers with code
Video-based Generative Performance Benchmarking (Consistency)
1 benchmark
15 papers with code
Video-based Generative Performance Benchmarking (Correctness of Information)
1 benchmark
15 papers with code
Video-based Generative Performance Benchmarking (Detail Orientation)
1 benchmark
15 papers with code
Video-based Generative Performance Benchmarking (Temporal Understanding)
1 benchmark
15 papers with code
Systematic Generalization
Systematic Generalization
74 papers with code
Decision Making Under Uncertainty
Decision Making Under Uncertainty
52 papers with code
Uncertainty Visualization
5 papers with code
Natural Language Visual Grounding
Natural Language Visual Grounding
1 benchmark
28 papers with code
Generative Visual Question Answering
Video-based Generative Performance Benchmarking
6 benchmarks
20 papers with code
Discrete Choice Models
Discrete Choice Models
16 papers with code
Causal Identification
Causal Identification
14 papers with code
Odd One Out
Odd One Out
1 benchmark
12 papers with code
Geometry Problem Solving
Geometry Problem Solving
11 papers with code
Autonomous Navigation
Sequential Place Recognition
5 papers with code
Autonomous Web Navigation
2 papers with code
Autonomous Flight (Dense Forest)
1 benchmark
1 paper with code
Abstract Argumentation
Abstract Argumentation
5 papers with code
Error Understanding
Error Understanding
2 benchmarks
5 papers with code
Image Paragraph Captioning
Image Paragraph Captioning
1 benchmark
5 papers with code
Theory of Mind Modeling
Theory of Mind Modeling
5 papers with code
Analogical Similarity
Analogical Similarity
1 benchmark
4 papers with code
Anachronisms
Anachronisms
3 papers with code
Assortment Optimization
Assortment Optimization
3 papers with code
Human Judgment Correlation
Human Judgment Correlation
2 benchmarks
3 papers with code
Human Judgment Classification
Human Judgment Classification
1 benchmark
2 papers with code
Identify Odd Metaphor
Identify Odd Metaphor
1 benchmark
2 papers with code
Commonsense Reasoning for RL
Commonsense Reasoning for RL
1 benchmark
1 paper with code
Pre-election ratings estimation
Pre-election ratings estimation
1 paper with code