Reasoning

128 benchmarks • 68 tasks • 182 datasets • 4252 papers with code

Classification

Classification

3262 papers with code

Text Classification

1108 papers with code

Graph Classification

382 papers with code

Audio Classification

134 papers with code

Medical Image Classification

124 papers with code

See all 19 tasks

Question Answering

Question Answering

2928 papers with code

Open-Ended Question Answering

209 papers with code

Open-Domain Question Answering

201 papers with code

Conversational Question Answering

62 papers with code

Answer Selection

47 papers with code

See all 19 tasks

Decision Making

Decision Making

2081 papers with code

Imitation Learning

524 papers with code

Natural Language Inference

Natural Language Inference

739 papers with code

Answer Generation

60 papers with code

Visual Entailment

28 papers with code

Cross-Lingual Natural Language Inference

16 papers with code

Logical Reasoning

Navigate

414 papers with code

Logical Reasoning

185 papers with code

Novel Concepts

51 papers with code

Temporal Sequences

51 papers with code

StrategyQA

13 papers with code

See all 23 tasks

Multi-Label Classification

Multi-Label Classification

377 papers with code

Missing Labels

41 papers with code

Extreme Multi-Label Classification

29 papers with code

Hierarchical Multi-label Classification

15 papers with code

Medical Code Prediction

15 papers with code

General Reinforcement Learning

Offline RL

226 papers with code

Model-based Reinforcement Learning

195 papers with code

Conformal Prediction

151 papers with code

Text Simplification

119 papers with code

Music Source Separation

53 papers with code

Decision Making Under Uncertainty

45 papers with code

Audio Source Separation

44 papers with code

See all 9 tasks

Common Sense Reasoning

Common Sense Reasoning

259 papers with code

Physical Commonsense Reasoning

6 papers with code

Riddle Sense

5 papers with code

Winowhy

4 papers with code

Anachronisms

3 papers with code

See all 16 tasks

Visual Reasoning

Visual Reasoning

215 papers with code

Visual Commonsense Reasoning

29 papers with code

Program Synthesis

Program Synthesis

139 papers with code

Type prediction

41 papers with code

Program Repair

35 papers with code

Value prediction

16 papers with code

Enumerative Search

5 papers with code

See all 6 tasks

Mathematical Reasoning

Mathematical Reasoning

119 papers with code

Math Word Problem Solving

63 papers with code

Formal Logic

11 papers with code

Geometry Problem Solving

8 papers with code

Abstract Algebra

3 papers with code

See all 8 tasks

Video Question Answering

Video Question Answering

155 papers with code

Zero-Shot Video Question Answer

34 papers with code

Few-shot Video Question Answering

1 papers with code

Multi-Label Learning

Multi-Label Learning

84 papers with code

Missing Labels

41 papers with code

Mathematical Proofs

Automated Theorem Proving

70 papers with code

Mathematical Proofs

17 papers with code

Arithmetic Reasoning

Arithmetic Reasoning

71 papers with code

Math Word Problem Solving

Math Word Problem Solving

63 papers with code

Mathematical Question Answering

Math Word Problem Solving

63 papers with code

Program Repair

Program Repair

35 papers with code

Fault localization

15 papers with code

Variable misuse

9 papers with code

Exception type

2 papers with code

Function-docstring mismatch

1 papers with code

See all 7 tasks

Systematic Generalization

Systematic Generalization

62 papers with code

Video-based Generative Performance Benchmarking

Video-based Generative Performance Benchmarking (Contextual Understanding)

11 papers with code

Video-based Generative Performance Benchmarking (Consistency)

10 papers with code

Video-based Generative Performance Benchmarking (Correctness of Information)

10 papers with code

Video-based Generative Performance Benchmarking (Detail Orientation))

10 papers with code

Video-based Generative Performance Benchmarking (Temporal Understanding)

10 papers with code

Decision Making Under Uncertainty

Decision Making Under Uncertainty

45 papers with code

Uncertainty Visualization

3 papers with code

Multimodal Reasoning

Multimodal Reasoning

38 papers with code

Natural Language Visual Grounding

Natural Language Visual Grounding

16 papers with code

Generative Visual Question Answering

Video-based Generative Performance Benchmarking

15 papers with code

Discrete Choice Models

Discrete Choice Models

14 papers with code

Causal Identification

Causal Identification

12 papers with code

Odd One Out

Odd One Out

10 papers with code

Geometry Problem Solving

Geometry Problem Solving

8 papers with code

Autonomous Navigation

Sequential Place Recognition

5 papers with code

Autonomous Flight (Dense Forest)

1 papers with code

Autonomous Web Navigation

Abstract Argumentation

Abstract Argumentation

4 papers with code

Analogical Similarity

Analogical Similarity

4 papers with code

Theory of Mind Modeling

Theory of Mind Modeling

4 papers with code

Anachronisms

Anachronisms

3 papers with code

Human Judgment Correlation

Human Judgment Correlation

3 papers with code

Human Judgment Classification

Human Judgment Classification

2 papers with code

Identify Odd Metapor

Identify Odd Metapor

2 papers with code

Commonsense Reasoning for RL

Commonsense Reasoning for RL

1 papers with code

Pre-election ratings estimation

Pre-election ratings estimation

1 papers with code