Reasoning

136 benchmarks • 69 tasks • 183 datasets • 4315 papers with code

Classification

Classification

3288 papers with code

Text Classification

1119 papers with code

Graph Classification

384 papers with code

Audio Classification

138 papers with code

Medical Image Classification

125 papers with code

Question Answering

Question Answering

2952 papers with code

Open-Ended Question Answering

209 papers with code

Open-Domain Question Answering

202 papers with code

Conversational Question Answering

62 papers with code

Answer Selection

47 papers with code

Decision Making

Decision Making

2104 papers with code

Imitation Learning

527 papers with code

Natural Language Inference

Natural Language Inference

745 papers with code

Answer Generation

60 papers with code

Visual Entailment

29 papers with code

Cross-Lingual Natural Language Inference

16 papers with code

Logical Reasoning

Navigate

421 papers with code

Logical Reasoning

186 papers with code

Novel Concepts

51 papers with code

Temporal Sequences

51 papers with code

StrategyQA

13 papers with code

Multi-Label Classification

Multi-Label Classification

379 papers with code

Missing Labels

41 papers with code

Extreme Multi-Label Classification

29 papers with code

Hierarchical Multi-label Classification

15 papers with code

Medical Code Prediction

15 papers with code

General Reinforcement Learning

Offline RL

227 papers with code

Model-based Reinforcement Learning

197 papers with code

Conformal Prediction

156 papers with code

Text Simplification

120 papers with code

Music Source Separation

53 papers with code

Decision Making Under Uncertainty

45 papers with code

Audio Source Separation

44 papers with code

Code Generation

Code Generation

351 papers with code

Code Translation

40 papers with code

Code Documentation Generation

6 papers with code

Class-level Code Generation

2 papers with code

Library-Oriented Code Generation

2 papers with code

Common Sense Reasoning

Common Sense Reasoning

261 papers with code

Physical Commonsense Reasoning

6 papers with code

Riddle Sense

5 papers with code

Winowhy

4 papers with code

Anachronisms

3 papers with code

Visual Reasoning

Visual Reasoning

215 papers with code

Visual Commonsense Reasoning

29 papers with code

Program Synthesis

Program Synthesis

140 papers with code

Type prediction

41 papers with code

Program Repair

35 papers with code

Value prediction

16 papers with code

Enumerative Search

5 papers with code

Mathematical Reasoning

Mathematical Reasoning

121 papers with code

Math Word Problem Solving

63 papers with code

Formal Logic

12 papers with code

Geometry Problem Solving

8 papers with code

Abstract Algebra

3 papers with code

Video Question Answering

Video Question Answering

158 papers with code

Zero-Shot Video Question Answer

36 papers with code

Few-shot Video Question Answering

1 paper with code

Multi-Label Learning

Multi-Label Learning

84 papers with code

Missing Labels

41 papers with code

Mathematical Proofs

Automated Theorem Proving

70 papers with code

Mathematical Proofs

17 papers with code

Arithmetic Reasoning

Arithmetic Reasoning

72 papers with code

Math Word Problem Solving

Math Word Problem Solving

63 papers with code

Mathematical Question Answering

Math Word Problem Solving

63 papers with code

Program Repair

Program Repair

35 papers with code

Fault localization

15 papers with code

Variable misuse

9 papers with code

Exception type

2 papers with code

Function-docstring mismatch

1 paper with code

Systematic Generalization

Systematic Generalization

62 papers with code

Video-based Generative Performance Benchmarking

Video-based Generative Performance Benchmarking (Contextual Understanding)

11 papers with code

Video-based Generative Performance Benchmarking (Consistency)

10 papers with code

Video-based Generative Performance Benchmarking (Correctness of Information)

10 papers with code

Video-based Generative Performance Benchmarking (Detail Orientation)

10 papers with code

Video-based Generative Performance Benchmarking (Temporal Understanding)

10 papers with code

Decision Making Under Uncertainty

Decision Making Under Uncertainty

45 papers with code

Uncertainty Visualization

3 papers with code

Multimodal Reasoning

Multimodal Reasoning

38 papers with code

Generative Visual Question Answering

Video-based Generative Performance Benchmarking

16 papers with code

Natural Language Visual Grounding

Natural Language Visual Grounding

16 papers with code

Discrete Choice Models

Discrete Choice Models

14 papers with code

Causal Identification

Causal Identification

12 papers with code

Odd One Out

Odd One Out

10 papers with code

Geometry Problem Solving

Geometry Problem Solving

8 papers with code

Autonomous Navigation

Sequential Place Recognition

5 papers with code

Autonomous Flight (Dense Forest)

1 paper with code

Autonomous Web Navigation

Abstract Argumentation

Abstract Argumentation

4 papers with code

Analogical Similarity

Analogical Similarity

4 papers with code

Theory of Mind Modeling

Theory of Mind Modeling

4 papers with code

Anachronisms

Anachronisms

3 papers with code

Human Judgment Correlation

Human Judgment Correlation

3 papers with code

Human Judgment Classification

Human Judgment Classification

2 papers with code

Identify Odd Metaphor

Identify Odd Metaphor

2 papers with code

Commonsense Reasoning for RL

Commonsense Reasoning for RL

1 paper with code

Pre-election ratings estimation

Pre-election ratings estimation

1 paper with code