V-COCO (Verbs in COCO)

Introduced by Gupta et al. in Visual Semantic Role Labeling

Verbs in COCO (V-COCO) is a dataset that builds on COCO for human-object interaction detection. V-COCO provides 10,346 images (2,533 for training, 2,867 for validation, and 4,946 for testing) and 16,199 person instances. Each person is annotated for 29 action categories; there are no interaction labels that include objects.

Source: Visual Compositional Learning for Human-Object Interaction Detection
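As a quick sanity check on the figures quoted above, the following minimal sketch confirms that the three splits sum to the stated image total and reports each split's share. The dictionary and function names are illustrative assumptions and are not part of any V-COCO tooling; only the counts come from the description.

```python
# Image counts per split, copied from the dataset description above.
# The names below are hypothetical and chosen for illustration only.
VCOCO_SPLITS = {"train": 2533, "val": 2867, "test": 4946}
TOTAL_IMAGES = 10346       # stated total number of images
PERSON_INSTANCES = 16199   # stated number of annotated person instances


def split_fractions(splits):
    """Return each split's share of the summed image count."""
    total = sum(splits.values())
    return {name: count / total for name, count in splits.items()}


# The per-split counts should add up to the stated total.
assert sum(VCOCO_SPLITS.values()) == TOTAL_IMAGES

fractions = split_fractions(VCOCO_SPLITS)
for name, frac in fractions.items():
    print(f"{name}: {VCOCO_SPLITS[name]} images ({frac:.1%})")

# Average number of annotated persons per image across the whole dataset.
print(f"persons per image: {PERSON_INSTANCES / TOTAL_IMAGES:.2f}")
```

The test split holds nearly half of the images, which is worth keeping in mind when comparing results that were tuned on the comparatively small training and validation splits.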


Benchmarks


Task	Dataset Variant	Best Model
Human-Object Interaction Detection	V-COCO	RLIPv2


Dataset Loaders


s-gupta/v-coco (144 stars)

Tasks

Human-Object Interaction Detection

Similar Datasets

HAKE

Ambiguous-HOI

HICO

HICO-DET

Source: https://www.researchgate.net/figure/Pose-estimation-and-action-recognition-results-on-the-V-COCO-Dataset-16-which-has_fig9_339477856.
