Microsoft Research Multimodal Aligned Recipe Corpus

Introduced by Lin et al. in A Recipe for Creating Multimodal Aligned Datasets for Sequential Tasks

To construct the MICROSOFT RESEARCH MULTIMODAL ALIGNED RECIPE CORPUS the authors first extract a large number of text and video recipes from the web. The goal is to find joint alignments between multiple text recipes and multiple video recipes for the same dish. The task is challenging, as different recipes vary in their order of instructions and use of ingredients. Moreover, video instructions can be noisy, and text and video instructions include different levels of specificity in their descriptions.

Source: A Recipe for Creating Multimodal Aligned Datasets for Sequential Tasks

Homepage

Benchmarks

Add a new result Link an existing benchmark

No benchmarks yet. Start a new benchmark or link an existing one.

Papers

Paper	Code	Results	Date	Stars

Dataset Loaders

Add Remove

microsoft/multimodal-aligned-recipe-corpus

Tasks

Similar Datasets

MMED

Microsoft Research Multimodal Aligned Recipe Corpus

Benchmarks

Add a new result Link an existing benchmark

Papers

Dataset Loaders

Add Remove

Tasks

Similar Datasets

MMED

Usage

License

Modalities

Languages

Microsoft Research Multimodal Aligned Recipe Corpus

Benchmarks Edit Add a new result Link an existing benchmark

Papers

Dataset Loaders Edit Add Remove

Tasks Edit

Similar Datasets

MMED

Usage

License Edit

Modalities Edit

Languages Edit

Benchmarks

Add a new result Link an existing benchmark

Dataset Loaders

Add Remove

Tasks

License

Modalities

Languages