no code implementations • LREC 2020 • Taichi Nishimura, Suzushi Tomori, Hayato Hashimoto, Atsushi Hashimoto, Yoko Yamakata, Jun Harashima, Yoshitaka Ushiku, Shinsuke Mori
Visual grounding is provided as bounding boxes to image sequences of recipes, and each bounding box is linked to an element of the workflow.