1 code implementation • 28 Aug 2023 • Lucas Ventura, Antoine Yang, Cordelia Schmid, Gül Varol
Most composed image retrieval (CoIR) approaches require manually annotated datasets of image-text-image triplets, where the text describes a modification from the query image to the target image.
Ranked #1 on Composed Video Retrieval (CoVR) on WebVid-CoVR
no code implementations • 20 Dec 2020 • Lucas Ventura, Amanda Duarte, Xavier Giro-i-Nieto
Recent work has addressed the generation of human poses represented by 2D/3D coordinates of human joints for sign language.
1 code implementation • CVPR 2021 • Amanda Duarte, Shruti Palaskar, Lucas Ventura, Deepti Ghadiyaram, Kenneth DeHaan, Florian Metze, Jordi Torres, Xavier Giro-i-Nieto
Towards this end, we introduce How2Sign, a multimodal and multiview continuous American Sign Language (ASL) dataset, consisting of a parallel corpus of more than 80 hours of sign language videos and a set of corresponding modalities including speech, English transcripts, and depth.