1 code implementation • NeurIPS 2021 • Jie Lei, Tamara Berg, Mohit Bansal
Each video in the dataset is annotated with: (1) a human-written free-form NL query, (2) relevant moments in the video w. r. t.
Ranked #6 on
Video Grounding
on QVHighlights
no code implementations • 24 May 2021 • Filip Radenovic, Animesh Sinha, Albert Gordo, Tamara Berg, Dhruv Mahajan
We study the problem of learning how to predict attribute-object compositions from images, and its generalization to unseen compositions missing from the training data.
1 code implementation • CVPR 2021 • Zihang Meng, Licheng Yu, Ning Zhang, Tamara Berg, Babak Damavandi, Vikas Singh, Amy Bearman
Learning the grounding of each word is challenging, due to noise in the human-provided traces and the presence of words that cannot be meaningfully visually grounded.
no code implementations • 4 Dec 2020 • Utkarsh Mall, Kavita Bala, Tamara Berg, Kristen Grauman
The fashion sense -- meaning the clothing styles people wear -- in a geographical region can reveal information about that region.
no code implementations • ECCV 2020 • Albert Gordo, Filip Radenovic, Tamara Berg
Query expansion is a technique widely used in image search consisting in combining highly ranked images from an original query into an expanded query that is then reissued, generally leading to increased recall and precision.
no code implementations • 3 Aug 2016 • Shan Yang, Tanya Ambert, Zherong Pan, Ke Wang, Licheng Yu, Tamara Berg, Ming C. Lin
Most recent garment capturing techniques rely on acquiring multiple views of clothing, which may not always be readily available, especially in the case of pre-existing photographs from the web.