Search Results for author: Benjamin Z. Yao

Found 1 paper, 0 papers with code

VidLA: Video-Language Alignment at Scale

no code implementations • 21 Mar 2024 • Mamshad Nayeem Rizve, Fan Fei, Jayakrishnan Unnikrishnan, Son Tran, Benjamin Z. Yao, Belinda Zeng, Mubarak Shah, Trishul Chilimbi

To effectively address this limitation, we instead keep the network architecture simple and use a set of data tokens that operate at different temporal resolutions in a hierarchical manner, accounting for the temporally hierarchical nature of videos.

Language Modelling, Visual Grounding
