1 code implementation • 19 Jan 2023 • Xizi Wang, Feng Cheng, Gedas Bertasius, David Crandall
These two contexts are complementary to each other and can help infer the active speaker.
1 code implementation • CVPR 2023 • Feng Cheng, Xizi Wang, Jie Lei, David Crandall, Mohit Bansal, Gedas Bertasius
Furthermore, our model also obtains state-of-the-art video question-answering results on ActivityNet-QA, MSRVTT-QA, MSRVTT-MC and TVQA.
Ranked #2 on Video Retrieval on Condensed Movies (using extra training data)
1 code implementation • 15 Aug 2022 • Satoshi Tsutsui, Xizi Wang, Guangyuan Weng, Yayun Zhang, David Crandall, Chen Yu
We set out to identify properties of training data that lead to action recognition models with greater generalization ability.
no code implementations • 15 Jul 2021 • Xiaomeng Ye, Ziwei Zhao, David Leake, Xizi Wang, David Crandall
Given a pair of cases, the CDH approach attributes the difference in their solutions to the difference in the problems they solve, and generates adaptation rules to adjust solutions accordingly when a retrieved case and new query have similar problem differences.
3 code implementations • 6 Apr 2020 • Yu Yao, Xizi Wang, Mingze Xu, Zelin Pu, Ella Atkins, David Crandall
A new spatial-temporal area under curve (STAUC) evaluation metric is proposed and used with DoTA.