Search Results for author: Xizi Wang

Found 5 papers, 4 papers with code

LoCoNet: Long-Short Context Network for Active Speaker Detection

1 code implementation19 Jan 2023 Xizi Wang, Feng Cheng, Gedas Bertasius, David Crandall

These two contexts are complementary to each other and can help infer the active speaker.

VindLU: A Recipe for Effective Video-and-Language Pretraining

1 code implementation CVPR 2023 Feng Cheng, Xizi Wang, Jie Lei, David Crandall, Mohit Bansal, Gedas Bertasius

Furthermore, our model also obtains state-of-the-art video question-answering results on ActivityNet-QA, MSRVTT-QA, MSRVTT-MC and TVQA.

Ranked #2 on Video Retrieval on Condensed Movies (using extra training data)

Question Answering Retrieval +3

Action Recognition based on Cross-Situational Action-object Statistics

1 code implementation15 Aug 2022 Satoshi Tsutsui, Xizi Wang, Guangyuan Weng, Yayun Zhang, David Crandall, Chen Yu

We set out to identify properties of training data that lead to action recognition models with greater generalization ability.

Action Recognition Object +1

Applying the Case Difference Heuristic to Learn Adaptations from Deep Network Features

no code implementations15 Jul 2021 Xiaomeng Ye, Ziwei Zhao, David Leake, Xizi Wang, David Crandall

Given a pair of cases, the CDH approach attributes the difference in their solutions to the difference in the problems they solve, and generates adaptation rules to adjust solutions accordingly when a retrieved case and new query have similar problem differences.

Cannot find the paper you are looking for? You can Submit a new open access paper.