1 code implementation • ACM Multimedia 2019 • Xirong Li; Chaoxi Xu; Gang Yang; Zhineng Chen; Jianfeng Dong
The backbone of our method is the proposed W2VV++ model, an enhanced version of Word2VisualVec (W2VV), which was previously developed for visual-to-text matching.
Ranked #3 on Ad-hoc video search on TRECVID-AVS18 (IACC.3) (using extra training data)