no code implementations • 30 Aug 2022 • Shuqiang Cao, Weixin Luo, Bairui Wang, Wei Zhang, Lin Ma
In this paper, we advocate a novel and efficient principle for online action detection.
1 code implementation • ICCV 2019 • Bairui Wang, Lin Ma, Wei Zhang, Wenhao Jiang, Jingwen Wang, Wei Liu
In this paper, we propose to guide video caption generation with Part-of-Speech (POS) information, based on a gated fusion of multiple representations of input videos.
no code implementations • 3 Jun 2019 • Wei Zhang, Bairui Wang, Lin Ma, Wei Liu
Unlike previous video captioning work, which mainly exploits cues from the video content to generate a language description, we propose a reconstruction network (RecNet) in a novel encoder-decoder-reconstructor architecture, which leverages both the forward (video to sentence) and backward (sentence to video) flows for video captioning.
no code implementations • 2 Feb 2019 • Bairui Wang, Lin Ma, Wei Zhang, Wenhao Jiang, Feng Zhang
In this paper, we propose a novel model with a hierarchical photo-scene encoder and a reconstructor for the task of album storytelling.
3 code implementations • CVPR 2018 • Bairui Wang, Lin Ma, Wei Zhang, Wei Liu
Unlike previous video captioning work, which mainly exploits cues from the video content to generate a language description, we propose a reconstruction network (RecNet) with a novel encoder-decoder-reconstructor architecture, which leverages both the forward (video to sentence) and backward (sentence to video) flows for video captioning.