Search Results for author: Fangtao Shao

Found 3 papers, 1 papers with code

Fine-grained Text-Video Retrieval with Frozen Image Encoders

no code implementations14 Jul 2023 Zuozhuo Dai, Fangtao Shao, Qingkun Su, Zilong Dong, Siyu Zhu

In the second stage, we propose a novel decoupled video text cross attention module to capture fine-grained multimodal information in spatial and temporal dimensions.

Retrieval Video Retrieval

Towards Robust Video Instance Segmentation with Temporal-Aware Transformer

no code implementations20 Jan 2023 Zhenghao Zhang, Fangtao Shao, Zuozhuo Dai, Siyu Zhu

In this paper, we observe the temporal information is important as well and we propose TAFormer to aggregate spatio-temporal features both in transformer encoder and decoder.

Instance Segmentation Semantic Segmentation +1

Cannot find the paper you are looking for? You can Submit a new open access paper.