Search Results for author: Zhimin Zeng

Exploiting Visual Semantic Reasoning for Video-Text Retrieval

Finally, the region features are aggregated to form frame-level features for further encoding to measure video-text similarity.
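
The aggregation step described in this snippet could be sketched roughly as below. This is only an illustration under loud assumptions: mean pooling over regions and frames, and cosine similarity against a text embedding, are stand-ins chosen for simplicity, not the paper's confirmed method. The "further encoding" stage is omitted, and the function names `aggregate_regions` and `video_text_similarity` are hypothetical.

```python
import numpy as np


def aggregate_regions(region_feats):
    """Pool region features into frame-level features.

    region_feats has shape (num_frames, num_regions, dim).
    Mean pooling is an assumption made for this sketch.
    """
    return region_feats.mean(axis=1)  # -> (num_frames, dim)


def video_text_similarity(region_feats, text_emb):
    """Cosine similarity between a pooled video embedding and a text embedding."""
    frame_feats = aggregate_regions(region_feats)
    # Assumption: frames are averaged into one video vector; any learned
    # encoder that would normally sit here is left out.
    video_emb = frame_feats.mean(axis=0)
    numerator = float(video_emb @ text_emb)
    denominator = float(np.linalg.norm(video_emb) * np.linalg.norm(text_emb)) + 1e-8
    return numerator / denominator


# Toy usage with random features: 4 frames, 5 regions per frame, 8-dim features.
rng = np.random.default_rng(0)
regions = rng.normal(size=(4, 5, 8))
text = rng.normal(size=8)
score = video_text_similarity(regions, text)
```

Here `score` is a scalar in [-1, 1]; in a retrieval setting, one such score per video-text pair would be used to rank candidates.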

