VLG-Net leverages recent advantages in Graph Neural Networks (GCNs) and leverages a novel multi-modality graph-based fusion method for the task of natural language video grounding.
Source: VLG-Net: Video-Language Graph Matching Network for Video GroundingPaper | Code | Results | Date | Stars |
---|
Task | Papers | Share |
---|---|---|
Moment Retrieval | 2 | 28.57% |
Natural Language Moment Retrieval | 2 | 28.57% |
Graph Matching | 1 | 14.29% |
Temporal Localization | 1 | 14.29% |
Video Grounding | 1 | 14.29% |
Component | Type |
|
---|---|---|
🤖 No Components Found | You can add them if they exist; e.g. Mask R-CNN uses RoIAlign |