Natural Language Visual Grounding
16 papers with code • 0 benchmarks • 6 datasets
This task has no description! Would you like to contribute one?
Benchmarks
These leaderboards are used to track progress in Natural Language Visual Grounding
No evaluation results yet. Help compare methods by
submitting
evaluation metrics.
Latest papers with no code
Learning to Assemble Neural Module Tree Networks for Visual Grounding
In particular, we develop a novel modular network called Neural Module Tree network (NMTree) that regularizes the visual grounding along the dependency parsing tree of the sentence, where each node is a neural module that calculates visual attention according to its linguistic feature, and the grounding score is accumulated in a bottom-up direction where as needed.