TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Temporal Sentence Grounding	Charades-STA	MMN (Full, MViT-K400-Pretrain-feature, evaluated by AdaFocus)	R1@0.5	55.2	# 3
Temporal Sentence Grounding	Charades-STA	MMN (Full, MViT-K400-Pretrain-feature, evaluated by AdaFocus)	R1@0.7	32.2	# 3
Temporal Sentence Grounding	Charades-STA	MMN (Full, MViT-K400-Pretrain-feature, evaluated by AdaFocus)	R5@0.7	62.7	# 3
Temporal Sentence Grounding	Charades-STA	MMN (Full, MViT-K400-Pretrain-feature, evaluated by AdaFocus)	R5@0.5	88.3	# 2
Temporal Sentence Grounding	Charades-STA	MMN (Full, I3D-K400-Pretrain-feature, evaluated by AdaFocus)	R1@0.5	49.4	# 6
Temporal Sentence Grounding	Charades-STA	MMN (Full, I3D-K400-Pretrain-feature, evaluated by AdaFocus)	R1@0.7	29.8	# 4
Temporal Sentence Grounding	Charades-STA	MMN (Full, I3D-K400-Pretrain-feature, evaluated by AdaFocus)	R5@0.7	60.5	# 4
Temporal Sentence Grounding	Charades-STA	MMN (Full, I3D-K400-Pretrain-feature, evaluated by AdaFocus)	R5@0.5	85.8	# 5

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/negative-sample-matters-a-renaissance-of/temporal-sentence-grounding-on-charades-sta)](https://paperswithcode.com/sota/temporal-sentence-grounding-on-charades-sta?p=negative-sample-matters-a-renaissance-of)`

Negative Sample Matters: A Renaissance of Metric Learning for Temporal Grounding

10 Sep 2021 · Zhenzhi Wang, LiMin Wang, Tao Wu, TianHao Li, Gangshan Wu ·

Temporal grounding aims to localize a video moment which is semantically aligned with a given natural language query. Existing methods typically apply a detection or regression pipeline on the fused representation with the research focus on designing complicated prediction heads or fusion strategies. Instead, from a perspective on temporal grounding as a metric-learning problem, we present a Mutual Matching Network (MMN), to directly model the similarity between language queries and video moments in a joint embedding space. This new metric-learning framework enables fully exploiting negative samples from two new aspects: constructing negative cross-modal pairs in a mutual matching scheme and mining negative pairs across different videos. These new negative samples could enhance the joint representation learning of two modalities via cross-modal mutual matching to maximize their mutual information. Experiments show that our MMN achieves highly competitive performance compared with the state-of-the-art methods on four video grounding benchmarks. Based on MMN, we present a winner solution for the HC-STVG challenge of the 3rd PIC workshop. This suggests that metric learning is still a promising method for temporal grounding via capturing the essential cross-modal correlation in a joint embedding space. Code is available at https://github.com/MCG-NJU/MMN.

PDF Abstract

Code

Add Remove Mark official

mcg-nju/mmn official

aim3-ruc/youmakeup_challenge2022

Tasks

Add Remove

Metric Learning

Representation Learning

Temporal Sentence Grounding

Video Grounding

Datasets

ActivityNet

Charades

ActivityNet Captions

Charades-STA

Results from the Paper

Edit

Ranked #3 on Temporal Sentence Grounding on Charades-STA

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Temporal Sentence Grounding	Charades-STA	MMN (Full, MViT-K400-Pretrain-feature, evaluated by AdaFocus)	R1@0.5	55.2	# 3	Compare
			R1@0.7	32.2	# 3	Compare
			R5@0.7	62.7	# 3	Compare
			R5@0.5	88.3	# 2	Compare
Temporal Sentence Grounding	Charades-STA	MMN (Full, I3D-K400-Pretrain-feature, evaluated by AdaFocus)	R1@0.5	49.4	# 6	Compare
			R1@0.7	29.8	# 4	Compare
			R5@0.7	60.5	# 4	Compare
			R5@0.5	85.8	# 5	Compare

Methods

Add Remove

No methods listed for this paper. Add relevant methods here

Edit Social Preview

Negative Sample Matters: A Renaissance of Metric Learning for Temporal Grounding

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove