Temporally-Weighted Hierarchical Clustering for Unsupervised Action Segmentation

Action segmentation refers to inferring the boundaries of semantically consistent visual concepts in videos and is an important requirement for many video understanding tasks. Supervised approaches to these tasks have achieved encouraging performance but require a high volume of detailed frame-level annotations. We present a fully automatic and unsupervised approach for segmenting actions in a video that does not require any training. Our proposal is an effective temporally-weighted hierarchical clustering algorithm that groups semantically consistent frames of the video. Our main finding is that representing a video as a 1-nearest-neighbor graph that accounts for the progression of time is sufficient to form semantically and temporally consistent clusters of frames, where each cluster may represent some action in the video. Additionally, we establish strong unsupervised baselines for action segmentation and show significant performance improvements over published unsupervised methods on five challenging action segmentation datasets. Our code is available at https://github.com/ssarfraz/FINCH-Clustering/tree/master/TW-FINCH
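To make the core idea concrete, below is a minimal sketch of one temporally-weighted 1-nearest-neighbor clustering pass. It assumes L2-normalizable frame features and frame timestamps normalized to [0, 1]; the function name `tw_finch_step` and the exact weighting (feature distance multiplied by temporal distance) are illustrative simplifications, not the authors' reference implementation, which lives in the linked repository.

```python
import numpy as np
from scipy.sparse import csr_matrix
from scipy.sparse.csgraph import connected_components

def tw_finch_step(features, timestamps):
    """One clustering pass: link every frame to its temporally-weighted
    1-nearest neighbor and return connected components as cluster labels."""
    features = np.asarray(features, dtype=float)
    timestamps = np.asarray(timestamps, dtype=float)
    n = len(features)

    # Cosine distance between L2-normalized frame features.
    f = features / np.linalg.norm(features, axis=1, keepdims=True)
    feat_dist = 1.0 - f @ f.T

    # Temporal weighting (assumption): scaling by the time gap penalizes
    # temporally distant frames, so each frame's 1-NN tends to be a nearby,
    # semantically similar frame.
    time_dist = np.abs(timestamps[:, None] - timestamps[None, :])
    dist = feat_dist * time_dist

    np.fill_diagonal(dist, np.inf)   # exclude self-links
    nn = dist.argmin(axis=1)         # 1-nearest neighbor of each frame

    # Symmetrize the 1-NN links; connected components become clusters.
    rows = np.arange(n)
    adj = csr_matrix((np.ones(n), (rows, nn)), shape=(n, n))
    adj = adj + adj.T
    _, labels = connected_components(adj, directed=False)
    return labels

# Example: 300 random 64-d frame features over a normalized timeline.
feats = np.random.rand(300, 64)
times = np.linspace(0.0, 1.0, 300)
labels = tw_finch_step(feats, times)
print(f"{labels.max() + 1} clusters found")
```

A full hierarchical pass would presumably recurse this step on per-cluster mean features to build successively coarser partitions, then select the partition closest to the desired number of actions K (e.g., the average per activity, as in the results below); consult the repository for the authors' exact procedure.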

| Task | Dataset | Model | Metric Name | Metric Value | Global Rank |
|---|---|---|---|---|---|
| Action Segmentation | 50 Salads | TW-FINCH (K=avg/activity) | Acc | 66.5 | #26 |
| Action Segmentation | Breakfast | TW-FINCH (K=avg/activity) | Acc | 62.7 | #32 |
| Action Segmentation | Breakfast | TW-FINCH (K=avg/activity) | mIoU | 42.3 | #4 |
| Action Segmentation | MPII Cooking 2 | Unsup. TW-FINCH (K=avg/activity) | Accuracy | 42 | #1 |
| Action Segmentation | MPII Cooking 2 | Unsup. TW-FINCH (K=avg/activity) | mIoU | 23.1 | #1 |
