Action Segmentation

72 papers with code • 9 benchmarks • 16 datasets

Action Segmentation is a challenging problem in high-level video understanding. In its simplest form, Action Segmentation aims to segment a temporally untrimmed video along the time axis and label each resulting segment with one of a set of predefined action labels. The results of Action Segmentation can then serve as input to various applications, such as video-to-text and action localization.

Source: TricorNet: A Hybrid Temporal Convolutional and Recurrent Network for Video Action Segmentation
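
As a minimal sketch of the definition above, the snippet below collapses per-frame action labels into contiguous labeled segments. The label names, the ten-frame example video, and the helper `frames_to_segments` are hypothetical and chosen only for illustration; they are not taken from any dataset or paper listed on this page.

```python
from itertools import groupby


def frames_to_segments(frame_labels):
    """Collapse per-frame labels into (label, start, end) segments.

    `start` is inclusive and `end` is exclusive, both in frame indices.
    """
    segments = []
    start = 0
    for label, run in groupby(frame_labels):
        length = sum(1 for _ in run)  # number of consecutive frames with this label
        segments.append((label, start, start + length))
        start += length
    return segments


# Hypothetical frame-wise labels for a 10-frame untrimmed video.
frame_labels = (
    ["background"] * 2 + ["cut_tomato"] * 4 + ["mix_salad"] * 3 + ["background"]
)
segments = frames_to_segments(frame_labels)
for label, start, end in segments:
    print(f"{label}: frames [{start}, {end})")
```

A model that predicts one label per frame therefore implicitly produces a segmentation: the segment boundaries are the frames where the predicted label changes.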

Benchmarks

These leaderboards are used to track progress in Action Segmentation.

Dataset	Best Model
Breakfast	AdaFocus (newly extracted I3D-features, LT-Context model)
50 Salads	Br-Prompt+ASPnet (RGB, flow, accelerometer)
GTEA	Semantic2Graph
COIN	UnLoc-L
JIGSAWS	MRG-Net
Assembly101	LTContext
Youtube INRIA Instructional	TSA (FINCH)
50Salads	EUT
MPII Cooking 2 Dataset	Unsup. TW-FINCH (K=avg/activity)

Libraries

Use these libraries to find Action Segmentation models and implementations.

pytorch/fairseq • 2 papers • 29,305 stars

Most implemented papers

Temporal Unet: Sample Level Human Action Recognition using WiFi

geekfeiw/WiSLAR • 19 Apr 2019

In this task, every WiFi distortion sample in the whole series should be categorized into one action, which is a critical technique in precise action localization, continuous action segmentation, and real-time action recognition.

Frontal Low-rank Random Tensors for Fine-grained Action Segmentation

yz-cnsdqz/ms-tcn-bilinear • 3 Jun 2019

In this paper, we propose an approach to representing high-order information for temporal action segmentation via a simple yet effective bilinear form.

Weakly Supervised Energy-Based Learning for Action Segmentation

JunLi-Galios/CDFL • ICCV 2019

This paper is about labeling video frames with action classes under weak supervision in training, where we have access to a temporal ordering of actions, but their start and end frames in training videos are unknown.

Action Segmentation with Joint Self-Supervised Temporal Domain Adaptation

cmhungsteve/SSTDA • CVPR 2020

Despite the recent progress of fully-supervised action segmentation techniques, the performance is still not fully satisfactory.

SCT: Set Constrained Temporal Transformer for Set Supervised Action Segmentation

MohsenFayyaz89/SCT • CVPR 2020

In addition, the network estimates the action labels for each frame.

Learning to Segment Actions from Observation and Narration

dpfried/action-segmentation • ACL 2020

We apply a generative segmental model of task structure, guided by narration, to action segmentation in video.

MS-TCN++: Multi-Stage Temporal Convolutional Network for Action Segmentation

sj-li/MS-TCN2 • 16 Jun 2020

Despite the capabilities of these approaches in capturing temporal dependencies, their predictions suffer from over-segmentation errors.

Online Spatiotemporal Action Detection and Prediction via Causal Representations

sahasuman/bmvc2016_code • 31 Aug 2020

In this thesis, we focus on video action understanding problems from an online and real-time processing point of view.

ActBERT: Learning Global-Local Video-Text Representations

PaddlePaddle/PaddleVideo • CVPR 2020

In this paper, we introduce ActBERT for self-supervised learning of joint video-text representations from unlabeled data.

Alleviating Class-wise Gradient Imbalance for Pulmonary Airway Segmentation

haozheng-sjtu/3d-airway-segmentation • 24 Nov 2020

Due to the small size and scattered spatial distribution of peripheral bronchi, this is hampered by severe class imbalance between foreground and background regions, which makes it challenging for CNN-based methods to parse distal small airways.
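
The over-segmentation errors mentioned in the MS-TCN++ entry above can be illustrated with a small sketch. It compares frame-wise accuracy with a segmental edit score, computed here as a normalized Levenshtein distance between the sequences of segment labels. The toy label sequences and all function names are assumptions made for illustration, not code from any repository listed here.

```python
from itertools import groupby


def segment_labels(frame_labels):
    """Return the ordered list of segment labels (consecutive repeats merged)."""
    return [label for label, _ in groupby(frame_labels)]


def levenshtein(a, b):
    """Edit distance between two label sequences (insert, delete, substitute)."""
    prev = list(range(len(b) + 1))
    for i, x in enumerate(a, start=1):
        curr = [i]
        for j, y in enumerate(b, start=1):
            curr.append(min(prev[j] + 1, curr[j - 1] + 1, prev[j - 1] + (x != y)))
        prev = curr
    return prev[-1]


def edit_score(pred, truth):
    """Segmental edit score in [0, 100]; assumes both inputs are non-empty."""
    p, t = segment_labels(pred), segment_labels(truth)
    return 100.0 * (1 - levenshtein(p, t) / max(len(p), len(t)))


def frame_accuracy(pred, truth):
    """Fraction of frames whose predicted label matches the ground truth."""
    return sum(p == t for p, t in zip(pred, truth)) / len(truth)


# Toy example: two true segments, but the prediction flickers between labels.
truth = ["A"] * 6 + ["B"] * 6
pred = ["A", "A", "A", "B", "A", "A", "B", "B", "B", "A", "B", "B"]

acc = frame_accuracy(pred, truth)
score = edit_score(pred, truth)
print(f"frame accuracy: {acc:.2f}, predicted segments: {len(segment_labels(pred))}")
print(f"edit score: {score:.1f}")
```

In this toy case most frames are labeled correctly, yet the prediction contains six segments where the ground truth has two, so the segmental score is much lower than the frame-wise accuracy suggests.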
