Action Segmentation

72 papers with code • 9 benchmarks • 16 datasets

Action Segmentation is a challenging problem in high-level video understanding. In its simplest form, it aims to divide a temporally untrimmed video into segments along the time axis and label each segment with one of a set of predefined action labels. The resulting segments can then serve as input to downstream applications such as video-to-text and action localization.

Source: TricorNet: A Hybrid Temporal Convolutional and Recurrent Network for Video Action Segmentation
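
To make the task definition concrete, here is a minimal sketch of what a segmentation output looks like, assuming a model has already produced one label per frame. The function name `frames_to_segments` and the example labels are hypothetical and are not taken from any of the listed papers; the code only collapses consecutive identical frame labels into `(start, end, label)` segments.

```python
from itertools import groupby


def frames_to_segments(frame_labels):
    """Collapse per-frame labels into (start, end, label) segments.

    `start` is inclusive and `end` is exclusive, both in frame indices.
    """
    segments = []
    start = 0
    for label, run in groupby(frame_labels):
        length = sum(1 for _ in run)  # number of consecutive frames with this label
        segments.append((start, start + length, label))
        start += length
    return segments


# Hypothetical per-frame predictions for a short clip.
frame_labels = ["pour", "pour", "pour", "stir", "stir", "pour"]
segments = frames_to_segments(frame_labels)
for start, end, label in segments:
    print(f"frames [{start}, {end}): {label}")
```

Note that the same action label can appear in more than one segment, as with the two separate runs of the first label in this example.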

Benchmarks

These leaderboards are used to track progress in Action Segmentation.

Dataset	Best Model
Breakfast	AdaFocus (newly extracted I3D-features, LT-Context model)
50 Salads	Br-Prompt+ASPnet (RGB, flow, accelerometer)
GTEA	Semantic2Graph
COIN	UnLoc-L
JIGSAWS	MRG-Net
Assembly101	LTContext
Youtube INRIA Instructional	TSA (FINCH)
50Salads	EUT
MPII Cooking 2 Dataset	Unsup. TW-FINCH (K=avg/activity)

Libraries

Use these libraries to find Action Segmentation models and implementations.

pytorch/fairseq • 2 papers • 29,251

Latest papers with no code

HOIST-Former: Hand-held Objects Identification, Segmentation, and Tracking in the Wild

no code yet • 22 Apr 2024

We address the challenging task of identifying, segmenting, and tracking hand-held objects, which is crucial for applications such as human action segmentation and performance evaluation.

O-TALC: Steps Towards Combating Oversegmentation within Online Action Segmentation

no code yet • 10 Apr 2024

In order to facilitate online action segmentation on a stream of incoming video data, we introduce two methods for improved training and inference of backbone action recognition models, allowing them to be deployed directly for online frame level classification.

Coherent Temporal Synthesis for Incremental Action Segmentation

no code yet • 10 Mar 2024

Data replay is a successful incremental learning technique for images.

A Multimodal Handover Failure Detection Dataset and Baselines

no code yet • 28 Feb 2024

To address this deficit, we present the multimodal Handover Failure Detection dataset, which consists of failures induced by the human participant, such as ignoring the robot or not releasing the object.

ADL4D: Towards A Contextually Rich Dataset for 4D Activities of Daily Living

no code yet • 27 Feb 2024

However, existing datasets for 4D HOI (3D HOI over time) are limited to one subject interacting with one object only.

ViSTec: Video Modeling for Sports Technique Recognition and Tactical Analysis

no code yet • 25 Feb 2024

The immense popularity of racket sports has fueled substantial demand in tactical analysis with broadcast videos.

Friends Across Time: Multi-Scale Action Segmentation Transformer for Surgical Phase Recognition

no code yet • 22 Jan 2024

Current state-of-the-art methods use both spatial and temporal information to tackle the surgical phase recognition task.

Depth Over RGB: Automatic Evaluation of Open Surgery Skills Using Depth Camera

no code yet • 18 Jan 2024

We focused on hand and tool detection, and action segmentation in suturing procedures.

Robotic Imitation of Human Actions

no code yet • 16 Jan 2024

Imitation can allow us to quickly gain an understanding of a new task.

SFGANS Self-supervised Future Generator for human ActioN Segmentation

no code yet • 31 Dec 2023

The ability to locate and classify action segments in long untrimmed video is of particular interest to many applications such as autonomous cars, robotics and healthcare applications.
