Action Parsing

2 papers with code • 0 benchmarks • 2 datasets

Action parsing is the task of, given a video or still image, assigning each frame or image a label describing the action in that frame or image.

Benchmarks

Add a Result

These leaderboards are used to track progress in Action Parsing

No evaluation results yet. Help compare methods by submitting evaluation metrics.

Datasets

Latest papers with no code

Most implemented Social Latest No code

Action parsing using context features

no code yet • 20 May 2022

We propose an action parsing algorithm to parse a video sequence containing an unknown number of actions into its action segments.

Paper
Add Code

Part-level Action Parsing via a Pose-guided Coarse-to-Fine Framework

no code yet • 9 Mar 2022

Therefore, researchers start to focus on a new task, Part-level Action Parsing (PAP), which aims to not only predict the video-level action but also recognize the frame-level fine-grained actions or interactions of body parts for each person in the video.

Paper
Add Code

Technical Report: Disentangled Action Parsing Networks for Accurate Part-level Action Parsing

no code yet • 5 Nov 2021

Despite of dramatic progresses in the area of video classification research, a severe problem faced by the community is that the detailed understanding of human actions is ignored.

Paper
Add Code

A Baseline Framework for Part-level Action Parsing and Action Recognition

no code yet • 7 Oct 2021

This technical report introduces our 2nd place solution to Kinetics-TPS Track on Part-level Action Parsing in ICCV DeeperAction Workshop 2021.

Paper
Add Code

SSCAP: Self-supervised Co-occurrence Action Parsing for Unsupervised Temporal Action Segmentation

no code yet • 29 May 2021

However, it is quite expensive to annotate every frame in a large corpus of videos to construct a comprehensive supervised training dataset.

Paper
Add Code

Intra- and Inter-Action Understanding via Temporal Action Parsing

no code yet • CVPR 2020

Current methods for action recognition primarily rely on deep convolutional networks to derive feature embeddings of visual and motion features.

Paper
Add Code

IncSQL: Training Incremental Text-to-SQL Parsers with Non-Deterministic Oracles

no code yet • 13 Sep 2018

We present a sequence-to-action parsing approach for the natural language to SQL task that incrementally fills the slots of a SQL query with feasible actions from a pre-defined inventory.

Paper
Add Code

DAP3D-Net: Where, What and How Actions Occur in Videos?

no code yet • 10 Feb 2016

Action parsing in videos with complex scenes is an interesting but challenging task in computer vision.

Paper
Add Code

Action Recognition by Hierarchical Mid-level Action Elements

no code yet • ICCV 2015

Realistic videos of human actions exhibit rich spatiotemporal structures at multiple levels of granularity: an action can always be decomposed into multiple finer-grained elements in both space and time.

Paper
Add Code

An Expressive Deep Model for Human Action Parsing from A Single Image

no code yet • 2 Feb 2015

This paper aims at one newly raising task in vision and multimedia research: recognizing human actions from still images.

Paper
Add Code

Action Parsing

Benchmarks Add a Result

Datasets

Latest papers with no code

Content

Benchmarks

Add a Result