Action Anticipation

34 papers with code • 6 benchmarks • 8 datasets

Next action anticipation is defined as observing 1, ... , T frames and predicting the action that happens after a gap of T_a seconds. It is important to note that a new action starts after T_a seconds that is not seen in the observed frames. Here T_a=1 second.

Action Scene Graphs for Long-Form Understanding of Egocentric Videos

fpv-iplab/easg 6 Dec 2023

We present Egocentric Action Scene Graphs (EASGs), a new representation for long-form understanding of egocentric videos.

14
06 Dec 2023

Object-centric Video Representation for Long-term Action Anticipation

brown-palm/ObjectPrompt 31 Oct 2023

To recognize and predict human-object interactions, we use a Transformer-based neural architecture which allows the "retrieval" of relevant objects for action anticipation at various time scales.

3
31 Oct 2023

Palm: Predicting Actions through Language Models @ Ego4D Long-Term Action Anticipation Challenge 2023

dandoge/palm 28 Jun 2023

We present Palm, a solution to the Long-Term Action Anticipation (LTA) task utilizing vision-language and large language models.

9
28 Jun 2023

Action Anticipation with Goal Consistency

olga-zats/goal_consistency 26 Jun 2023

In this paper, we address the problem of short-term action anticipation, i. e., we want to predict an upcoming action one second before it happens.

4
26 Jun 2023

Enhancing Next Active Object-based Egocentric Action Anticipation with Guided Attention

sanketsans/ganov2 22 May 2023

To this end, we propose a novel approach that applies a guided attention mechanism between the objects, and the spatiotemporal features extracted from video clips, enhancing the motion and contextual information, and further decoding the object-centric and motion-centric information to address the problem of STA in egocentric videos.

5
22 May 2023

Fine-grained Affordance Annotation for Egocentric Hand-Object Interaction Videos

zch-yu/epic-affordance-annotation 7 Feb 2023

Object affordance is an important concept in hand-object interaction, providing information on action possibilities based on human motor capacity and objects' physical property thus benefiting tasks such as action anticipation and robot imitation learning.

0
07 Feb 2023

Anticipative Feature Fusion Transformer for Multi-Modal Action Anticipation

zeyun-zhong/afft 23 Oct 2022

Although human action anticipation is a task which is inherently multi-modal, state-of-the-art methods on well known action anticipation datasets leverage this data by applying ensemble methods and averaging scores of unimodal anticipation networks.

24
23 Oct 2022

Rethinking Learning Approaches for Long-Term Action Anticipation

nmegha2601/anticipatr 20 Oct 2022

Action anticipation involves predicting future actions having observed the initial portion of a video.

9
20 Oct 2022

Text-Derived Knowledge Helps Vision: A Simple Cross-modal Distillation for Video-based Action Anticipation

stonybrooknlp/action-anticipation-lmtovideo 12 Oct 2022

Anticipating future actions in a video is useful for many autonomous and assistive technologies.

3
12 Oct 2022

Learning State-Aware Visual Representations from Audible Interactions

HimangiM/RepLAI 27 Sep 2022

However, learning representations from videos can be challenging.

11
27 Sep 2022