Egocentric Activity Recognition

14 papers with code • 2 benchmarks • 4 datasets

This task has no description! Would you like to contribute one?

Benchmarks

Add a Result

These leaderboards are used to track progress in Egocentric Activity Recognition

Trend	Dataset	Best Model	Paper	Code	Compare
	EPIC-KITCHENS-55	DEEP-HAL with ODF+SDF (AssembleNet++)			See all
	EGTEA	LaViLa (Finetuned, TimeSformer-L)			See all

Libraries

Use these libraries to find Egocentric Activity Recognition models and implementations

open-mmlab/mmaction2

2 papers

3,888

Datasets

Most implemented papers

Most implemented Social Latest No code

Long-Term Feature Banks for Detailed Video Understanding

facebookresearch/video-long-term-feature-banks • • CVPR 2019

To understand the world, we humans constantly need to relate the present to the past, and put events in context.

Paper
Code

Large-scale weakly-supervised pre-training for video action recognition

microsoft/computervision-recipes • • CVPR 2019

Second, frame-based models perform quite well on action recognition; is pre-training for good image features sufficient or is pre-training for spatio-temporal features valuable for optimal transfer learning?

Paper
Code

What Would You Expect? Anticipating Egocentric Actions with Rolling-Unrolling LSTMs and Modality Attention

fpv-iplab/rulstm • • ICCV 2019

Our method is ranked first in the public leaderboard of the EPIC-Kitchens egocentric action anticipation challenge 2019.

Paper
Code

Learning Video Representations from Large Language Models

facebookresearch/lavila • • CVPR 2023

We introduce LaViLa, a new approach to learning video-language representations by leveraging Large Language Models (LLMs).

Paper
Code

First-Person Hand Action Benchmark with RGB-D Videos and 3D Hand Pose Annotations

guiggh/hand_pose_action • CVPR 2018

Our dataset and experiments can be of interest to communities of 3D hand pose estimation, 6D object pose, and robotics as well as action recognition.

Paper
Code

A Correlation Based Feature Representation for First-Person Activity Recognition

rkahani/FirstPersonActivityRecognition • 15 Nov 2017

The per-frame (per-segment) extracted features are considered as a set of time series, and inter and intra-time series relations are employed to represent the video descriptors.

Paper
Code

Attention is All We Need: Nailing Down Object-centric Attention for Egocentric Activity Recognition

swathikirans/ego-rnn • • 31 Jul 2018

Our model is built on the observation that egocentric activities are highly characterized by the objects and their locations in the video.

Paper
Code

LSTA: Long Short-Term Attention for Egocentric Action Recognition

swathikirans/LSTA • • CVPR 2019

Egocentric activity recognition is one of the most challenging tasks in video analysis.

Paper
Code

EPIC-Fusion: Audio-Visual Temporal Binding for Egocentric Action Recognition

ekazakos/temporal-binding-network • • ICCV 2019

We focus on multi-modal fusion for egocentric action recognition, and propose a novel architecture for multi-modal temporal-binding, i. e. the combination of modalities within a range of temporal offsets.

Paper
Code

Integrating Human Gaze into Attention for Egocentric Activity Recognition

kylemin/Gaze-Attention • • 8 Nov 2020

In addition, we model the distribution of gaze fixations using a variational method.

Paper
Code

Egocentric Activity Recognition

Benchmarks Add a Result

Libraries

Datasets

Most implemented papers

Content

Benchmarks

Add a Result