Video Alignment

22 papers with code • 2 benchmarks • 4 datasets

This task has no description! Would you like to contribute one?

Benchmarks

Add a Result

These leaderboards are used to track progress in Video Alignment

Trend	Dataset	Best Model	Paper	Code	Compare
	UPenn Action	TCC + TCN			See all
	MSU Video Alignment and Retrieval Benchmark Suite	VQMT3D			See all

Datasets

Latest papers with no code

Most implemented Social Latest No code

Learning by Aligning 2D Skeleton Sequences and Multi-Modality Fusion

no code yet • 31 May 2023

This paper presents a self-supervised temporal video alignment framework which is useful for several fine-grained human activity understanding applications.

Paper
Add Code

Edit As You Wish: Video Description Editing with Multi-grained Commands

no code yet • 15 May 2023

In this paper, we propose a novel Video Description Editing (VDEdit) task to automatically revise an existing video description guided by flexible user requests.

Paper
Add Code

Video alignment using unsupervised learning of local and global features

no code yet • 13 Apr 2023

We introduce an unsupervised method for alignment that uses global and local features of the frames.

Paper
Add Code

VADER: Video Alignment Differencing and Retrieval

no code yet • ICCV 2023

We propose VADER, a spatio-temporal matching, alignment, and change summarization method to help fight misinformation spread via manipulated videos.

Paper
Add Code

Weakly-supervised Representation Learning for Video Alignment and Analysis

no code yet • 8 Feb 2023

This paper introduces LRProp -- a novel weakly-supervised representation learning approach, with an emphasis on the application of temporal alignment between pairs of videos of the same action category.

Paper
Add Code

PIDRo: Parallel Isomeric Attention with Dynamic Routing for Text-Video Retrieval

no code yet • ICCV 2023

The parallel isomeric attention module is used as the video encoder, which consists of two parallel branches modeling the spatial-temporal information of videos from both patch and frame levels.

Paper
Add Code

Self-supervised and Weakly Supervised Contrastive Learning for Frame-wise Action Representations

no code yet • 6 Dec 2022

In this paper, we introduce a new framework of contrastive action representation learning (CARL) to learn frame-wise action representation in a self-supervised or weakly-supervised manner, especially for long videos.

Paper
Add Code

Learning by Aligning Videos in Time

no code yet • CVPR 2021

To overcome this problem, we propose a temporal regularization term (i. e., Contrastive-IDM) which encourages different frames to be mapped to different points in the embedding space.

Paper
Add Code