Video Alignment

22 papers with code • 2 benchmarks • 4 datasets

This task has no description! Would you like to contribute one?

Weakly Supervised Video Representation Learning with Unaligned Text for Sequential Videos

svip-lab/weaksvr CVPR 2023

Sequential video understanding, as an emerging video understanding task, has driven lots of researchers' attention because of its goal-oriented nature.

24
22 Mar 2023

Learning a Grammar Inducer from Massive Uncurated Instructional Videos

Sy-Zhang/MMC-PCFG 22 Oct 2022

While previous work focuses on building systems for inducing grammars on text that are well-aligned with video content, we investigate the scenario, in which text and video are only in loose correspondence.

40
22 Oct 2022

Learning Viewpoint-Agnostic Visual Representations by Recovering Tokens in 3D Space

elicassion/3dtrl 23 Jun 2022

To this end, we propose a 3D Token Representation Layer (3DTRL) that estimates the 3D positional information of the visual tokens and leverages it for learning viewpoint-agnostic representations.

17
23 Jun 2022

Frame-wise Action Representations for Long Videos via Sequence Contrastive Learning

minghchen/carl_code CVPR 2022

In this paper, we introduce a novel contrastive action representation learning (CARL) framework to learn frame-wise action representations, especially for long videos, in a self-supervised manner.

77
28 Mar 2022

View-Invariant, Occlusion-Robust Probabilistic Embedding for Human Pose

google-research/google-research 23 Oct 2020

Recognition of human poses and actions is crucial for autonomous systems to interact smoothly with people.

32,890
23 Oct 2020

View-Invariant Probabilistic Embedding for Human Pose

google-research/google-research ECCV 2020

Depictions of similar human body configurations can vary with changing viewpoints.

32,890
02 Dec 2019

Adversarial Skill Networks: Unsupervised Robot Skill Learning from Video

mees/Adversarial-Skill-Networks 21 Oct 2019

Our method learns a general skill embedding independently from the task context by using an adversarial loss.

19
21 Oct 2019

Temporal Cycle-Consistency Learning

google-research/google-research CVPR 2019

We introduce a self-supervised representation learning method based on the task of temporal alignment between videos.

32,890
16 Apr 2019

Dynamic Temporal Alignment of Speech to Lips

tavihalperin/AV-sync 19 Aug 2018

This alignment is based on deep audio-visual features, mapping the lips video and the speech signal to a shared representation.

27
19 Aug 2018

LAMV: Learning to Align and Match Videos With Kernelized Temporal Layers

facebookresearch/videoalignment CVPR 2018

This paper considers a learnable approach for comparing and aligning videos.

133
01 Jun 2018