Search Results for author: Edward Fish

PLOT-TAL -- Prompt Learning with Optimal Transport for Few-Shot Temporal Action Localization

This paper introduces a novel approach to temporal action localization (TAL) in few-shot learning.

Paper
Add Code

Temporal Action Localization (TAL) aims to identify actions' start, end, and class labels in untrimmed videos.

Paper
Add Code

Recent advancement in Automatic Speech Recognition (ASR) has produced large AI models, which become impractical for deployment in mobile devices.

Paper
Code

Pure vision transformer architectures are highly effective for short video classification and action recognition tasks.

Paper
Add Code

We expand these 'coarse' genre labels by identifying 'fine-grained' semantic information within the multi-modal content of movies.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.