no code implementations • 4 Dec 2020 • Edward Fish, Jon Weinbren, Andrew Gilbert
We expand these 'coarse' genre labels by identifying 'fine-grained' semantic information within the multi-modal content of movies.
no code implementations • 2 Aug 2022 • Edward Fish, Jon Weinbren, Andrew Gilbert
Pure vision transformer architectures are highly effective for short video classification and action recognition tasks.
no code implementations • 5 Oct 2023 • Edward Fish, Jon Weinbren, Andrew Gilbert
Temporal Action Localization (TAL) aims to identify the start time, end time, and class label of each action in untrimmed videos.
no code implementations • 27 Mar 2024 • Edward Fish, Jon Weinbren, Andrew Gilbert
This paper introduces a novel approach to temporal action localization (TAL) in the few-shot learning setting.