no code implementations • 27 Mar 2024 • Edward Fish, Jon Weinbren, Andrew Gilbert
This paper introduces a novel approach to temporal action localization (TAL) in few-shot learning.
no code implementations • 5 Oct 2023 • Edward Fish, Jon Weinbren, Andrew Gilbert
Temporal Action Localization (TAL) aims to identify the start time, end time, and class label of each action in untrimmed videos.
1 code implementation • 24 Jul 2023 • Edward Fish, Umberto Michieli, Mete Ozay
Recent advances in Automatic Speech Recognition (ASR) have produced large AI models that are impractical to deploy on mobile devices.
no code implementations • 2 Aug 2022 • Edward Fish, Jon Weinbren, Andrew Gilbert
Pure vision transformer architectures are highly effective for short video classification and action recognition tasks.
no code implementations • 4 Dec 2020 • Edward Fish, Jon Weinbren, Andrew Gilbert
We expand these 'coarse' genre labels by identifying 'fine-grained' semantic information within the multi-modal content of movies.