Search Results for author: Edward Fish

Found 5 papers, 1 papers with code

Multi-Resolution Audio-Visual Feature Fusion for Temporal Action Localization

no code implementations5 Oct 2023 Edward Fish, Jon Weinbren, Andrew Gilbert

Temporal Action Localization (TAL) aims to identify actions' start, end, and class labels in untrimmed videos.

Temporal Action Localization

A Model for Every User and Budget: Label-Free and Personalized Mixed-Precision Quantization

1 code implementation24 Jul 2023 Edward Fish, Umberto Michieli, Mete Ozay

Recent advancement in Automatic Speech Recognition (ASR) has produced large AI models, which become impractical for deployment in mobile devices.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

Two-Stream Transformer Architecture for Long Video Understanding

no code implementations2 Aug 2022 Edward Fish, Jon Weinbren, Andrew Gilbert

Pure vision transformer architectures are highly effective for short video classification and action recognition tasks.

Action Recognition Inductive Bias +3

Rethinking movie genre classification with fine-grained semantic clustering

no code implementations4 Dec 2020 Edward Fish, Jon Weinbren, Andrew Gilbert

We expand these 'coarse' genre labels by identifying 'fine-grained' semantic information within the multi-modal content of movies.

Classification Clustering +2

Cannot find the paper you are looking for? You can Submit a new open access paper.