Search Results for author: Paul Dixon

Found 5 papers, 0 papers with code

Improving vision-inspired keyword spotting using dynamic module skipping in streaming conformer encoder

no code implementations • 31 Aug 2023 • Alexandre Bittar, Paul Dixon, Mohammad Samragh, Kumari Nishu, Devang Naik

Using a vision-inspired keyword spotting framework, we propose an architecture with input-dependent dynamic depth capable of processing streaming audio.

Keyword Spotting

Flexible Keyword Spotting based on Homogeneous Audio-Text Embedding

no code implementations • 12 Aug 2023 • Kumari Nishu, Minsik Cho, Paul Dixon, Devang Naik

Spotting user-defined/flexible keywords represented in text frequently uses an expensive text encoder for joint analysis with an audio encoder in an embedding space, which can suffer from heterogeneous modality representation (i.e., large mismatch) and increased complexity.

Keyword Spotting

Modality Dropout for Improved Performance-driven Talking Faces

no code implementations • 27 May 2020 • Ahmed Hussen Abdelaziz, Barry-John Theobald, Paul Dixon, Reinhard Knothe, Nicholas Apostoloff, Sachin Kajareker

We use subjective testing to demonstrate: 1) the improvement of audiovisual-driven animation over the equivalent video-only approach, and 2) the improvement in the animation of speech-related facial movements after introducing modality dropout.
