Search Results for author: Philip N. Garner

Found 18 papers, 5 papers with code

Investigating Cross-lingual Multi-level Adaptive Networks: The Importance of the Correlation of Source and Target Languages

no code implementations IWSLT 2016 Alexandros Lazaridis, Ivan Himawan, Petr Motlicek, Iosif Mporas, Philip N. Garner

We experiment with three different scenarios using, i) French, as a source language uncorrelated to the target language, ii) Ukrainian, as a source language correlated to the target one and finally iii) English as a source language uncorrelated to the target language using a relatively large amount of data in respect to the other two scenarios.

Conversational Speech Recognition Needs Data? Experiments with Austrian German

no code implementations LREC 2022 Julian Linke, Philip N. Garner, Gernot Kubin, Barbara Schuppler

Conversational speech represents one of the most complex of automatic speech recognition (ASR) tasks owing to the high inter-speaker variation in both pronunciation and conversational dynamics.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

Exploring neural oscillations during speech perception via surrogate gradient spiking neural networks

no code implementations22 Apr 2024 Alexandre Bittar, Philip N. Garner

Understanding cognitive processes in the brain demands sophisticated models capable of replicating neural dynamics at large scales.

speech-recognition Speech Recognition

Bayesian Parameter-Efficient Fine-Tuning for Overcoming Catastrophic Forgetting

no code implementations19 Feb 2024 Haolin Chen, Philip N. Garner

Our results demonstrate that catastrophic forgetting can be overcome by our methods without degrading the fine-tuning performance, and using the Kronecker factored approximations produces a better preservation of the pre-training knowledge than the diagonal ones.

Language Modelling Speech Synthesis +1

Vulnerability of Automatic Identity Recognition to Audio-Visual Deepfakes

no code implementations29 Nov 2023 Pavel Korshunov, Haolin Chen, Philip N. Garner, Sebastien Marcel

From the publicly available speech dataset LibriTTS, we also created a separate database of only audio deepfakes LibriTTS-DF using several latest text to speech methods: YourTTS, Adaspeech, and TorToiSe.

Face Recognition Face Swapping +2

The Interpreter Understands Your Meaning: End-to-end Spoken Language Understanding Aided by Speech Translation

1 code implementation16 May 2023 Mutian He, Philip N. Garner

Motivated particularly by the task of cross-lingual SLU, we demonstrate that the task of speech translation (ST) is a good means of pretraining speech models for end-to-end SLU on both intra- and cross-lingual scenarios.

Abstractive Text Summarization Continual Learning +7

An investigation into the adaptability of a diffusion-based TTS model

no code implementations3 Mar 2023 Haolin Chen, Philip N. Garner

Given the recent success of diffusion in producing natural-sounding synthetic speech, we investigate how diffusion can be used in speaker adaptive TTS.

Surrogate Gradient Spiking Neural Networks as Encoders for Large Vocabulary Continuous Speech Recognition

no code implementations1 Dec 2022 Alexandre Bittar, Philip N. Garner

Compared to conventional artificial neurons that produce dense and real-valued responses, biologically-inspired spiking neurons transmit sparse and binary information, which can also lead to energy-efficient implementations.

speech-recognition Speech Recognition

Low-Level Physiological Implications of End-to-End Learning of Speech Recognition

no code implementations22 Aug 2022 Louise Coppieters de Gibson, Philip N. Garner

We investigate whether the inference can be inverted to provide insights into that biological system; in particular the hearing mechanism.

speech-recognition Speech Recognition

A surrogate gradient spiking baseline for speech command recognition

1 code implementation Frontiers in Neuroscience 2022 Alexandre Bittar, Philip N. Garner

Artificial neural networks (ANNs) are the basis of recent advances in artificial intelligence (AI); they typically use real valued neuron responses.

Audio Classification Time Series

Bayesian Recurrent Units and the Forward-Backward Algorithm

1 code implementation21 Jul 2022 Alexandre Bittar, Philip N. Garner

Using Bayes's theorem, we derive a unit-wise recurrence as well as a backward recursion similar to the forward-backward algorithm.

Speech Recognition

A t-distribution based operator for enhancing out of distribution robustness of neural network classifiers

1 code implementation9 Jun 2020 Niccolò Antonello, Philip N. Garner

It is shown that classifiers that adopt this novel operator can be more robust to out of distribution samples, often outperforming NNs that use the standard softmax operator.

Unity

A Bayesian Approach to Recurrence in Neural Networks

no code implementations24 Oct 2019 Philip N. Garner, Sibo Tong

We show that introduction of a context indicator leads to a variable feedback that is similar to the forget mechanism in conventional recurrent units.

speech-recognition Speech Recognition

A Variational Prosody Model for the decomposition and synthesis of speech prosody

1 code implementation22 Jun 2018 Branislav Gerazov, Gérard Bailly, Omar Mohammed, Yi Xu, Philip N. Garner

Our work bridges between a comprehensive generative model of intonation and state-of-the-art AI techniques.

Speech Synthesis

Ad Hoc Microphone Array Calibration: Euclidean Distance Matrix Completion Algorithm and Theoretical Guarantees

no code implementations31 Aug 2014 Mohammad J. Taghizadeh, Reza Parhizkar, Philip N. Garner, Herve Bourlard, Afsaneh Asaei

This paper addresses the problem of ad hoc microphone array calibration where only partial information about the distances between microphones is available.

Low-Rank Matrix Completion

Cannot find the paper you are looking for? You can Submit a new open access paper.