Search Results for author: Philip N. Garner

Found 18 papers, 5 papers with code

Investigating Cross-lingual Multi-level Adaptive Networks: The Importance of the Correlation of Source and Target Languages

no code implementations • IWSLT 2016 • Alexandros Lazaridis, Ivan Himawan, Petr Motlicek, Iosif Mporas, Philip N. Garner

We experiment with three scenarios: (i) French, a source language uncorrelated with the target language; (ii) Ukrainian, a source language correlated with the target language; and (iii) English, a source language uncorrelated with the target language but with a relatively large amount of data compared with the other two scenarios.

Conversational Speech Recognition Needs Data? Experiments with Austrian German

no code implementations • LREC 2022 • Julian Linke, Philip N. Garner, Gernot Kubin, Barbara Schuppler

Conversational speech represents one of the most complex automatic speech recognition (ASR) tasks owing to the high inter-speaker variation in both pronunciation and conversational dynamics.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

Exploring neural oscillations during speech perception via surrogate gradient spiking neural networks

no code implementations • 22 Apr 2024 • Alexandre Bittar, Philip N. Garner

Understanding cognitive processes in the brain demands sophisticated models capable of replicating neural dynamics at large scales.

speech-recognition Speech Recognition

Bayesian Parameter-Efficient Fine-Tuning for Overcoming Catastrophic Forgetting

no code implementations • 19 Feb 2024 • Haolin Chen, Philip N. Garner

Our results demonstrate that catastrophic forgetting can be overcome by our methods without degrading the fine-tuning performance, and using the Kronecker factored approximations produces a better preservation of the pre-training knowledge than the diagonal ones.

Language Modelling Speech Synthesis +1

Vulnerability of Automatic Identity Recognition to Audio-Visual Deepfakes

no code implementations • 29 Nov 2023 • Pavel Korshunov, Haolin Chen, Philip N. Garner, Sebastien Marcel

From the publicly available speech dataset LibriTTS, we also created a separate database of audio-only deepfakes, LibriTTS-DF, using several recent text-to-speech methods: YourTTS, Adaspeech, and TorToiSe.

Face Recognition Face Swapping +2

Can ChatGPT Detect Intent? Evaluating Large Language Models for Spoken Language Understanding

no code implementations • 22 May 2023 • Mutian He, Philip N. Garner

Recently, large pretrained language models have demonstrated strong language understanding capabilities.

In-Context Learning intent-classification +4

The Interpreter Understands Your Meaning: End-to-end Spoken Language Understanding Aided by Speech Translation

1 code implementation • 16 May 2023 • Mutian He, Philip N. Garner

Motivated particularly by the task of cross-lingual SLU, we demonstrate that the task of speech translation (ST) is a good means of pretraining speech models for end-to-end SLU on both intra- and cross-lingual scenarios.

Abstractive Text Summarization Continual Learning +7

An investigation into the adaptability of a diffusion-based TTS model

no code implementations • 3 Mar 2023 • Haolin Chen, Philip N. Garner

Given the recent success of diffusion in producing natural-sounding synthetic speech, we investigate how diffusion can be used in speaker adaptive TTS.

Surrogate Gradient Spiking Neural Networks as Encoders for Large Vocabulary Continuous Speech Recognition

no code implementations • 1 Dec 2022 • Alexandre Bittar, Philip N. Garner

Compared to conventional artificial neurons that produce dense and real-valued responses, biologically-inspired spiking neurons transmit sparse and binary information, which can also lead to energy-efficient implementations.

speech-recognition Speech Recognition

Low-Level Physiological Implications of End-to-End Learning of Speech Recognition

no code implementations • 22 Aug 2022 • Louise Coppieters de Gibson, Philip N. Garner

We investigate whether the inference can be inverted to provide insights into that biological system; in particular the hearing mechanism.

speech-recognition Speech Recognition

A surrogate gradient spiking baseline for speech command recognition

1 code implementation • Frontiers in Neuroscience 2022 • Alexandre Bittar, Philip N. Garner

Artificial neural networks (ANNs) are the basis of recent advances in artificial intelligence (AI); they typically use real valued neuron responses.

Ranked #2 on Audio Classification on SSC

Audio Classification Time Series

Bayesian Recurrent Units and the Forward-Backward Algorithm

1 code implementation • 21 Jul 2022 • Alexandre Bittar, Philip N. Garner

Using Bayes's theorem, we derive a unit-wise recurrence as well as a backward recursion similar to the forward-backward algorithm.

Speech Recognition

A t-distribution based operator for enhancing out of distribution robustness of neural network classifiers

1 code implementation • 9 Jun 2020 • Niccolò Antonello, Philip N. Garner

It is shown that classifiers that adopt this novel operator can be more robust to out of distribution samples, often outperforming NNs that use the standard softmax operator.

Unity

A Bayesian Approach to Recurrence in Neural Networks

no code implementations • 24 Oct 2019 • Philip N. Garner, Sibo Tong

We show that introduction of a context indicator leads to a variable feedback that is similar to the forget mechanism in conventional recurrent units.

speech-recognition Speech Recognition

A Variational Prosody Model for the decomposition and synthesis of speech prosody

1 code implementation • 22 Jun 2018 • Branislav Gerazov, Gérard Bailly, Omar Mohammed, Yi Xu, Philip N. Garner

Our work bridges between a comprehensive generative model of intonation and state-of-the-art AI techniques.

Speech Synthesis

The SUMMA Platform Prototype

no code implementations • EACL 2017 • Renars Liepins, Ulrich Germann, Guntis Barzdins, Alexandra Birch, Steve Renals, Susanne Weber, Peggy van der Kreeft, Hervé Bourlard, João Prieto, Ondřej Klejch, Peter Bell, Alexandros Lazaridis, Alfonso Mendes, Sebastian Riedel, Mariana S. C. Almeida, Pedro Balage, Shay B. Cohen, Tomasz Dwojak, Philip N. Garner, Andreas Giefer, Marcin Junczys-Dowmunt, Hina Imran, David Nogueira, Ahmed Ali, Sebastião Miranda, Andrei Popescu-Belis, Lesly Miculicich Werlen, Nikos Papasarantopoulos, Abiola Obamuyide, Clive Jones, Fahim Dalvi, Andreas Vlachos, Yang Wang, Sibo Tong, Rico Sennrich, Nikolaos Pappas, Shashi Narayan, Marco Damonte, Nadir Durrani, Sameer Khurana, Ahmed Abdelali, Hassan Sajjad, Stephan Vogel, David Sheppey, Chris Hernon, Jeff Mitchell

We present the first prototype of the SUMMA Platform: an integrated platform for multilingual media monitoring.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +5

Composition of Deep and Spiking Neural Networks for Very Low Bit Rate Speech Coding

no code implementations • 15 Apr 2016 • Milos Cernak, Alexandros Lazaridis, Afsaneh Asaei, Philip N. Garner

Segmental errors are further propagated to optional suprasegmental (such as syllable) information coding.

speech-recognition Speech Recognition

Ad Hoc Microphone Array Calibration: Euclidean Distance Matrix Completion Algorithm and Theoretical Guarantees

no code implementations • 31 Aug 2014 • Mohammad J. Taghizadeh, Reza Parhizkar, Philip N. Garner, Herve Bourlard, Afsaneh Asaei

This paper addresses the problem of ad hoc microphone array calibration where only partial information about the distances between microphones is available.

Low-Rank Matrix Completion
