Search Results for author: Carlos Segura

Found 8 papers, 4 papers with code

Robust Wake-Up Word Detection by Two-stage Multi-resolution Ensembles

1 code implementation17 Oct 2023 Fernando López, Jordi Luque, Carlos Segura, Pablo Gómez

It employs two models: a lightweight on-device model for real-time processing of the audio stream and a verification model on the server-side, which is an ensemble of heterogeneous architectures that refine detection.

Efficient Keyword Spotting by capturing long-range interactions with Temporal Lambda Networks

1 code implementation16 Apr 2021 Biel Tura, Santiago Escuder, Ferran Diego, Carlos Segura, Jordi Luque

This work explores the application of Lambda networks, an alternative framework for capturing long-range interactions without attention, for the keyword spotting task.

Keyword Spotting speech-recognition +1

Blow: a single-scale hyperconditioned flow for non-parallel raw-audio voice conversion

3 code implementations NeurIPS 2019 Joan Serrà, Santiago Pascual, Carlos Segura

End-to-end models for raw audio generation are a challenge, specially if they have to work with non-parallel data, which is a desirable setup in many situations.

Audio Generation Voice Conversion

Cannot find the paper you are looking for? You can Submit a new open access paper.