Search Results for author: Stefano Squartini

Found 17 papers, 6 papers with code

One model to rule them all ? Towards End-to-End Joint Speaker Diarization and Speech Recognition

no code implementations 2 Oct 2023 Samuele Cornell, Jee-weon Jung, Shinji Watanabe, Stefano Squartini

This paper presents a novel framework for joint speaker diarization (SD) and automatic speech recognition (ASR), named SLIDAR (sliding-window diarization-augmented recognition).

Automatic Speech Recognition, Automatic Speech Recognition (ASR), +3

A Time-Frequency Generative Adversarial based method for Audio Packet Loss Concealment

1 code implementation 28 Jul 2023 Carlo Aironi, Samuele Cornell, Luca Serafini, Stefano Squartini

Packet loss is a major cause of voice quality degradation in VoIP transmissions, with a serious impact on intelligibility and user experience.

Image-to-Image Translation, Packet Loss Concealment, +1

End-to-End Integration of Speech Separation and Voice Activity Detection for Low-Latency Diarization of Telephone Conversations

no code implementations 21 Mar 2023 Giovanni Morrone, Samuele Cornell, Luca Serafini, Enrico Zovato, Alessio Brutti, Stefano Squartini

Finally, we show that the separated signals can also be readily used for automatic speech recognition, reaching performance close to that obtained with oracle sources in some configurations.

Action Detection, Activity Detection, +4

Conversational Speech Separation: an Evaluation Study for Streaming Applications

no code implementations 31 May 2022 Giovanni Morrone, Samuele Cornell, Enrico Zovato, Alessio Brutti, Stefano Squartini

Continuous speech separation (CSS) is a recently proposed framework that aims to separate each speaker from an input mixture signal in a streaming fashion.

Speech Separation

Learning Filterbanks for End-to-End Acoustic Beamforming

no code implementations 8 Nov 2021 Samuele Cornell, Manuel Pariente, François Grondin, Stefano Squartini

We perform a detailed analysis using the recent Clarity Challenge data and show that by using learnt filterbanks it is possible to surpass oracle-mask based beamforming for short windows.

Deep Optimization of Parametric IIR Filters for Audio Equalization

1 code implementation 5 Oct 2021 Giovanni Pepe, Leonardo Gabrielli, Stefano Squartini, Carlo Tripodi, Nicolò Strozzi

This paper describes a novel Deep Learning method for the design of IIR parametric filters for automatic audio equalization.

Graph-based Representation of Audio signals for Sound Event Classification

no code implementations 2021 29th European Signal Processing Conference (EUSIPCO) 2021 Carlo Aironi, Samuele Cornell, Emanuele Principi, Stefano Squartini

In recent years there has been a considerable rise in interest towards Graph Representation and Learning techniques, especially in such cases where data has intrinsically a graph-like structure: social networks, molecular lattices, or semantic interactions, just to name a few.

Real-World Anomaly Detection by using Digital Twin Systems and Weakly-Supervised Learning

no code implementations 12 Nov 2020 Andrea Castellani, Sebastian Schmitt, Stefano Squartini

The approaches make use of a Digital Twin to generate a training dataset which simulates the normal operation of the machinery, along with a small set of labeled anomalous measurements from the real machinery.

Anomaly Detection, Clustering, +1

Transfer Learning for Non-Intrusive Load Monitoring

1 code implementation 23 Feb 2019 Michele D'Incecco, Stefano Squartini, Mingjun Zhong

It is not clear if the method could be generalised or transferred to different domains, e.g., when the test data are drawn from a different country compared to the training data.

Non-Intrusive Load Monitoring, Transfer Learning

Polyphonic Sound Event Detection by using Capsule Neural Network

1 code implementation 15 Oct 2018 Fabio Vesperini, Leonardo Gabrielli, Emanuele Principi, Stefano Squartini

Artificial sound event detection (SED) aims to mimic the human ability to perceive and understand what is happening in the surroundings.

Event Detection, Sound Event Detection

A Multi-Stage Algorithm for Acoustic Physical Model Parameters Estimation

no code implementations 14 Sep 2018 Leonardo Gabrielli, Stefano Tomassetti, Stefano Squartini, Carlo Zinato, Stefano Guaiana

In this work we refine previous results by introducing the former approach in a multi-stage algorithm that also adds heuristics and a stochastic optimization method operating on objective cost functions based on psychoacoustics.

Stochastic Optimization
