Search Results for author: Francesco Paissan

Found 9 papers, 1 papers with code

Listenable Maps for Audio Classifiers

no code implementations19 Mar 2024 Francesco Paissan, Mirco Ravanelli, Cem Subakan

Despite the impressive performance of deep learning models across diverse tasks, their complexity poses challenges for interpretation.

tinyCLAP: Distilling Constrastive Language-Audio Pretrained Models

no code implementations24 Nov 2023 Francesco Paissan, Elisabetta Farella

Contrastive Language-Audio Pretraining (CLAP) became of crucial importance in the field of audio and speech processing.

Audio Generation Event Detection +2

Audio Editing with Non-Rigid Text Prompts

no code implementations19 Oct 2023 Francesco Paissan, Zhepei Wang, Mirco Ravanelli, Paris Smaragdis, Cem Subakan

We show that the proposed editing pipeline is able to create audio edits that remain faithful to the input audio.

Audio Generation Style Transfer

Posthoc Interpretation via Quantization

1 code implementation22 Mar 2023 Francesco Paissan, Cem Subakan, Mirco Ravanelli

In this paper, we introduce a new approach, called Posthoc Interpretation via Quantization (PIQ), for interpreting decisions made by trained classifiers.

Image Segmentation Quantization +1

Scaling strategies for on-device low-complexity source separation with Conv-Tasnet

no code implementations6 Mar 2023 Mohamed Nabih Ali, Francesco Paissan, Daniele Falavigna, Alessio Brutti

Given the modular nature of the well-known Conv-Tasnet speech separation architecture, in this paper we consider three parameters that directly control the overall size of the model, namely: the number of residual blocks, the number of repetitions of the separation blocks and the number of channels in the depth-wise convolutions, and experimentally evaluate how they affect the speech separation performance.

Speech Separation

XiNet: Efficient Neural Networks for tinyML

no code implementations ICCV 2023 Alberto Ancilotto, Francesco Paissan, Elisabetta Farella

The recent interest in the edge-to-cloud continuum paradigm has emphasized the need for simple and scalable architectures to deliver optimal performance on computationally constrained devices.

Image Classification object-detection +1

Low-complexity acoustic scene classification in DCASE 2022 Challenge

no code implementations8 Jun 2022 Irene Martín-Morató, Francesco Paissan, Alberto Ancilotto, Toni Heittola, Annamaria Mesaros, Elisabetta Farella, Alessio Brutti, Tuomas Virtanen

The provided baseline system is a convolutional neural network which employs post-training quantization of parameters, resulting in 46. 5 K parameters, and 29. 23 million multiply-and-accumulate operations (MMACs).

Acoustic Scene Classification Classification +2

PhiNets: a scalable backbone for low-power AI at the edge

no code implementations1 Oct 2021 Francesco Paissan, Alberto Ancilotto, Elisabetta Farella

In the Internet of Things era, where we see many interconnected and heterogeneous mobile and fixed smart devices, distributing the intelligence from the cloud to the edge has become a necessity.

Cannot find the paper you are looking for? You can Submit a new open access paper.