Search Results for author: Francesco Paissan

Found 9 papers, 1 papers with code

Listenable Maps for Audio Classifiers

no code implementations • 19 Mar 2024 • Francesco Paissan, Mirco Ravanelli, Cem Subakan

Despite the impressive performance of deep learning models across diverse tasks, their complexity poses challenges for interpretation.

Paper
Add Code

tinyCLAP: Distilling Constrastive Language-Audio Pretrained Models

no code implementations • 24 Nov 2023 • Francesco Paissan, Elisabetta Farella

Contrastive Language-Audio Pretraining (CLAP) became of crucial importance in the field of audio and speech processing.

Audio Generation Event Detection +2

Paper
Add Code

Audio Editing with Non-Rigid Text Prompts

no code implementations • 19 Oct 2023 • Francesco Paissan, Zhepei Wang, Mirco Ravanelli, Paris Smaragdis, Cem Subakan

We show that the proposed editing pipeline is able to create audio edits that remain faithful to the input audio.

Audio Generation Style Transfer

Paper
Add Code

Posthoc Interpretation via Quantization

1 code implementation • 22 Mar 2023 • Francesco Paissan, Cem Subakan, Mirco Ravanelli

In this paper, we introduce a new approach, called Posthoc Interpretation via Quantization (PIQ), for interpreting decisions made by trained classifiers.

Image Segmentation Quantization +1

7,894

Paper
Code

Scaling strategies for on-device low-complexity source separation with Conv-Tasnet

no code implementations • 6 Mar 2023 • Mohamed Nabih Ali, Francesco Paissan, Daniele Falavigna, Alessio Brutti

Given the modular nature of the well-known Conv-Tasnet speech separation architecture, in this paper we consider three parameters that directly control the overall size of the model, namely: the number of residual blocks, the number of repetitions of the separation blocks and the number of channels in the depth-wise convolutions, and experimentally evaluate how they affect the speech separation performance.

Speech Separation

Paper
Add Code

XiNet: Efficient Neural Networks for tinyML

no code implementations • ICCV 2023 • Alberto Ancilotto, Francesco Paissan, Elisabetta Farella

The recent interest in the edge-to-cloud continuum paradigm has emphasized the need for simple and scalable architectures to deliver optimal performance on computationally constrained devices.

Image Classification object-detection +1

Paper
Add Code

Low-complexity acoustic scene classification in DCASE 2022 Challenge

no code implementations • 8 Jun 2022 • Irene Martín-Morató, Francesco Paissan, Alberto Ancilotto, Toni Heittola, Annamaria Mesaros, Elisabetta Farella, Alessio Brutti, Tuomas Virtanen

The provided baseline system is a convolutional neural network which employs post-training quantization of parameters, resulting in 46. 5 K parameters, and 29. 23 million multiply-and-accumulate operations (MMACs).

Acoustic Scene Classification Classification +2

Paper
Add Code

PhiNets: a scalable backbone for low-power AI at the edge

no code implementations • 1 Oct 2021 • Francesco Paissan, Alberto Ancilotto, Elisabetta Farella

In the Internet of Things era, where we see many interconnected and heterogeneous mobile and fixed smart devices, distributing the intelligence from the cloud to the edge has become a necessity.

Paper
Add Code

Enabling energy efficient machine learning on a Ultra-Low-Power vision sensor for IoT

no code implementations • 2 Feb 2021 • Francesco Paissan, Massimo Gottardi, Elisabetta Farella

Therefore, domain-specific pipelines are usually delivered in order to exploit the full potential of these cameras.

BIG-bench Machine Learning Motion Detection

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.