no code implementations • 19 Dec 2024 • Riccardo Fosco Gramaccioni, Christian Marinoni, Emilian Postolache, Marco Comunità, Luca Cosmo, Joshua D. Reiss, Danilo Comminiello
Sound designers and Foley artists usually sonorize a scene, such as from a movie or video game, by manually annotating and sonorizing each action of interest in the video.
2 code implementations • 16 Dec 2024 • Giordano Cicchetti, Eleonora Grassucci, Luigi Sigillo, Danilo Comminiello
These models typically align each modality to a designated anchor without ensuring the alignment of all modalities with each other, leading to suboptimal performance in tasks requiring a joint understanding of multiple modalities.
Ranked #1 on
Zero-Shot Audio Retrieval
on AudioCaps
no code implementations • 23 Oct 2024 • Lorenzo Aloisi, Luigi Sigillo, Aurelio Uncini, Danilo Comminiello
In recent years, diffusion models have emerged as a superior alternative to generative adversarial networks (GANs) for high-fidelity image generation, with wide applications in text-to-image generation, image-to-image translation, and super-resolution.
1 code implementation • 3 Oct 2024 • Giovanni Pignata, Eleonora Grassucci, Giordano Cicchetti, Danilo Comminiello
Despite their impressive ability to regenerate content from the compressed semantic information received, generative models pose crucial challenges for communication systems in terms of high memory footprints and heavy computational load.
1 code implementation • 17 Sep 2024 • Eleonora Lopez, Luigi Sigillo, Federica Colonnese, Massimo Panella, Danilo Comminiello
Generating images from brain waves is gaining increasing attention due to its potential to advance brain-computer interface (BCI) systems by understanding how brain signals encode visual cues.
1 code implementation • 13 Sep 2024 • Eleonora Lopez, Aurelio Uncini, Danilo Comminiello
Then, a hypercomplex fusion module learns inter-modal relations among the embeddings of the different modalities.
no code implementations • 16 May 2024 • Eleonora Grassucci, Jinho Choi, Jihong Park, Riccardo F. Gramaccioni, Giordano Cicchetti, Danilo Comminiello
In recent years, novel communication strategies have emerged to face the challenges that the increased number of connected devices and the higher quality of transmitted information are posing.
1 code implementation • 16 May 2024 • Giordano Cicchetti, Eleonora Grassucci, Jihong Park, Jinho Choi, Sergio Barbarossa, Danilo Comminiello
In the new paradigm of semantic communication (SC), the focus is on delivering meanings behind bits by extracting semantic information from raw data.
no code implementations • 11 May 2024 • Danilo Comminiello, Eleonora Grassucci, Danilo P. Mandic, Aurelio Uncini
Hypercomplex algebras have recently been gaining prominence in the field of deep learning owing to the advantages of their division algebras over real vector spaces and their superior results when dealing with multidimensional signals in real-world 3D and 4D paradigms.
no code implementations • 8 May 2024 • Redemptor Jr Laceda Taloma, Patrizio Pisani, Danilo Comminiello
Time series clustering is fundamental in data analysis for discovering temporal patterns.
1 code implementation • 5 May 2024 • Pietro Nardelli, Danilo Comminiello
Compared to other action recognition tasks, violence detection in surveillance videos presents additional issues, such as the wide variety of real fight scenes.
2 code implementations • 8 Apr 2024 • Giordano Cicchetti, Danilo Comminiello
Real-world documents may suffer various forms of degradation, often resulting in lower accuracy in optical character recognition (OCR) systems.
Ranked #1 on
Binarization
on DIBCO 2019
1 code implementation • 27 Mar 2024 • Luigi Sigillo, Riccardo Fosco Gramaccioni, Alessandro Nicolosi, Danilo Comminiello
In this context, our method explores in depth the problem of ship image super resolution, which is crucial for coastal and port surveillance.
Ranked #1 on
Image Super-Resolution
on ShipSpotting
1 code implementation • 26 Mar 2024 • Eleonora Lopez, Eleonora Grassucci, Debora Capriotti, Danilo Comminiello
To achieve this, we define a type of cosine-similarity transform within the parameterized hypercomplex domain.
no code implementations • 14 Feb 2024 • Christian Marinoni, Riccardo Fosco Gramaccioni, Changan Chen, Aurelio Uncini, Danilo Comminiello
The primary goal of the L3DAS23 Signal Processing Grand Challenge at ICASSP 2023 is to promote and support collaborative research on machine learning for 3D audio signal processing, with a specific emphasis on 3D speech enhancement and 3D Sound Event Localization and Detection in Extended Reality applications.
Audio Signal Processing
Sound Event Localization and Detection
+1
no code implementations • 10 Jan 2024 • Eleonora Grassucci, Jihong Park, Sergio Barbarossa, Seong-Lyun Kim, Jinho Choi, Danilo Comminiello
Disclosing generative models capabilities in semantic communication paves the way for a paradigm shift with respect to conventional communication systems, which has great potential to reduce the amount of data traffic and offers a revolutionary versatility to novel tasks and applications that were not even conceivable a few years ago.
no code implementations • 1 Nov 2023 • Andrea Giuseppe Di Francesco, Giuliano Giampietro, Indro Spinelli, Danilo Comminiello
The artist similarity quest has become a crucial subject in social and scientific contexts.
no code implementations • 23 Oct 2023 • Marco Comunità, Riccardo F. Gramaccioni, Emilian Postolache, Emanuele Rodolà, Danilo Comminiello, Joshua D. Reiss
Sound design involves creatively selecting, recording, and editing sound effects for various media like cinema, video games, and virtual/augmented reality.
1 code implementation • 16 Oct 2023 • Luigi Sigillo, Eleonora Grassucci, Aurelio Uncini, Danilo Comminiello
The proposed quaternion wavelet network (QUAVE) can be easily integrated with any pre-existing medical image analysis or synthesis task, and it can be involved with real, quaternion, or hypercomplex-valued models, generalizing their adoption to single-channel data.
1 code implementation • 11 Oct 2023 • Eleonora Lopez, Eleonora Chiarantano, Eleonora Grassucci, Danilo Comminiello
Multimodal emotion recognition from physiological signals is receiving an increasing amount of attention due to the impossibility to control them at will unlike behavioral reactions, thus providing more reliable information.
1 code implementation • 11 Oct 2023 • Matteo Mancanelli, Eleonora Grassucci, Aurelio Uncini, Danilo Comminiello
Neural models based on hypercomplex algebra systems are growing and prolificating for a plethora of applications, ranging from computer vision to natural language processing.
no code implementations • 11 Oct 2023 • Guilherme Vieira, Eleonora Grassucci, Marcos Eduardo Valle, Danilo Comminiello
To overcome these limitations, we employ a dual quaternion representation of rigid motions in the 3D space that jointly describes rotations and translations of point sets, processing each of the points as a single entity.
no code implementations • 11 Oct 2023 • Eleonora Lopez, Filippo Betello, Federico Carmignani, Eleonora Grassucci, Danilo Comminiello
In this step, a parameterized hypercomplex neural network (PHNN) is employed to perform breast cancer classification.
Breast Cancer Histology Image Classification
Breast Tumour Classification
+4
no code implementations • 5 Sep 2023 • Eleonora Grassucci, Yuki Mitsufuji, Ping Zhang, Danilo Comminiello
Semantic communication is poised to play a pivotal role in shaping the landscape of future AI-driven communication systems.
1 code implementation • 7 Jun 2023 • Eleonora Grassucci, Sergio Barbarossa, Danilo Comminiello
We prove, through an in-depth assessment of multiple scenarios, that our method outperforms existing solutions in generating high-quality images with preserved semantic information even in cases where the received content is significantly degraded.
1 code implementation • 18 May 2023 • Luigi Sigillo, Eleonora Grassucci, Danilo Comminiello
We test our model on aerial images of the DroneVeichle dataset containing RGB-IR paired images.
1 code implementation • 4 May 2022 • Eleonora Grassucci, Luigi Sigillo, Aurelio Uncini, Danilo Comminiello
Image-to-image translation (I2I) aims at transferring the content representation from an input domain to an output one, bouncing along different target domains.
Ranked #3 on
Image-to-Image Translation
on CelebA-HQ
1 code implementation • 12 Apr 2022 • Eleonora Lopez, Eleonora Grassucci, Martina Valleriani, Danilo Comminiello
To overcome such limitations, in this paper, we propose a methodological approach for multi-view breast cancer classification based on parameterized hypercomplex neural networks.
Ranked #1 on
Cancer-no cancer per breast classification
on InBreast
(using extra training data)
1 code implementation • 5 Apr 2022 • Eric Guizzo, Tillman Weyde, Simone Scardapane, Danilo Comminiello
On the one hand, the classifier permits to optimize each latent axis of the embeddings for the classification of a specific emotion-related characteristic: valence, arousal, dominance and overall emotion.
1 code implementation • 4 Apr 2022 • Eleonora Grassucci, Gioia Mancini, Christian Brignone, Aurelio Uncini, Danilo Comminiello
We show that our dual quaternion SELD model with temporal convolution blocks (DualQSELD-TCN) achieves better results with respect to real and quaternion-valued baselines thanks to our augmented representation of the sound field.
Ranked #1 on
Sound Event Localization and Detection
on L3DAS21
1 code implementation • 21 Feb 2022 • Eric Guizzo, Christian Marinoni, Marco Pennese, Xinlei Ren, Xiguang Zheng, Chen Zhang, Bruno Masiero, Aurelio Uncini, Danilo Comminiello
The L3DAS22 Challenge is aimed at encouraging the development of machine learning strategies for 3D speech enhancement and 3D sound localization and detection in office-like environments.
4 code implementations • 8 Oct 2021 • Eleonora Grassucci, Aston Zhang, Danilo Comminiello
In this paper, we define the parameterization of hypercomplex convolutional layers and introduce the family of parameterized hypercomplex neural networks (PHNNs) that are lightweight and efficient large-scale models.
Ranked #1 on
Sound Event Detection
on L3DAS21
no code implementations • 19 Apr 2021 • Danilo Comminiello, Alireza Nezamdoust, Simone Scardapane, Michele Scarpiniti, Amir Hussain, Aurelio Uncini
In order to make this class of functional link adaptive filters (FLAFs) efficient, we propose low-complexity expansions and frequency-domain adaptation of the parameters.
3 code implementations • 19 Apr 2021 • Eleonora Grassucci, Edoardo Cicero, Danilo Comminiello
Latest Generative Adversarial Networks (GANs) are gathering outstanding results through a large-scale training, thus employing models composed of millions of parameters requiring extensive computational capabilities.
Ranked #1 on
Image Generation
on Oxford 102 Flowers 128x128
1 code implementation • 12 Apr 2021 • Eric Guizzo, Riccardo F. Gramaccioni, Saeid Jamili, Christian Marinoni, Edoardo Massaro, Claudia Medaglia, Giuseppe Nachira, Leonardo Nucciarelli, Ludovica Paglialunga, Marco Pennese, Sveva Pepe, Enrico Rocchi, Aurelio Uncini, Danilo Comminiello
The L3DAS21 Challenge is aimed at encouraging and fostering collaborative research on machine learning for 3D audio signal processing, with particular focus on 3D speech enhancement (SE) and 3D sound localization and detection (SELD).
3 code implementations • 22 Oct 2020 • Eleonora Grassucci, Danilo Comminiello, Aurelio Uncini
Deep probabilistic generative models have achieved incredible success in many fields of application.
no code implementations • 24 Jul 2020 • Danilo Comminiello, Michele Scarpiniti, Simone Scardapane, Luis A. Azpicueta-Ruiz, Aurelio Uncini
Nonlinear adaptive filters often show some sparse behavior due to the fact that not all the coefficients are equally useful for the modeling of any nonlinearity.
no code implementations • 8 Aug 2019 • Antonio Falvo, Danilo Comminiello, Simone Scardapane, Michele Scarpiniti, Aurelio Uncini
In this paper, we present a deep learning method that is able to reconstruct subsampled MR images obtained by reducing the k-space data, while maintaining a high image quality that can be used to observe brain lesions.
no code implementations • 26 Jul 2019 • Riccardo Vecchi, Simone Scardapane, Danilo Comminiello, Aurelio Uncini
To this end, we investigate two extensions of l1 and structured regularization to the quaternion domain.
no code implementations • 6 Feb 2019 • Simone Scardapane, Steven Van Vaerenbergh, Danilo Comminiello, Aurelio Uncini
Complex-valued neural networks (CVNNs) have been shown to be powerful nonlinear approximators when the input data can be properly modeled in the complex domain.
no code implementations • 17 Dec 2018 • Danilo Comminiello, Marco Lella, Simone Scardapane, Aurelio Uncini
Learning from data in the quaternion domain enables us to exploit internal dependencies of 4D signals and treating them as a single entity.
no code implementations • 11 Jul 2018 • Simone Scardapane, Steven Van Vaerenbergh, Danilo Comminiello, Simone Totaro, Aurelio Uncini
Gated recurrent neural networks have achieved remarkable results in the analysis of sequential data.
no code implementations • 26 Feb 2018 • Simone Scardapane, Steven Van Vaerenbergh, Danilo Comminiello, Aurelio Uncini
Graph neural networks (GNNs) are a class of neural networks that allow to efficiently perform inference on data that is associated to a graph structure, such as, e. g., citation networks or knowledge graphs.
1 code implementation • 2 Jul 2016 • Simone Scardapane, Danilo Comminiello, Amir Hussain, Aurelio Uncini
In this paper, we consider the joint task of simultaneously optimizing (i) the weights of a deep neural network, (ii) the number of neurons for each hidden layer, and (iii) the subset of active input features (i. e., feature selection).
no code implementations • 25 May 2016 • Michele Scarpiniti, Simone Scardapane, Danilo Comminiello, Raffaele Parisi, Aurelio Uncini
In this paper, we derive a modified InfoMax algorithm for the solution of Blind Signal Separation (BSS) problems by using advanced stochastic methods.
no code implementations • 18 May 2016 • Simone Scardapane, Michele Scarpiniti, Danilo Comminiello, Aurelio Uncini
Neural networks require a careful design in order to perform properly on a given task.