PHNNs: Lightweight Neural Networks via Parameterized Hypercomplex Convolutions

8 Oct 2021  ·  Eleonora Grassucci, Aston Zhang, Danilo Comminiello ·

Hypercomplex neural networks have proven to reduce the overall number of parameters while ensuring valuable performance by leveraging the properties of Clifford algebras. Recently, hypercomplex linear layers have been further improved by involving efficient parameterized Kronecker products. In this paper, we define the parameterization of hypercomplex convolutional layers and introduce the family of parameterized hypercomplex neural networks (PHNNs) that are lightweight and efficient large-scale models. Our method grasps the convolution rules and the filter organization directly from data without requiring a rigidly predefined domain structure to follow. PHNNs are flexible to operate in any user-defined or tuned domain, from 1D to $n$D regardless of whether the algebra rules are preset. Such a malleability allows processing multidimensional inputs in their natural domain without annexing further dimensions, as done, instead, in quaternion neural networks for 3D inputs like color images. As a result, the proposed family of PHNNs operates with $1/n$ free parameters as regards its analog in the real domain. We demonstrate the versatility of this approach to multiple domains of application by performing experiments on various image datasets as well as audio datasets in which our method outperforms real and quaternion-valued counterparts. Full code is available at: https://github.com/eleGAN23/HyperNets.

PDF Abstract

Datasets


Results from the Paper


Task Dataset Model Metric Name Metric Value Global Rank Result Benchmark
Sound Event Detection L3DAS21 PHC SEDnet n=2 F-Score 0.68 # 1
Error Rate 0.389 # 1
SED-score 0.638 # 1
Sound Event Detection L3DAS21 PHC SEDnet n=16 F-Score 0.588 # 2
Error Rate 0.509 # 3
SED-score 0.461 # 4
Sound Event Detection L3DAS21 PHC SEDnet n=8 F-Score 0.553 # 4
Error Rate 0.56 # 5
SED-score 0.503 # 2
Sound Event Detection L3DAS21 PHC SEDnet n=4 Error Rate 0.453 # 2
SED-score 0.407 # 5
Sound Event Detection L3DAS21 Quaternion SEDnet F-Score 0.58 # 3
Error Rate 0.516 # 4
SED-score 0.468 # 3

Methods