1 code implementation • 25 Jan 2025 • Pauline Bourigault, Danilo P. Mandic
We present a novel approach to anomaly detection by integrating Generalized Hyperbolic (GH) processes into kernel-based methods.
no code implementations • 13 Dec 2024 • Harry J. Davies, Giorgos Iacovides, Danilo P. Mandic
It then refines this approximate concept vector to trigger the concept token with high probability, by perturbing the approximate concept vector with noise and transforming it into token scores with the language model head.
no code implementations • 3 Oct 2024 • Mingxue Xu, Sadia Sharmin, Danilo P. Mandic
To this end, we propose a unified taxonomy, which bridges the matrix/tensor compression approaches and model compression concepts in ML and NLP research.
1 code implementation • 30 Jul 2024 • Harry J. Davies, James Monsen, Danilo P. Mandic
It is also demonstrated that these pre-trained models are straightforward to fine-tune for tasks such as classification of atrial fibrillation (AF), and beat detection in photoplethysmography.
no code implementations • 11 May 2024 • Danilo Comminiello, Eleonora Grassucci, Danilo P. Mandic, Aurelio Uncini
Hypercomplex algebras have recently been gaining prominence in the field of deep learning owing to the advantages of their division algebras over real vector spaces and their superior results when dealing with multidimensional signals in real-world 3D and 4D paradigms.
1 code implementation • 30 Apr 2024 • Wanqi Zhou, Shuanghao Bai, Danilo P. Mandic, Qibin Zhao, Badong Chen
To this end, this work presents the first comprehensive study on improving the adversarial robustness of VLMs against attacks targeting image, text, and multimodal inputs.
no code implementations • 22 Feb 2024 • Pauline Bourigault, Dongpo Xu, Danilo P. Mandic
We develop a robust quaternion recurrent neural network (QRNN) for real-time processing of 3D and 4D data with outliers.
no code implementations • 30 Jan 2024 • Qingchen Wang, Zhe Li, Zdenka Babic, Wei Deng, Ljubiša Stanković, Danilo P. Mandic
However, applying this paradigm to illuminate the interpretability of complex-valued CNNs meets a formidable obstacle: the extension of matched filtering to a general class of noncircular complex-valued data, referred to here as the widely linear matched filter (WLMF), has been only implicit in the literature.
no code implementations • 28 Nov 2023 • Danilo P. Mandic, Sayed Pouria Talebi, Clive Cheong Took, Yili Xia, Dongpo Xu, Min Xiang, Pauline Bourigault
From their inception, quaternions and their division algebra have proven to be advantageous in modelling rotation/orientation in three-dimensional spaces and have seen use from the initial formulation of electromagnetic filed theory through to forming the basis of quantum filed theory.
no code implementations • 2 Jul 2023 • Mingxue Xu, Yao Lei Xu, Danilo P. Mandic
Taking GPT family models (i. e. GPT-2 and CerebrasGPT) as case studies, our approach theoretically results in $46. 89\%$ fewer parameters of the entire model, with a compression ratio $39. 38\times$ - $65. 64\times$ for the embedding layers.
no code implementations • 23 May 2023 • Harry J. Davies, Ghena Hammour, Marek Zylinski, Amir Nassibi, Danilo P. Mandic
Through its operation as a Matched Filter, the encoder searches for matches with an ECG template pattern in the input signal, prior to filtering the matches with the subsequent convolutional layers and selecting peaks corresponding to true ECG matches.
no code implementations • 23 May 2023 • Yuyang Miao, Harry J. Davies, Danilo P. Mandic
Photoplethysmography (PPG) refers to the measurement of variations in blood volume using light and is a feature of most wearable devices.
no code implementations • 9 May 2023 • Yiming Jiang, Jinlan Liu, Dongpo Xu, Danilo P. Mandic
Adam-type algorithms have become a preferred choice for optimisation in the deep learning setting, however, despite success, their convergence is still not well understood.
no code implementations • 9 May 2023 • Shuning Sun, Qiankun Diao, Dongpo Xu, Pauline Bourigault, Danilo P. Mandic
Convex optimization methods have been extensively used in the fields of communications and signal processing.
no code implementations • 23 Mar 2023 • Yao Lei Xu, Kriton Konstantinidis, Danilo P. Mandic
Despite the omnipresence of tensors and tensor operations in modern deep learning, the use of tensor mathematics to formally design and describe neural networks is still under-explored within the deep learning community.
no code implementations • 16 Jan 2023 • Metin C. Yarici, Pierluigi Amadori, Harry Davies, Takashi Nakamura, Nico Lingg, Yiannis Demiris, Danilo P. Mandic
Ear EEG based driver fatigue monitoring systems have the potential to provide a seamless, efficient, and feasibly deployable alternative to existing scalp EEG based systems, which are often cumbersome and impractical.
no code implementations • 22 Dec 2022 • Harry J. Davies, Danilo P. Mandic
Our model aims to encode all of the relevant respiratory information contained within photoplethysmography waveform, and decode it into a waveform that is similar to a gold standard respiratory reference.
no code implementations • 5 Dec 2022 • Hongjian Xiao, Yao Lei Xu, Danilo P. Mandic
Financial markets typically exhibit dynamically complex properties as they undergo continuous interactions with economic and environmental factors.
no code implementations • 26 Oct 2022 • Yao Lei Xu, Kriton Konstantinidis, Danilo P. Mandic
This represents a challenge for modern machine learning models, as the number of model parameters needed to process such data grows exponentially with the data dimensions; an effect known as the Curse-of-Dimensionality.
1 code implementation • 24 Jan 2022 • Rohan Tangri, Danilo P. Mandic, Anthony G. Constantinides
Reinforcement learning is increasingly finding success across domains where the problem can be represented as a Markov decision process.
no code implementations • 20 Sep 2021 • Hongjian Xiao, Danilo P. Mandic
And this algorithm is tested by both stimulated and real signals.
no code implementations • 14 Sep 2021 • Harry J. Davies, Ghena Hammour, Hongjian Xiao, Danilo P. Mandic
Overall, the proposed apparatus provides us with a simple, effective and physically meaningful way to generate faithful surrogate breathing disorder waveforms, a prerequisite for the use of artificial intelligence in respiratory health.
no code implementations • 7 Jun 2021 • Alvaro Arroyo, Bruno Scalzo, Ljubisa Stankovic, Danilo P. Mandic
Stock market returns are typically analyzed using standard regression, yet they reside on irregular domains which is a natural scenario for graph signal processing.
no code implementations • 11 May 2021 • Yao Lei Xu, Giuseppe G. Calvi, Danilo P. Mandic
Recurrent Neural Networks (RNNs) represent the de facto standard machine learning tool for sequence modelling, owing to their expressive power and memory.
no code implementations • 11 May 2021 • Bruno Scalzo Dees, Yao Lei Xu, Anthony G. Constantinides, Danilo P. Mandic
Finally, we also explore the application of modern deep learning models, such as graph neural networks and hyper-graph neural networks, as general purpose models for the modelling and forecasting of underground data, especially in the context of the morning and evening rush hours.
no code implementations • 27 Mar 2021 • Yao Lei Xu, Kriton Konstantinidis, Danilo P. Mandic
Modern data sources are typically of large scale and multi-modal natures, and acquired on irregular domains, which poses serious challenges to traditional deep learning models.
no code implementations • 31 Jan 2021 • Bruno Scalzo, Alvaro Arroyo, Ljubisa Stankovic, Danilo P. Mandic
Classical portfolio optimization methods typically determine an optimal capital allocation through the implicit, yet critical, assumption of statistical time-invariance.
no code implementations • 3 Jan 2021 • Harry J. Davies, Ian Williams, Ghena Hammour, Metin Yarici, Barry M. Seemungal, Danilo P. Mandic
Classification of cognitive workload promises immense benefit in diverse areas ranging from driver safety to augmenting human capability through closed loop brain computer interface.
1 code implementation • 25 Oct 2020 • Yao Lei Xu, Kriton Konstantinidis, Danilo P. Mandic
The irregular and multi-modal nature of numerous modern data sources poses serious challenges for traditional deep learning algorithms.
no code implementations • 18 Sep 2020 • Yao Lei Xu, Danilo P. Mandic
Recurrent Neural Networks (RNNs) are among the most successful machine learning models for sequence modelling, but tend to suffer from an exponential increase in the number of parameters when dealing with large multidimensional data.
no code implementations • 27 Jul 2020 • Bruno Scalzo, Ljubisa Stankovic, Danilo P. Mandic
A class of multivariate spectral representations for real-valued nonstationary random variables is introduced, which is characterised by a general complex Gaussian distribution.
no code implementations • 7 Jun 2020 • Harry J. Davies, Ian Williams, Nicholas S. Peters, Danilo P. Mandic
In this study, we set out to establish the feasibility of SpO2 measurement from the ear canal as a convenient site for long term monitoring, and perform a comprehensive comparison with the right index finger - the conventional clinical measurement site.
no code implementations • 26 Feb 2020 • Davide Bacciu, Danilo P. Mandic
The paper surveys the topic of tensor decompositions in modern machine learning applications.
1 code implementation • 27 Jan 2020 • Alexandros Haliassos, Kriton Konstantinidis, Danilo P. Mandic
However, both TT and other Tensor Networks (TNs), such as Tensor Ring and Hierarchical Tucker, are sensitive to the ordering of their indices (and hence to the features).
no code implementations • 24 Oct 2019 • Jean P. Chereau, Bruno Scalzo Dees, Danilo P. Mandic
Principal component analysis (PCA) is recognised as a quintessential data analysis technique when it comes to describing linear relationships between the features of a dataset.
no code implementations • 12 Oct 2019 • Bruno Scalzo Dees, Ljubisa Stankovic, Anthony G. Constantinides, Danilo P. Mandic
Investment returns naturally reside on irregular domains, however, standard multivariate portfolio optimization methods are agnostic to data structure.
no code implementations • 14 Mar 2019 • Giuseppe G. Calvi, Ahmad Moniri, Mahmoud Mahfouz, Qibin Zhao, Danilo P. Mandic
This is achieved through a tensor valued approach, based on the proposed Tucker Tensor Layer (TTL), as an alternative to the dense weight-matrices of DNNs.
2 code implementations • 22 Nov 2017 • Shota Saito, Danilo P. Mandic, Hideyuki Suzuki
The proposed $p$-Laplacian is shown to outperform standard hypergraph Laplacians in the experiment on a hypergraph semi-supervised learning and normalized cut setting.
no code implementations • 1 Nov 2017 • Ilia Kisil, Giuseppe G. Calvi, Danilo P. Mandic
A novel method for common and individual feature analysis from exceedingly large-scale data is proposed, in order to ensure the tractability of both the computation and storage and thus mitigate the curse of dimensionality, a major bottleneck in modern data science.
no code implementations • 3 Jan 2017 • Takashi Nakamura, Valentin Goverdovsky, Mary J. Morrell, Danilo P. Mandic
The monitoring of sleep patterns without patient's inconvenience or involvement of a medical specialist is a clinical question of significant importance.
no code implementations • 8 Mar 2016 • Sayed Pouria Talebi, Danilo P. Mandic
In the first stage a quaternion extended Kalman filter, which provides a unified framework for joint modeling of voltage measurements from all the phases, is used to estimate the instantaneous phase increment of the three-phase voltages.
no code implementations • 13 Jun 2014 • Dongpo Xu, Danilo P. Mandic
The optimization of real scalar functions of quaternion variables, such as the mean square error or array output power, underpins many practical applications.
1 code implementation • 5 Jul 2012 • Qibin Zhao, Cesar F. Caiafa, Danilo P. Mandic, Zenas C. Chao, Yasuo Nagasaka, Naotaka Fujii, Liqing Zhang, Andrzej Cichocki
A new generalized multilinear regression model, termed the Higher-Order Partial Least Squares (HOPLS), is introduced with the aim to predict a tensor (multiway array) $\tensor{Y}$ from a tensor $\tensor{X}$ through projecting the data onto the latent space and performing regression on the corresponding latent variables.