no code implementations • 28 Mar 2024 • Sarwan Ali, Prakash Chourasia, Murray Patterson
This study introduces a novel approach, combining substruct counting, $k$-mers, and Daylight-like fingerprints, to expand the representation of chemical structures in SMILES strings.
no code implementations • 12 Feb 2024 • Sarwan Ali, Tamkanat E Ali, Prakash Chourasia, Murray Patterson
In this work, we present a novel approach based on the compression-based Model, motivated from \cite{jiang2023low}, which combines the simplicity of basic compression algorithms like Gzip and Bz2, with Normalized Compression Distance (NCD) algorithm to achieve better performance on classification tasks without relying on handcrafted features or pre-trained models.
1 code implementation • 25 Apr 2023 • Zahra Tayebi, Sarwan Ali, Prakash Chourasia, Taslim Murad, Murray Patterson
Sparse coding is a popular technique in machine learning that enables the representation of data with a set of informative features and can capture complex relationships between amino acids and identify subtle patterns in the sequence that might be missed by low-dimensional methods.
no code implementations • 24 Apr 2023 • Sarwan Ali, Babatunde Bello, Prakash Chourasia, Ria Thazhe Punathil, Pin-Yu Chen, Imdad Ullah Khan, Murray Patterson
Understanding the host-specificity of different families of viruses sheds light on the origin of, e. g., SARS-CoV-2, rabies, and other such zoonotic pathogens in humans.
1 code implementation • 6 Apr 2023 • Sarwan Ali, Prakash Chourasia, Zahra Tayebi, Babatunde Bello, Murray Patterson
In this work, we propose \emph{ViralVectors}, a compact feature vector generation from virome sequencing data that allows effective downstream analysis.
no code implementations • 17 Feb 2023 • Prakash Chourasia, Taslim Murad, Zahra Tayebi, Sarwan Ali, Imdad Ullah Khan, Murray Patterson
This paper presents a federated learning (FL) approach to train an AI model for SARS-Cov-2 variant classification.
no code implementations • 1 Feb 2023 • Sarwan Ali, Prakash Chourasia, Murray Patterson
Anderson acceleration (AA) is a well-known method for accelerating the convergence of iterative algorithms, with applications in various fields including deep learning and optimization.
1 code implementation • 16 Nov 2022 • Prakash Chourasia, Sarwan Ali, Murray Patterson
We show that by using different techniques, such as informed initialization and kernel matrix selection, that t-SNE performs significantly better.
no code implementations • 15 Nov 2022 • Prakash Chourasia, Sarwan Ali, Simone Ciccolella, Gianluca Della Vedova, Murray Patterson
As a result, new methods such as Pangolin, which can scale to the millions of samples of SARS-CoV-2 currently available, have appeared.
no code implementations • 6 Jan 2022 • Sarwan Ali, Babatunde Bello, Prakash Chourasia, Ria Thazhe Punathil, Yijing Zhou, Murray Patterson
In coronaviruses, the surface (S) protein, or spike protein, is an important part of determining host specificity since it is the point of contact between the virus and the host cell membrane.