Search Results for author: Prakash Chourasia

Found 10 papers, 3 papers with code

Expanding Chemical Representation with k-mers and Fragment-based Fingerprints for Molecular Fingerprinting

no code implementations28 Mar 2024 Sarwan Ali, Prakash Chourasia, Murray Patterson

This study introduces a novel approach, combining substruct counting, $k$-mers, and Daylight-like fingerprints, to expand the representation of chemical structures in SMILES strings.

Drug Discovery

A Universal Non-Parametric Approach For Improved Molecular Sequence Analysis

no code implementations12 Feb 2024 Sarwan Ali, Tamkanat E Ali, Prakash Chourasia, Murray Patterson

In this work, we present a novel approach based on the compression-based Model, motivated from \cite{jiang2023low}, which combines the simplicity of basic compression algorithms like Gzip and Bz2, with Normalized Compression Distance (NCD) algorithm to achieve better performance on classification tasks without relying on handcrafted features or pre-trained models.

T Cell Receptor Protein Sequences and Sparse Coding: A Novel Approach to Cancer Classification

1 code implementation25 Apr 2023 Zahra Tayebi, Sarwan Ali, Prakash Chourasia, Taslim Murad, Murray Patterson

Sparse coding is a popular technique in machine learning that enables the representation of data with a set of informative features and can capture complex relationships between amino acids and identify subtle patterns in the sequence that might be missed by low-dimensional methods.

Multi-class Classification Specificity

Virus2Vec: Viral Sequence Classification Using Machine Learning

no code implementations24 Apr 2023 Sarwan Ali, Babatunde Bello, Prakash Chourasia, Ria Thazhe Punathil, Pin-Yu Chen, Imdad Ullah Khan, Murray Patterson

Understanding the host-specificity of different families of viruses sheds light on the origin of, e. g., SARS-CoV-2, rabies, and other such zoonotic pathogens in humans.

Classification Specificity

ViralVectors: Compact and Scalable Alignment-free Virome Feature Generation

1 code implementation6 Apr 2023 Sarwan Ali, Prakash Chourasia, Zahra Tayebi, Babatunde Bello, Murray Patterson

In this work, we propose \emph{ViralVectors}, a compact feature vector generation from virome sequencing data that allows effective downstream analysis.

4k Decision Making

Anderson Acceleration For Bioinformatics-Based Machine Learning

no code implementations1 Feb 2023 Sarwan Ali, Prakash Chourasia, Murray Patterson

Anderson acceleration (AA) is a well-known method for accelerating the convergence of iterative algorithms, with applications in various fields including deep learning and optimization.

Informative Initialization and Kernel Selection Improves t-SNE for Biological Sequences

1 code implementation16 Nov 2022 Prakash Chourasia, Sarwan Ali, Murray Patterson

We show that by using different techniques, such as informed initialization and kernel matrix selection, that t-SNE performs significantly better.

Reads2Vec: Efficient Embedding of Raw High-Throughput Sequencing Reads Data

no code implementations15 Nov 2022 Prakash Chourasia, Sarwan Ali, Simone Ciccolella, Gianluca Della Vedova, Murray Patterson

As a result, new methods such as Pangolin, which can scale to the millions of samples of SARS-CoV-2 currently available, have appeared.

Clustering Vocal Bursts Intensity Prediction

PWM2Vec: An Efficient Embedding Approach for Viral Host Specification from Coronavirus Spike Sequences

no code implementations6 Jan 2022 Sarwan Ali, Babatunde Bello, Prakash Chourasia, Ria Thazhe Punathil, Yijing Zhou, Murray Patterson

In coronaviruses, the surface (S) protein, or spike protein, is an important part of determining host specificity since it is the point of contact between the virus and the host cell membrane.

Open-Ended Question Answering Specificity

Cannot find the paper you are looking for? You can Submit a new open access paper.