Please use this identifier to cite or link to this item:
https://dspace.iiti.ac.in/handle/123456789/16737
Title: | Gpu-enhanced scalable methods for genome sequence feature extraction and clustering |
Authors: | Jha, Preeti Tiwari, Aruna Bharill, Neha Ratnaparkhe, Milind Balkrishna Patel, Om Prakash |
Keywords: | Apache Spark;Feature Extraction;Fuzzy Clustering;Gpu;Scalable Algorithm;Snp;Bioinformatics;Clustering Algorithms;Extraction;Genes;Genome;Program Processors;Scalability;Apache Spark;Clusterings;Feature Extraction Algorithms;Features Extraction;Fuzzy Clustering Algorithm;Genomic Data;Gpu-accelerated;Scalable Algorithms;Scalable Clustering;Single Nucleotide Polymorphisms;Feature Extraction;Fuzzy Clustering;Graphics Processing Unit |
Issue Date: | 2025 |
Publisher: | Springer |
Citation: | Jha, P., Tiwari, A., Bharill, N., Ratnaparkhe, M., & Patel, O. P. (2025). Gpu-enhanced scalable methods for genome sequence feature extraction and clustering. International Journal of System Assurance Engineering and Management. https://doi.org/10.1007/s13198-025-02894-2 |
Abstract: | With the evolution of bioinformatics, vast amounts of genomic data are generated every day. Clustering has been widely used to derive meaningful insights from huge genomic datasets. To deal with such huge amounts of genomic data, scalable clustering algorithms were designed earlier. The main limitation of scalable clustering algorithms is that these methods cannot take the raw form of huge single nucleotide polymorphism (SNP) sequences as input. In this paper, we propose the Scalable GPU accelerated SNP feature extraction (SGPU-SNPfe) algorithm, which preprocesses the raw SNP sequences and produces twelve-dimensional numerical feature vectors. The SGPU-SNPfe enables Spark to utilize GPUs in high-performance computing (HPC). The preprocessed SNP sequences obtained from the SGPU-SNPfe algorithm are used as input to scalable fuzzy clustering algorithms. The experimental results demonstrate the effectiveness of the SGPU-SNPfe algorithm on scalable fuzzy clustering algorithms in terms of the Silhouette Index and Davies-Bouldin Index. © 2025 Elsevier B.V., All rights reserved. |
URI: | https://dx.doi.org/10.1007/s13198-025-02894-2 https://dspace.iiti.ac.in:8080/jspui/handle/123456789/16737 |
ISSN: | 0975-6809 0976-4348 |
Type of Material: | Journal Article |
Appears in Collections: | Department of Computer Science and Engineering |
Files in This Item:
There are no files associated with this item.
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.
Altmetric Badge: