Please use this identifier to cite or link to this item: https://dspace.iiti.ac.in/handle/123456789/11546
Title: High-Performance Computing based Scalable Online Fuzzy Clustering Algorithms for Big Data
Authors: Jha, Preeti
Tiwari, Aruna
Pulakitha, Rapolu
Chauhan, Aditi
Keywords: Clustering algorithms;Data streams;Electric sparks;Large dataset;Apache spark;High-performance computing;Incremental streaming algorithm;Online fuzzy clustering;Online optimization;Performance computing;Random sampling;Scalable algorithms;Streaming algorithm;Weighted fuzzy c-means;Fuzzy clustering
Issue Date: 2022
Publisher: Institute of Electrical and Electronics Engineers Inc.
Citation: Jha, P., Tiwari, A., Bharill, N., Ratnaparkhe, M., Patel, O. P., Pulakitha, R., & Chauhan, A. (2022). High-performance computing based scalable online fuzzy clustering algorithms for big data. Paper presented at the Proceedings of the 2022 IEEE Symposium Series on Computational Intelligence, SSCI 2022, 1400-1407. doi:10.1109/SSCI51031.2022.10022194 Retrieved from www.scopus.com
Abstract: With the rise of big data trends so quickly, real-time stream data processing has become very important. Stream data is a type of big, fast, and unreliable dataset that cannot be handled well by traditional algorithms. Designing the algorithm that can efficiently process streaming data is a challenging task. This paper shows how important is to make a real-time clustering algorithm for data streams with high concept drift and an algorithm that can adapt to different dimensions. We propose Scalable Random Sampling Online Optimization Weighted Fuzzy c-Means (SRSOO-WFCM) algorithms for handling Big Data in a High-Performance Computing (HPC) environment using an Apache Spark cluster. To compare SRSOO-WFCM with the traditional Online Fuzzy c-Means (OFCM) algorithm, we made a scalable version of OFCM named SOFCM. The proposed SRSOO-WFCM and SOFCM are incremental algorithms that involve making one sequential run through the data subsets. We employ both loadable and very large datasets to perform extensive experiments that facilitate comparing the proposed SRSOO-WFCM and SOFCM algorithms. The proposed SRSOO-WFCM algorithm performs better than the SOFCM in terms of Normalized Mutual Information (NMI), Adjusted Rand Index (ARI), and F-score, respectively. © 2022 IEEE.
URI: https://doi.org/10.1109/SSCI51031.2022.10022194
https://dspace.iiti.ac.in/handle/123456789/11546
ISBN: 978-1665487689
ISSN: 0000-0000
Type of Material: Conference Paper
Appears in Collections:Department of Computer Science and Engineering

Files in This Item:
There are no files associated with this item.


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

Altmetric Badge: