Please use this identifier to cite or link to this item:
https://dspace.iiti.ac.in/handle/123456789/11546
Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Jha, Preeti | en_US |
dc.contributor.author | Tiwari, Aruna | en_US |
dc.contributor.author | Pulakitha, Rapolu | en_US |
dc.contributor.author | Chauhan, Aditi | en_US |
dc.date.accessioned | 2023-04-11T11:16:05Z | - |
dc.date.available | 2023-04-11T11:16:05Z | - |
dc.date.issued | 2022 | - |
dc.identifier.citation | Jha, P., Tiwari, A., Bharill, N., Ratnaparkhe, M., Patel, O. P., Pulakitha, R., & Chauhan, A. (2022). High-performance computing based scalable online fuzzy clustering algorithms for big data. Paper presented at the Proceedings of the 2022 IEEE Symposium Series on Computational Intelligence, SSCI 2022, 1400-1407. doi:10.1109/SSCI51031.2022.10022194 Retrieved from www.scopus.com | en_US |
dc.identifier.isbn | 978-1665487689 | - |
dc.identifier.issn | 0000-0000 | - |
dc.identifier.other | EID(2-s2.0-85147794908) | - |
dc.identifier.uri | https://doi.org/10.1109/SSCI51031.2022.10022194 | - |
dc.identifier.uri | https://dspace.iiti.ac.in/handle/123456789/11546 | - |
dc.description.abstract | With the rapid growth of big data, real-time stream data processing has become very important. Stream data is large, fast-arriving, and unreliable, and traditional algorithms cannot handle it well. Designing an algorithm that can efficiently process streaming data is a challenging task. This paper shows how important it is to develop a real-time clustering algorithm for data streams with high concept drift, and one that can adapt to different dimensions. We propose Scalable Random Sampling Online Optimization Weighted Fuzzy c-Means (SRSOO-WFCM) algorithms for handling Big Data in a High-Performance Computing (HPC) environment using an Apache Spark cluster. To compare SRSOO-WFCM with the traditional Online Fuzzy c-Means (OFCM) algorithm, we made a scalable version of OFCM named SOFCM. The proposed SRSOO-WFCM and SOFCM are incremental algorithms that make one sequential run through the data subsets. We use both loadable and very large datasets in extensive experiments comparing the proposed SRSOO-WFCM and SOFCM algorithms. The proposed SRSOO-WFCM algorithm performs better than SOFCM in terms of Normalized Mutual Information (NMI), Adjusted Rand Index (ARI), and F-score. © 2022 IEEE. | en_US |
dc.language.iso | en | en_US |
dc.publisher | Institute of Electrical and Electronics Engineers Inc. | en_US |
dc.source | Proceedings of the 2022 IEEE Symposium Series on Computational Intelligence, SSCI 2022 | en_US |
dc.subject | Clustering algorithms | en_US |
dc.subject | Data streams | en_US |
dc.subject | Electric sparks | en_US |
dc.subject | Large dataset | en_US |
dc.subject | Apache spark | en_US |
dc.subject | High-performance computing | en_US |
dc.subject | Incremental streaming algorithm | en_US |
dc.subject | Online fuzzy clustering | en_US |
dc.subject | Online optimization | en_US |
dc.subject | Performance computing | en_US |
dc.subject | Random sampling | en_US |
dc.subject | Scalable algorithms | en_US |
dc.subject | Streaming algorithm | en_US |
dc.subject | Weighted fuzzy c-means | en_US |
dc.subject | Fuzzy clustering | en_US |
dc.title | High-Performance Computing based Scalable Online Fuzzy Clustering Algorithms for Big Data | en_US |
dc.type | Conference Paper | en_US |
Appears in Collections: | Department of Computer Science and Engineering |
Files in This Item:
There are no files associated with this item.
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.