Please use this identifier to cite or link to this item: https://dspace.iiti.ac.in/handle/123456789/11546
Full metadata record
DC FieldValueLanguage
dc.contributor.authorJha, Preetien_US
dc.contributor.authorTiwari, Arunaen_US
dc.contributor.authorPulakitha, Rapoluen_US
dc.contributor.authorChauhan, Aditien_US
dc.date.accessioned2023-04-11T11:16:05Z-
dc.date.available2023-04-11T11:16:05Z-
dc.date.issued2022-
dc.identifier.citationJha, P., Tiwari, A., Bharill, N., Ratnaparkhe, M., Patel, O. P., Pulakitha, R., & Chauhan, A. (2022). High-performance computing based scalable online fuzzy clustering algorithms for big data. Paper presented at the Proceedings of the 2022 IEEE Symposium Series on Computational Intelligence, SSCI 2022, 1400-1407. doi:10.1109/SSCI51031.2022.10022194 Retrieved from www.scopus.comen_US
dc.identifier.isbn978-1665487689-
dc.identifier.issn0000-0000-
dc.identifier.otherEID(2-s2.0-85147794908)-
dc.identifier.urihttps://doi.org/10.1109/SSCI51031.2022.10022194-
dc.identifier.urihttps://dspace.iiti.ac.in/handle/123456789/11546-
dc.description.abstractWith the rise of big data trends so quickly, real-time stream data processing has become very important. Stream data is a type of big, fast, and unreliable dataset that cannot be handled well by traditional algorithms. Designing the algorithm that can efficiently process streaming data is a challenging task. This paper shows how important is to make a real-time clustering algorithm for data streams with high concept drift and an algorithm that can adapt to different dimensions. We propose Scalable Random Sampling Online Optimization Weighted Fuzzy c-Means (SRSOO-WFCM) algorithms for handling Big Data in a High-Performance Computing (HPC) environment using an Apache Spark cluster. To compare SRSOO-WFCM with the traditional Online Fuzzy c-Means (OFCM) algorithm, we made a scalable version of OFCM named SOFCM. The proposed SRSOO-WFCM and SOFCM are incremental algorithms that involve making one sequential run through the data subsets. We employ both loadable and very large datasets to perform extensive experiments that facilitate comparing the proposed SRSOO-WFCM and SOFCM algorithms. The proposed SRSOO-WFCM algorithm performs better than the SOFCM in terms of Normalized Mutual Information (NMI), Adjusted Rand Index (ARI), and F-score, respectively. © 2022 IEEE.en_US
dc.language.isoenen_US
dc.publisherInstitute of Electrical and Electronics Engineers Inc.en_US
dc.sourceProceedings of the 2022 IEEE Symposium Series on Computational Intelligence, SSCI 2022en_US
dc.subjectClustering algorithmsen_US
dc.subjectData streamsen_US
dc.subjectElectric sparksen_US
dc.subjectLarge dataseten_US
dc.subjectApache sparken_US
dc.subjectHigh-performance computingen_US
dc.subjectIncremental streaming algorithmen_US
dc.subjectOnline fuzzy clusteringen_US
dc.subjectOnline optimizationen_US
dc.subjectPerformance computingen_US
dc.subjectRandom samplingen_US
dc.subjectScalable algorithmsen_US
dc.subjectStreaming algorithmen_US
dc.subjectWeighted fuzzy c-meansen_US
dc.subjectFuzzy clusteringen_US
dc.titleHigh-Performance Computing based Scalable Online Fuzzy Clustering Algorithms for Big Dataen_US
dc.typeConference Paperen_US
Appears in Collections:Department of Computer Science and Engineering

Files in This Item:
There are no files associated with this item.


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

Altmetric Badge: