Please use this identifier to cite or link to this item: https://dspace.iiti.ac.in/handle/123456789/4817
Full metadata record
DC FieldValueLanguage
dc.contributor.authorJha, Preetien_US
dc.contributor.authorTiwari, Arunaen_US
dc.contributor.authorBharill, Nehaen_US
dc.contributor.authorMounika, Mukkamallaen_US
dc.date.accessioned2022-03-17T01:00:00Z-
dc.date.accessioned2022-03-17T15:35:37Z-
dc.date.available2022-03-17T01:00:00Z-
dc.date.available2022-03-17T15:35:37Z-
dc.date.issued2021-
dc.identifier.citationJha, P., Tiwari, A., Bharill, N., Ratnaparkhe, M., Nagendra, N., & Mounika, M. (2021). Scalable incremental fuzzy consensus clustering algorithm for handling big data. Soft Computing, 25(13), 8703-8719. doi:10.1007/s00500-021-05733-1en_US
dc.identifier.issn1432-7643-
dc.identifier.otherEID(2-s2.0-85103356568)-
dc.identifier.urihttps://doi.org/10.1007/s00500-021-05733-1-
dc.identifier.urihttps://dspace.iiti.ac.in/handle/123456789/4817-
dc.description.abstractConsensus clustering can produce novel, stable, and robust clustering results. Consensus clustering intends to merge a few existing basic segments into a coordinated one, and this has been broadly perceived as a promising solution for heterogeneous data clustering for big data. Even though many clustering algorithms have been proposed, getting a decent quality segment with high effectiveness is still not yet decided. In this paper, we propose a scalable incremental fuzzy consensus clustering (SIFCC) algorithm for a big data framework. It has been implemented on Apache Spark cluster framework, a distributed data stream environment for handling big data by considering the data as a set of data subsets that are processed incrementally. Sparks work great for iterative algorithms by supporting in-memory calculations, scalability, etc. SIFCC not only facilitates efficient big data clustering, but also improves the quality of clusters, performs storage space optimization, and time complexity during clustering. To establish the comparison, we designed and implemented the scalable model of existing fuzzy consensus clustering (FCC) on Apache Spark cluster, named as a scalable fuzzy consensus clustering (SFCC). Extensive experiments on real-world datasets show that the SIFCC algorithm achieves the better potential for clustering of Big Data in comparison with SFCC. © 2021, The Author(s), under exclusive licence to Springer-Verlag GmbH Germany, part of Springer Nature.en_US
dc.language.isoenen_US
dc.publisherSpringer Science and Business Media Deutschland GmbHen_US
dc.sourceSoft Computingen_US
dc.subjectCluster analysisen_US
dc.subjectData streamsen_US
dc.subjectDigital storageen_US
dc.subjectIterative methodsen_US
dc.subjectLarge dataseten_US
dc.subjectCluster frameworken_US
dc.subjectConsensus clusteringen_US
dc.subjectDistributed data streamsen_US
dc.subjectHeterogeneous data clusteringen_US
dc.subjectIterative algorithmen_US
dc.subjectQuality segmentsen_US
dc.subjectReal-world datasetsen_US
dc.subjectRobust clusteringen_US
dc.subjectClustering algorithmsen_US
dc.titleScalable incremental fuzzy consensus clustering algorithm for handling big dataen_US
dc.typeJournal Articleen_US
Appears in Collections:Department of Computer Science and Engineering

Files in This Item:
There are no files associated with this item.


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

Altmetric Badge: