Please use this identifier to cite or link to this item:
https://dspace.iiti.ac.in/handle/123456789/4803
Title: | A Novel Scalable Kernelized Fuzzy Clustering Algorithms Based on In-Memory Computation for Handling Big Data |
Authors: | Jha, Preeti Tiwari, Aruna Bharill, Neha Mounika, Mukkamalla |
Keywords: | Big data;Cluster analysis;Cluster computing;Fuzzy clustering;Iterative methods;Radial basis function networks;Fuzzy clustering algorithm;Fuzzy-c means;In-memory computation;Iterative Optimization;Kernelized clustering algorithm;Kernelized fuzzy clustering;Literals;Memory computations;Nonlinear separable;Random sampling;Clustering algorithms |
Issue Date: | 2021 |
Publisher: | Institute of Electrical and Electronics Engineers Inc. |
Citation: | Jha, P., Tiwari, A., Bharill, N., Ratnaparkhe, M., Mounika, M., & Nagendra, N. (2021). A novel scalable kernelized fuzzy clustering algorithms based on in-memory computation for handling big data. IEEE Transactions on Emerging Topics in Computational Intelligence, 5(6), 908-919. doi:10.1109/TETCI.2020.3016302 |
Abstract: | Traditional scalable clustering algorithms mainly deal with the clustering of linearly separable data, but it is challenging to cluster the non-linear separable data efficiently in the feature space. In this article, we propose a novel Kernelized Scalable Random Sampling with Iterative Optimization Fuzzy c-Means (KSRSIO-FCM) clustering algorithm using Big Data framework. To propose the KSRSIO-FCM, we also propose the Kernelized version of Scalable Literal Fuzzy c-Means (KSLFCM) clustering algorithm, which is an integral part of the proposed KSRSIO-FCM algorithm. These kernelized clustering algorithms are evolved to deal with the non-linear separable problems by applying a kernel Radial Basis Functions (RBF) which maps the input data space non-linearly into a high dimensional feature space. We aim to design and implement the kernelized fuzzy clustering algorithms on Apache Spark, which performs the efficient clustering of Big Data due to its in-memory cluster computing technique. Exhaustive experiments are performed on various big datasets to show the effectiveness of proposed KSRSIO-FCM in comparison with other scalable clustering algorithms, i.e., KSLFCM, SRSIO-FCM, and SLFCM. The reported experimental results show that the KSRSIO-FCM algorithm in comparison with KSLFCM, SRSIO-FCM, and SLFCM achieves significant improvement in terms of time and space complexity, Normalized Mutual Information (NMI), Adjusted Rand Index (ARI), and F-score, respectively. Furthermore, we have carried out a performance analysis of KSRSIO-FCM versus KSLFCM. Thus, the reported results show that the KSRSIO-FCM implemented on Apache Spark has better potential for Big Data clustering as compared to traditional scalable fuzzy clustering methods. © 2017 IEEE. |
URI: | https://doi.org/10.1109/TETCI.2020.3016302 https://dspace.iiti.ac.in/handle/123456789/4803 |
ISSN: | 2471-285X |
Type of Material: | Journal Article |
Appears in Collections: | Department of Computer Science and Engineering |
Files in This Item:
There are no files associated with this item.
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.
Altmetric Badge: