Please use this identifier to cite or link to this item: https://dspace.iiti.ac.in/handle/123456789/4803
Title: A Novel Scalable Kernelized Fuzzy Clustering Algorithms Based on In-Memory Computation for Handling Big Data
Authors: Jha, Preeti
Tiwari, Aruna
Bharill, Neha
Mounika, Mukkamalla
Keywords: Big data;Cluster analysis;Cluster computing;Fuzzy clustering;Iterative methods;Radial basis function networks;Fuzzy clustering algorithm;Fuzzy-c means;In-memory computation;Iterative Optimization;Kernelized clustering algorithm;Kernelized fuzzy clustering;Literals;Memory computations;Nonlinear separable;Random sampling;Clustering algorithms
Issue Date: 2021
Publisher: Institute of Electrical and Electronics Engineers Inc.
Citation: Jha, P., Tiwari, A., Bharill, N., Ratnaparkhe, M., Mounika, M., & Nagendra, N. (2021). A novel scalable kernelized fuzzy clustering algorithms based on in-memory computation for handling big data. IEEE Transactions on Emerging Topics in Computational Intelligence, 5(6), 908-919. doi:10.1109/TETCI.2020.3016302
Abstract: Traditional scalable clustering algorithms mainly deal with the clustering of linearly separable data, but it is challenging to cluster the non-linear separable data efficiently in the feature space. In this article, we propose a novel Kernelized Scalable Random Sampling with Iterative Optimization Fuzzy c-Means (KSRSIO-FCM) clustering algorithm using Big Data framework. To propose the KSRSIO-FCM, we also propose the Kernelized version of Scalable Literal Fuzzy c-Means (KSLFCM) clustering algorithm, which is an integral part of the proposed KSRSIO-FCM algorithm. These kernelized clustering algorithms are evolved to deal with the non-linear separable problems by applying a kernel Radial Basis Functions (RBF) which maps the input data space non-linearly into a high dimensional feature space. We aim to design and implement the kernelized fuzzy clustering algorithms on Apache Spark, which performs the efficient clustering of Big Data due to its in-memory cluster computing technique. Exhaustive experiments are performed on various big datasets to show the effectiveness of proposed KSRSIO-FCM in comparison with other scalable clustering algorithms, i.e., KSLFCM, SRSIO-FCM, and SLFCM. The reported experimental results show that the KSRSIO-FCM algorithm in comparison with KSLFCM, SRSIO-FCM, and SLFCM achieves significant improvement in terms of time and space complexity, Normalized Mutual Information (NMI), Adjusted Rand Index (ARI), and F-score, respectively. Furthermore, we have carried out a performance analysis of KSRSIO-FCM versus KSLFCM. Thus, the reported results show that the KSRSIO-FCM implemented on Apache Spark has better potential for Big Data clustering as compared to traditional scalable fuzzy clustering methods. © 2017 IEEE.
URI: https://doi.org/10.1109/TETCI.2020.3016302
https://dspace.iiti.ac.in/handle/123456789/4803
ISSN: 2471-285X
Type of Material: Journal Article
Appears in Collections:Department of Computer Science and Engineering

Files in This Item:
There are no files associated with this item.


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

Altmetric Badge: