Please use this identifier to cite or link to this item:
https://dspace.iiti.ac.in/handle/123456789/6601
Title: | A reduced universum twin support vector machine for class imbalance learning |
Authors: | Richhariya, Bharat Tanveer, M. |
Keywords: | Benchmarking;Digital storage;Large dataset;Matrix algebra;Support vector machines;Vectors;Class imbalance;Imbalance ratio;Rectangular kernel;Twin support vector machines;Universum;Data reduction |
Issue Date: | 2020 |
Publisher: | Elsevier Ltd |
Citation: | Richhariya, B., & Tanveer, M. (2020). A reduced universum twin support vector machine for class imbalance learning. Pattern Recognition, 102 doi:10.1016/j.patcog.2019.107150 |
Abstract: | In most of the real world datasets, there is an imbalance in the number of samples belonging to different classes. Various pattern classification problems such as fault or disease detection involve class imbalanced data. The support vector machine (SVM) classifier becomes biased towards the majority class due to class imbalance. Moreover, in the existing SVM based techniques for class imbalance, there is no information about the distribution of data. Motivated by the idea of prior information about data distribution, a reduced universum twin support vector machine for class imbalance learning (RUTSVM-CIL) is proposed in this paper. For the first time, universum learning is incorporated with SVM to solve the problem of class imbalance. Oversampling and undersampling of data is performed to remove the imbalance in the classes. The universum data points are used to give prior information about the data. To reduce the computation time of our universum based algorithm, we use a small sized rectangular kernel matrix. The reduced kernel matrix needs less storage space, and thus applicable for large scale imbalanced datasets. Comprehensive experimentation is performed on various synthetic, real world and large scale imbalanced datasets. In comparison to the existing approaches for class imbalance, the proposed RUTSVM-CIL gives better generalization performance for most of the benchmark datasets. Also, the computation cost of RUTSVM-CIL is very less, making it suitable for real world applications. © 2020 |
URI: | https://doi.org/10.1016/j.patcog.2019.107150 https://dspace.iiti.ac.in/handle/123456789/6601 |
ISSN: | 0031-3203 |
Type of Material: | Journal Article |
Appears in Collections: | Department of Mathematics |
Files in This Item:
There are no files associated with this item.
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.
Altmetric Badge: