Please use this identifier to cite or link to this item: https://dspace.iiti.ac.in/handle/123456789/6601
Title: A reduced universum twin support vector machine for class imbalance learning
Authors: Richhariya, Bharat
Tanveer, M.
Keywords: Benchmarking;Digital storage;Large dataset;Matrix algebra;Support vector machines;Vectors;Class imbalance;Imbalance ratio;Rectangular kernel;Twin support vector machines;Universum;Data reduction
Issue Date: 2020
Publisher: Elsevier Ltd
Citation: Richhariya, B., & Tanveer, M. (2020). A reduced universum twin support vector machine for class imbalance learning. Pattern Recognition, 102 doi:10.1016/j.patcog.2019.107150
Abstract: In most of the real world datasets, there is an imbalance in the number of samples belonging to different classes. Various pattern classification problems such as fault or disease detection involve class imbalanced data. The support vector machine (SVM) classifier becomes biased towards the majority class due to class imbalance. Moreover, in the existing SVM based techniques for class imbalance, there is no information about the distribution of data. Motivated by the idea of prior information about data distribution, a reduced universum twin support vector machine for class imbalance learning (RUTSVM-CIL) is proposed in this paper. For the first time, universum learning is incorporated with SVM to solve the problem of class imbalance. Oversampling and undersampling of data is performed to remove the imbalance in the classes. The universum data points are used to give prior information about the data. To reduce the computation time of our universum based algorithm, we use a small sized rectangular kernel matrix. The reduced kernel matrix needs less storage space, and thus applicable for large scale imbalanced datasets. Comprehensive experimentation is performed on various synthetic, real world and large scale imbalanced datasets. In comparison to the existing approaches for class imbalance, the proposed RUTSVM-CIL gives better generalization performance for most of the benchmark datasets. Also, the computation cost of RUTSVM-CIL is very less, making it suitable for real world applications. © 2020
URI: https://doi.org/10.1016/j.patcog.2019.107150
https://dspace.iiti.ac.in/handle/123456789/6601
ISSN: 0031-3203
Type of Material: Journal Article
Appears in Collections:Department of Mathematics

Files in This Item:
There are no files associated with this item.


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

Altmetric Badge: