Faculty Publications

Density peak-based pre-clustering support vector machine for multi-class imbalanced classification

Zonglin Di, Tongji University
Qi Kang, Tongji University
Daogang Peng, Shanghai University of Electric Power
Mengchu Zhou, Newark College of Engineering

Document Type

Conference Proceeding

Publication Date

10-1-2019

Abstract

Imbalanced classification using a support vector machine (SVM) is a normal but crucial problem in machine learning. Compared with binary classification, multiclass classification is much more complicated. Most existing studies on imbalanced classification using SVM focus on binary imbalanced classification; while only few of them look into imbalanced classification with multiple classes. Pre-clustering is a useful technique to prepare proper data from an imbalanced dataset for a classifier. It can be used to extract the feature of a dataset first and improve classification performance. Density peak based on Euclidean distance proves its effectiveness and generality in clustering. Motivated by this and the fact that the number of clusters is known in multi-class classification using a one-vs-rest strategy, we combine density peak clustering and SVM to propose a new pre-clustering method to perform effective imbalanced classification with multiple classes. Specifically, we transform a multi-class classification problem into several binary classification tasks. The results on 5 public datasets in terms of F-measure, G-mean and Area Under Curve (AUC) show its superiority over the original SVM and SVM with other methods including random under-sampling, Synthetic Minority Oversampling Technique, pre-clustering using K-Means and EasyEnsemble methods using either a one-vs-rest or one-vs-one strategy.

Identifier

85076726688 (Scopus)

ISBN

[9781728145693]

Publication Title

Conference Proceedings IEEE International Conference on Systems Man and Cybernetics

External Full Text Location

https://doi.org/10.1109/SMC.2019.8914451

ISSN

1062922X

First Page

Last Page

Volume

2019-October

Grant

51775385

Fund Ref

National Natural Science Foundation of China

Recommended Citation

Di, Zonglin; Kang, Qi; Peng, Daogang; and Zhou, Mengchu, "Density peak-based pre-clustering support vector machine for multi-class imbalanced classification" (2019). Faculty Publications. 7312.
https://digitalcommons.njit.edu/fac_pubs/7312

This document is currently not available here.

COinS

DOI

10.1109/SMC.2019.8914451

Faculty Publications

Density peak-based pre-clustering support vector machine for multi-class imbalanced classification

Document Type

Publication Date

Abstract

Identifier

ISBN

Publication Title

External Full Text Location

ISSN

First Page

Last Page

Volume

Grant

Fund Ref

Recommended Citation

DOI

Search

Browse

Author Corner

Links

Faculty Publications

Density peak-based pre-clustering support vector machine for multi-class imbalanced classification

Authors

Document Type

Publication Date

Abstract

Identifier

ISBN

Publication Title

External Full Text Location

ISSN

First Page

Last Page

Volume

Grant

Fund Ref

Recommended Citation

Share

DOI

Search

Browse

Author Corner

Links