DOI: 10.1142/s1469026824500135 ISSN: 1469-0268

A Hybrid Approach for Binary Classification of Imbalanced Data

Hsinhan Tsai, Ta-Wei Yang, Wai-Man Wong, Han-Yi Kao, Cheng-Fu Chou

Binary classification with an imbalanced dataset is challenging. Models tend to consider all samples as belonging to the majority class. Although existing solutions such as sampling methods, cost-sensitive methods, and ensemble learning methods improve the poor accuracy of the minority class, these methods are limited by overfitting or cost parameters that are difficult to decide. This paper proposes a hybrid approach with dimension reduction that consists of data block construction, dimensionality reduction, and ensemble learning with deep neural network classifiers. The performance is evaluated on eight imbalanced public datasets in terms of recall, G-mean, AUC, F-measure, and balanced accuracy. The results show that the proposed model outperforms state-of-the-art methods.

More from our Archive