DOI: 10.1142/s0218001409007326 ISSN:
CLASSIFICATION OF IMBALANCED DATA: A REVIEW
YANMIN SUN, ANDREW K. C. WONG, MOHAMED S. KAMEL- Artificial Intelligence
- Computer Vision and Pattern Recognition
- Software
Classification of data with imbalanced class distribution has encountered a significant drawback of the performance attainable by most standard classifier learning algorithms which assume a relatively balanced class distribution and equal misclassification costs. This paper provides a review of the classification of imbalanced data regarding: the application domains; the nature of the problem; the learning difficulties with standard classifier learning algorithms; the learning objectives and evaluation measures; the reported research solutions; and the class imbalance problem in the presence of multiple classes.