DOI: 10.1142/s0218001409007326 ISSN:

CLASSIFICATION OF IMBALANCED DATA: A REVIEW

YANMIN SUN, ANDREW K. C. WONG, MOHAMED S. KAMEL
  • Artificial Intelligence
  • Computer Vision and Pattern Recognition
  • Software

Classification of data with imbalanced class distribution has encountered a significant drawback of the performance attainable by most standard classifier learning algorithms which assume a relatively balanced class distribution and equal misclassification costs. This paper provides a review of the classification of imbalanced data regarding: the application domains; the nature of the problem; the learning difficulties with standard classifier learning algorithms; the learning objectives and evaluation measures; the reported research solutions; and the class imbalance problem in the presence of multiple classes.