Loading [MathJax]/extensions/MathMenu.js
Imbalanced Data Problem in Machine Learning: A Review | IEEE Journals & Magazine | IEEE Xplore

Imbalanced Data Problem in Machine Learning: A Review


This survey explores machine learning techniques for handling imbalanced data, including data-level methods like oversampling and under-sampling, algorithm-level solution...

Abstract:

One of the prominent challenges encountered in real-world data is an imbalance, characterized by unequal distribution of observations across different target classes, whi...Show More

Abstract:

One of the prominent challenges encountered in real-world data is an imbalance, characterized by unequal distribution of observations across different target classes, which complicates achieving accurate model classifications. This survey delves into various machine learning techniques developed to address the difficulties posed by imbalanced data. It discusses data-level methods such as oversampling and undersampling, algorithm-level solutions including ensemble learning and specific algorithm adjustments, cost-sensitive algorithms, and hybrid strategies that combine multiple approaches. Moreover, this paper emphasizes the crucial role of evaluation methods like Precision, F1 Score, Recall, G-mean, and AUC in measuring the effectiveness of these strategies under imbalanced conditions. A detailed review of recent research articles helps pinpoint persistent gaps in generalizability, scalability, and robustness across these methods, underscoring the necessity for ongoing improvements. The survey seeks to offer an extensive overview of current approaches that improve the efficiency and effectiveness of machine learning models dealing with imbalanced datasets, thus equipping researchers with the insights needed to develop robust and effective models ready for real-world application.
This survey explores machine learning techniques for handling imbalanced data, including data-level methods like oversampling and under-sampling, algorithm-level solution...
Published in: IEEE Access ( Volume: 13)
Page(s): 13686 - 13699
Date of Publication: 20 January 2025
Electronic ISSN: 2169-3536

Funding Agency:


References

References is not available for this document.