
Imbalanced Data Problem in Machine Learning: A Review




Abstract:

One of the prominent challenges encountered in real-world data is an imbalance, characterized by unequal distribution of observations across different target classes, which complicates achieving accurate model classifications. This survey delves into various machine learning techniques developed to address the difficulties posed by imbalanced data. It discusses data-level methods such as oversampling and undersampling, algorithm-level solutions including ensemble learning and specific algorithm adjustments, cost-sensitive algorithms, and hybrid strategies that combine multiple approaches. Moreover, this paper emphasizes the crucial role of evaluation methods like Precision, F1 Score, Recall, G-mean, and AUC in measuring the effectiveness of these strategies under imbalanced conditions. A detailed review of recent research articles helps pinpoint persistent gaps in generalizability, scalability, and robustness across these methods, underscoring the necessity for ongoing improvements. The survey seeks to offer an extensive overview of current approaches that improve the efficiency and effectiveness of machine learning models dealing with imbalanced datasets, thus equipping researchers with the insights needed to develop robust and effective models ready for real-world application.
Published in: IEEE Access (Volume: 13)
Page(s): 13686 - 13699
Date of Publication: 20 January 2025
Electronic ISSN: 2169-3536



SECTION I.

Introduction

Imbalanced data poses a major challenge in machine learning. It arises when the class distribution within a dataset is uneven [1]: one class is significantly larger than the others, so most instances belong to the dominant class (the negative or majority class) while only a small number fall into the other classes (the positive or minority class) [2]. This creates a fundamental problem, because classifiers trained on such skewed data become biased in favor of the dominant class and often overlook the minority class [3]. The result is the misclassification of minority-class instances, which is usually exceedingly critical because the cases of interest occur in only a few examples relative to the whole dataset [2]. Imbalanced datasets present four key challenges: bias, overlap, feature vector size, and dataset size [4]. The scenario is prevalent across domains such as healthcare (e.g., diabetes diagnosis [5] and skin lesion classification [6]), finance (e.g., credit fraud detection [7]), and engineering (e.g., fault detection in wind turbines [8] and the recognition of rotting or dead trees [9], [10]), among others.

The consequences of imbalanced data are far-reaching. In situations such as medical diagnosis, where accurately identifying rare diseases is vital, models that favor the majority class may lead to overlooked diagnoses and compromised patient care [11]. Similarly, in fraud detection, a machine learning model biased toward the predominant class yields a decrease in True Positives (TP) and an increase in False Positives (FP) [12], allowing fraudulent activities to go undetected. Addressing imbalanced data is therefore imperative for building reliable and effective ML systems.

Over the years, researchers have developed numerous methods to counteract the effects of class imbalance, encompassing three primary classifications: data-level techniques, algorithm-level techniques, and integrated methods [7], [13]. Data-level techniques involve modifying the dataset before the classifier is trained, including under-sampling the dominant class, over-sampling the minor class, or creating synthetic data [7], [14]. Conversely, algorithm-level techniques modify the learning algorithms to manage imbalanced data more efficiently without compromising the data itself, often through cost-sensitive learning, ensemble methods, or algorithm-specific adjustments [13].

While data-level and algorithm-level techniques offer valuable strategies for addressing imbalanced data, they each have limitations. Data-level techniques may discard potentially useful information or introduce noise through synthetic data generation [15]. Algorithm-level techniques, while effective, may not fully exploit the available data or may require complex adjustments for different algorithms [13].

Hybrid approaches combine data-level and algorithm-level techniques [7], aiming to leverage each category's strengths while mitigating their weaknesses. Examples include combined sampling techniques, algorithmic resampling strategies, and ensembles of resampled datasets. Evaluation metrics like accuracy and precision may not adequately capture model performance on imbalanced datasets. Hence, metrics such as the area under the precision-recall (PR) curve, the area under the receiver operating characteristic (ROC) curve, and the F1-score have been proposed and shown to be effective in classification tasks with imbalanced data [16]. Additionally, researchers have introduced class-weighted evaluation frameworks that accommodate arbitrary skews in class cardinalities and importance, effectively addressing challenges presented by imbalanced datasets.

The subsequent sections of the paper will follow this structure. Section II outlines the review strategy, including the research questions that guide the survey. Next, Section III delves into the foundational techniques of oversampling and undersampling, providing an overview of their role in addressing data imbalance. Section IV explores a broad spectrum of balance strategies, categorized into data-level, algorithm-level, and hybrid approaches, with each subsection detailing methods to enhance model performance when dealing with imbalanced datasets. In Section V, the limitations of each technique are discussed, along with justifications for their inclusion. Following this, Section VI examines various evaluation methods essential for assessing the effectiveness of models handling imbalanced data, highlighting metrics like F1 Score, AUC, and others to provide a nuanced understanding of model efficacy. Finally, Section VII synthesizes these insights to offer a conclusive summary of the current techniques for managing imbalanced data in machine learning, pinpointing existing gaps in each approach.

SECTION II.

Review Strategy

This survey investigates the imbalanced data problem in machine learning by reviewing studies from various databases, including Google Scholar, IEEE, Springer, Elsevier, MDPI, and others. Emphasis is placed on recent advancements, with most reviewed studies published between 2020 and 2024, ensuring the latest methodologies and insights are included. The search process utilized specific keywords such as “imbalanced data,” “class imbalance,” “machine learning,” “data-level techniques,” “algorithm-level solutions,” “oversampling,” “undersampling,” “SMOTE,” “cost-sensitive learning,” “Ensemble methods,” and “hybrid approaches.” This review has investigated 40 studies, and most of the papers are from journals ranked in Q1 and Q2 categories, ensuring high-quality and impactful contributions. The selected references encompass foundational approaches, innovative techniques, and domain-specific solutions, providing a comprehensive analysis of the field. This strategy offers a deep understanding of current approaches to addressing data imbalance challenges while identifying research gaps that indirectly illuminate potential future directions.

This review aims to address key questions that encompass all aspects of the imbalanced data problem in machine learning:

  1. Q1: How effective are fundamental approaches, such as oversampling and undersampling, in addressing class imbalance across applications?

  2. Q2: What are the findings and limitations of data-level, algorithm-level, and hybrid techniques in achieving class balance?

  3. Q3: How do these limitations constrain the overall performance of the balancing techniques?

  4. Q4: Which evaluation metrics best assess the success of balancing techniques in ML and DL tasks?

The first section of the review focuses on answering the first question, providing a foundational understanding. The second section elaborates on the second question, offering detailed insights. The third question is addressed by analyzing the limitations and shortcomings of various techniques, as discussed in the third section of the review. Finally, the fourth part provides a concise and clear response to the fourth question, wrapping up the review comprehensively.

SECTION III.

Fundamental Approaches to Class Distribution Balancing

This section delves into the fundamental methods for mitigating class imbalance in ML: oversampling and undersampling, shown in Figure 1. Class imbalance is critical in predictive modeling, often resulting in biased model outcomes that favor the majority class disproportionately. Addressing this imbalance is essential to foster fair and precise models. Oversampling and undersampling are two key techniques that adjust the composition of classes in training datasets to create a more balanced environment for model training. A thorough exploration of these foundational techniques prepares the groundwork for the advanced balancing methods detailed in subsequent sections of this survey.

FIGURE 1. Basic approaches to class distribution balance.

A. Oversampling

Oversampling enhances the minority class's representation within a dataset, bringing its frequency up to parity with the majority class. This adjustment can be realized by simply duplicating existing instances or by creating new, synthetic ones through methodologies like SMOTE (Synthetic Minority Oversampling Technique) [17]. SMOTE and its derivatives, such as Borderline-SMOTE and ADASYN, synthesize new samples by interpolating along the line segments connecting minority class instances to their nearest neighbors within the same class. These approaches operate primarily in the feature space, thereby injecting a higher degree of diversity into the minority class and supporting the model's capability to generalize from limited data [18].
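As a minimal sketch of this idea, assuming the imbalanced-learn library and a synthetic toy dataset (neither is part of the survey itself):

```python
from collections import Counter

from sklearn.datasets import make_classification
from imblearn.over_sampling import SMOTE

# Synthetic binary dataset with roughly a 95:5 class ratio.
X, y = make_classification(n_samples=2000, weights=[0.95, 0.05], random_state=0)
print(Counter(y))  # heavily skewed, e.g. ~1900 majority vs ~100 minority

# SMOTE interpolates between each minority point and its k nearest
# minority-class neighbors to synthesize new feature vectors.
X_res, y_res = SMOTE(k_neighbors=5, random_state=0).fit_resample(X, y)
print(Counter(y_res))  # both classes now equally represented
```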

Advantages:

  • Increases the model’s generalization abilities by introducing a more comprehensive range of variability within the minority class, thus preparing the model for broader scenarios.

  • Safeguards against the loss of essential information in minority class instances, which is particularly vital in datasets where each example holds significant value.

Disadvantages:

  • There is a risk of overfitting, as models might begin to memorize the noise inherent in the synthetically generated samples rather than learn to generalize from the actual data.

  • Elevates the computational burden, especially when employing sophisticated synthetic instance generation techniques, which can be resource-intensive.

B. Undersampling

In contrast to oversampling, undersampling balances the class distribution by reducing the size of the majority class, often through the random deletion of its instances [17]. More refined techniques such as Cluster Centroids, Tomek Links, or Near Miss maintain the statistical integrity of the majority class while minimizing its quantity [19]. These methods enhance model performance by eliminating instances that are either redundant or less informative, thereby creating a more balanced dataset that stops the model from being dominated by the traits of the predominant class.
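A corresponding sketch for undersampling, again assuming imbalanced-learn and toy data; `RandomUnderSampler` illustrates random deletion, while `TomekLinks` illustrates the more refined boundary-cleaning variant:

```python
from collections import Counter

from sklearn.datasets import make_classification
from imblearn.under_sampling import RandomUnderSampler, TomekLinks

X, y = make_classification(n_samples=2000, weights=[0.9, 0.1], random_state=0)

# Random deletion of majority instances until the classes reach parity.
X_rus, y_rus = RandomUnderSampler(random_state=0).fit_resample(X, y)
print(Counter(y_rus))

# Tomek Links removes only majority instances that form cross-class
# nearest-neighbor pairs, cleaning the class boundary instead of
# forcing an exact 1:1 balance.
X_tl, y_tl = TomekLinks().fit_resample(X, y)
print(Counter(y_tl))
```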

Advantages:

  • Significantly reduces the time required for model training by decreasing the dataset size, simplifying the learning process.

  • Reduces the likelihood of model bias toward the majority class, promoting more fair and balanced decision-making.

Disadvantages:

  • There is a danger of losing essential data, as the removal process may inadvertently discard instances crucial to comprehensively understanding the predominant class.

  • May lead to underfitting, especially if the diversity within the majority class is not fully captured, potentially impairing the model’s ability to generalize effectively.

C. Strategic Considerations for Choosing Between Oversampling and Undersampling

Choosing between oversampling and undersampling requires careful consideration of multiple aspects, such as dataset size, data characteristics, available computing resources, and the significance of minority class instances. Typically, oversampling is preferred in scenarios where the minority class includes critical, rare events that are essential to capture accurately, such as in fraud detection or diagnosing rare medical conditions. Conversely, undersampling is often more advantageous for extremely large datasets, where reducing the volume of data can significantly enhance computational efficiency and where there is enough data redundancy to minimize the risk of losing important information [20].

To achieve the best of both worlds, hybrid approaches that merge elements of both oversampling and undersampling are becoming more prevalent. Methods like SMOTEENN, combining SMOTE with Edited Nearest Neighbors, or various ensemble techniques incorporating different resampling strategies within a single classifier framework can provide a more balanced dataset [21]. These hybrid methods help ensure that models are not only efficient but also retain the integrity and diversity of the data, thus enhancing overall model performance without sacrificing detail or computational speed.
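For illustration, both combinations mentioned above are available off the shelf in imbalanced-learn; the dataset here is a synthetic stand-in:

```python
from sklearn.datasets import make_classification
from imblearn.combine import SMOTEENN, SMOTETomek

X, y = make_classification(n_samples=2000, weights=[0.9, 0.1], random_state=0)

# SMOTE oversampling followed by Edited Nearest Neighbours cleaning.
X_enn, y_enn = SMOTEENN(random_state=0).fit_resample(X, y)

# SMOTE oversampling followed by removal of Tomek links.
X_tmk, y_tmk = SMOTETomek(random_state=0).fit_resample(X, y)
```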

SECTION IV.

Balance Techniques

Many dataset characteristics determine the most suitable techniques for addressing imbalance (data-level, algorithm-level, or hybrid). One key factor is the degree of imbalance, which varies from dataset to dataset; knowing it clarifies whether the class distribution is highly or only moderately skewed. Assessing the degree of imbalance is therefore critical for effectively handling imbalanced datasets and building reliable ML models.

A. Data-Level Techniques

Data-level techniques, which align class distributions by adjusting the size of training datasets through resampling, have become widely adopted [22]. Resampling aims to equalize the class distribution through two methods: undersampling and oversampling [23]. Although resampling directly balances the training set, it introduces two main challenges: oversampling may lead to overfitting and reduced generalization on the test set, whereas undersampling may result in a significant loss of knowledge from the majority class [13]. Standard undersampling methods include Random Undersampling, Tomek Links, and Cluster Centroids [24]. To prevent the substantial depletion of instances from the predominant class, oversampling techniques are often preferred [4]. Prominent oversampling methods include SMOTE, ADASYN (Adaptive Synthetic Sampling), and Borderline-SMOTE [24].

SMOTE represents a commonly adopted oversampling method [25]. It identifies nearby instances within the feature space, establishes connections between them, and generates new samples along those connections. Nonetheless, generating synthetic examples without accounting for the majority class can create ambiguous instances, especially when there is significant overlap between classes, which is a notable drawback of this approach [13].

Many hybrid-sampling methods and SMOTE variants, shown in Table 1, have been proposed to address these challenges. SMOTE-N (Synthetic Minority Over-sampling Technique for Nominal data) modifies SMOTE to suit nominal attributes [25]. SMOTEENN (combining SMOTE with Edited Nearest Neighbors) aims to rectify class imbalance by oversampling the minority class and enhancing dataset quality through the elimination of noisy samples, while SMOTE-Tomek (combining SMOTE with Tomek links) simultaneously creates synthetic samples for the minority class and undersamples the majority class, effectively rebalancing the dataset and improving classification outcomes [26]. Distance-based SMOTE (D-SMOTE) regulates class overlap through a distance parameter, creating synthetic samples that better represent the minority class. Bi-phasic SMOTE (BP-SMOTE), on the other hand, overcomes traditional SMOTE's shortcomings by enhancing the oversampling procedure through instance selection, guaranteeing the inclusion of only pertinent instances in the resultant training dataset [4]. CDSMOTE combines class decomposition and synthetic minority oversampling: it starts by dividing the majority class into subclasses and then uses SMOTE to increase the sample size of the minority class, striving to attain a balanced data distribution while retaining crucial information [27]. The Radius Synthetic Minority Oversampling Technique (RSMOTE), unlike traditional SMOTE, which connects minority samples to create synthetic instances along line segments, identifies the nearest samples from the majority class within a specified radius to generate synthetic data points, aiding in the creation of more diverse and realistic synthetic samples [28]. SASMOTE (a self-inspected adaptive SMOTE approach) overcomes traditional SMOTE limitations by prioritizing visible neighbors and eliminating low-quality samples; integrating adaptive nearest-neighborhood selection and self-inspection for uncertainty evaluation elevates the quality of resampled data, which is particularly beneficial for highly imbalanced healthcare classification tasks [29]. Borderline-SMOTE is a sampling technique employed for imbalanced datasets, particularly in situations such as fraud detection in credit card transactions; it generates synthetic samples for the underrepresented class by targeting instances close to the decision boundary between classes, often known as borderline instances [30]. The Oriented Oversampling with Spatial Information method (OOSI) tackles challenges in imbalanced and noisy datasets through a robust and adaptive approach comprising three critical phases: Oriented Information Sampling, Spatial Information Quantification, and Adaptive Data Space Partitioning [31]. The Synthetic and Dependent Wild Bootstrapped Oversampling method (SDWBOTE) helps overcome the challenges of skewed data in fault detection and localization tasks within wind turbine systems by considering temporal dependencies and relationships among samples [8].

TABLE 1. Effectiveness and Limitations of Hybrid-Sampling Techniques
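Two of the variants discussed above have reference implementations in imbalanced-learn; the following sketch (on toy data) shows how they differ in where they place synthetic points:

```python
from sklearn.datasets import make_classification
from imblearn.over_sampling import BorderlineSMOTE, ADASYN

X, y = make_classification(n_samples=2000, weights=[0.9, 0.1], random_state=0)

# Borderline-SMOTE synthesizes only from minority points that lie
# near the decision boundary (the "danger" region).
X_b, y_b = BorderlineSMOTE(kind="borderline-1", random_state=0).fit_resample(X, y)

# ADASYN generates more synthetic points where the minority class is
# hardest to learn, i.e. where minority samples are surrounded by
# majority-class neighbors.
X_a, y_a = ADASYN(random_state=0).fit_resample(X, y)
```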

GAN-based methods, leveraging Generative Adversarial Networks (GANs) composed of a generator and a discriminator, are gaining traction. These techniques generate synthetic samples by learning the inherent data distribution and producing new samples that closely resemble actual data. For instance, the GAN-based data augmentation method introduced by [6] seeks to improve the classification accuracy of imbalanced skin lesion datasets. Another study explores GANs to address challenges associated with imbalanced datasets in machine learning; focusing on three real-world datasets (Car Evaluation, Human Activity Recognition, and Bank), it aims to enhance minority class representation through data augmentation. The GAN-based approach generates synthetic data to balance the dataset, improving classification accuracy and model performance, and the study underscores the potential of GANs as an effective tool for data augmentation and for boosting model robustness across diverse applications [33]. An innovative Active Balancing Mechanism (ABM) is proposed to tackle the challenges of imbalanced medical data, focusing on myocardial infarction (MI) detection using electrocardiogram (ECG) signals. The ABM incorporates Gaussian naïve Bayes and entropy to enhance classification accuracy and reliability. Additionally, a modified convolutional neural network (MCNN) is developed to further optimize performance, showcasing the method's potential for real-time monitoring and decision-making in clinical settings [32].
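To make the mechanics concrete, the following is a deliberately tiny PyTorch sketch of GAN-based minority oversampling for tabular features; the architecture, dimensions, and training length are illustrative assumptions, not the setups used in [6], [32], or [33]:

```python
import torch
import torch.nn as nn

latent_dim, n_features = 16, 10  # illustrative sizes

# Generator maps noise to synthetic minority-class feature vectors;
# the discriminator scores whether a vector looks like a real one.
G = nn.Sequential(nn.Linear(latent_dim, 64), nn.ReLU(), nn.Linear(64, n_features))
D = nn.Sequential(nn.Linear(n_features, 64), nn.LeakyReLU(0.2), nn.Linear(64, 1))

opt_g = torch.optim.Adam(G.parameters(), lr=2e-4)
opt_d = torch.optim.Adam(D.parameters(), lr=2e-4)
bce = nn.BCEWithLogitsLoss()

X_min = torch.randn(200, n_features)  # stand-in for real minority-class rows

for step in range(1000):
    # Discriminator step: real minority rows vs. frozen generator output.
    real = X_min[torch.randint(len(X_min), (64,))]
    fake = G(torch.randn(64, latent_dim)).detach()
    loss_d = bce(D(real), torch.ones(64, 1)) + bce(D(fake), torch.zeros(64, 1))
    opt_d.zero_grad(); loss_d.backward(); opt_d.step()

    # Generator step: produce samples the discriminator labels as real.
    loss_g = bce(D(G(torch.randn(64, latent_dim))), torch.ones(64, 1))
    opt_g.zero_grad(); loss_g.backward(); opt_g.step()

# After training, G(torch.randn(k, latent_dim)) yields k synthetic minority rows.
```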

B. Algorithm-Level Techniques

Algorithmic strategies enhance learning of the minority class by directly altering the training process of the classifier or modifying algorithms to address the challenges associated with imbalanced data effectively [34]. Common strategies encompass:

1) Cost Sensitive Learning

Cost-sensitive learning entails allocating varying weights or costs to classes during the training process, prioritizing the penalization of misclassifications of the minority class [35]. MetaCost is a cost-sensitive classification algorithm that creates multiple classification models from the original dataset and computes the class probability for each sample. It then applies a conditional risk formula to assign a cost to each sample, subsequently re-labeling the training set based on these costs; still, it is less suitable for handling imbalanced data in multi-class scenarios [36]. The adaptive cost-sensitive learning (ACL) method, effective under large imbalance ratios, dynamically adapts the sample cost throughout the entire training process. The model is trained using weighted losses in a manner that harmonizes the contribution of each class to the model parameter updates, preventing the dominance of majority classes during training; the cost calculation considers the sample number distribution, the convergence trend of classes, and the convergence trend of samples. Applied to industrial fault diagnosis, this approach ensures adequate representation of minority classes and significantly enhances the performance of intelligent diagnostic models, particularly in scenarios with high imbalance ratios [37]. Another study employs cost-sensitive learning to dynamically adjust class-specific costs within deep neural networks, thereby improving the recognition of minority classes; this is particularly useful in applications such as medical diagnosis and fraud detection, where accurately identifying underrepresented classes is critical [35].
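As a small sketch of the basic mechanism, assuming scikit-learn and a toy dataset; neither the weighting scheme nor the 10x cost is taken from the cited methods:

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.utils.class_weight import compute_class_weight

X, y = make_classification(n_samples=2000, weights=[0.9, 0.1], random_state=0)

# Option 1: derive inverse-frequency class weights automatically, so the
# loss contribution of each class is balanced during training.
w = compute_class_weight("balanced", classes=np.array([0, 1]), y=y)
clf = LogisticRegression(class_weight={0: w[0], 1: w[1]}, max_iter=1000).fit(X, y)

# Option 2: encode an explicit per-sample misclassification cost, e.g.
# treating an error on a minority instance as 10x as costly.
sample_cost = np.where(y == 1, 10.0, 1.0)
clf_cost = LogisticRegression(max_iter=1000).fit(X, y, sample_weight=sample_cost)
```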

2) Algorithm-Specific Adjustments

Adjustments specific to algorithms entail customizing methods like decision trees, random forests, and SVMs to handle imbalanced data more effectively. A Support Vector Machine (SVM) equipped with Multiple Kernel Learning (MKL) can integrate different kernel functions and fine-tune their weights, thus improving its ability to classify imbalanced datasets more accurately [38]. The Stratified Sampling-Based Deep Neural Network (SSDNN) approach tackles imbalanced data challenges by employing a stratified sampling technique that partitions the dataset into non-duplicated groups with balanced class representation, enhancing prediction accuracy, though the stratified sampling step increases computational complexity [9]. GENDA (Generative Neighborhood-based Deep Autoencoder) is a novel generative model that addresses imbalance, particularly for image and time-series data [13]. By learning latent representations of the data and generating synthetic samples for minority classes, it is flexible enough to be applied in domains where original data usage is restricted; it provides advantages over other algorithm-level techniques by incorporating data augmentation and addressing imbalanced classification without making distribution assumptions. A novel deep-learning-based model is introduced to address data imbalance in medical image classification: acknowledging the underrepresentation of many medical conditions in datasets, the authors propose an approach that utilizes effective perturbation operations to extract relevant features from single-class samples [39]. Another study conducts an in-depth exploration of adapting class-balanced loss functions for Gradient Boosting Decision Trees (GBDT) across diverse tabular classification tasks, including binary, multi-class, and multi-label classification. The research highlights the effectiveness of these loss functions in addressing the class imbalance challenges common in real-world applications, and introduces a Python package that simplifies the integration of these loss functions into GBDT workflows, making advanced techniques more accessible to researchers and practitioners [40].
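Many standard learners already expose hooks for such adjustments; a brief scikit-learn sketch (toy data, not from any cited study):

```python
from sklearn.datasets import make_classification
from sklearn.svm import SVC
from sklearn.tree import DecisionTreeClassifier

X, y = make_classification(n_samples=2000, weights=[0.9, 0.1], random_state=0)

# SVM: class_weight scales the soft-margin penalty C per class, so errors
# on the minority class are penalized more heavily during training.
svm = SVC(kernel="rbf", class_weight="balanced").fit(X, y)

# Decision tree: the same hook reweights the impurity computation at each
# candidate split, counteracting the majority class's dominance.
tree = DecisionTreeClassifier(class_weight="balanced", random_state=0).fit(X, y)
```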

3) Ensemble Methods

Ensemble methods merge multiple base classifiers or models to forge a more robust learner that addresses the imbalance problem [41]; Table 2 summarizes some ensemble methods, their findings, and their limitations. These approaches leverage various algorithms, or versions of the same algorithm, to bolster predictive performance on imbalanced datasets, mitigating the impact of class imbalance by blending predictions from multiple models and thus enhancing the classifier's overall performance. Boosting refers to a group of machine learning techniques that successively train a sequence of weak learners; each learner aims to rectify the errors of its predecessors by assigning higher importance to misclassified cases. This iterative process enables boosting to construct a resilient ensemble model capable of making accurate predictions, which is particularly beneficial in scenarios with class imbalance or noisy data. AdaBoost, a renowned boosting algorithm, continuously adjusts the weights of misclassified instances to improve the model's overall performance [42]. Bagging, or Bootstrap Aggregating, generates multiple training dataset instances via bootstrap sampling and trains an individual base learner on each sample; the predictions of these base learners are subsequently aggregated, usually through averaging, to generate the final prediction. This process enhances the overall performance and stability of machine learning models by mitigating variance and overfitting, and it indirectly aids in addressing imbalanced data issues [43]. Random forests, a widely favored ensemble learning technique, amalgamate numerous decision trees to enhance predictive accuracy: every tree in the forest is constructed from a randomized subset of features in the training data, and the ultimate prediction is formed by aggregating the predictions from all the trees [44]. Gradient Boosting optimizes weak learners, typically decision trees, to improve predictive accuracy; it functions by iteratively constructing decision trees to rectify the errors made by preceding trees, assigning greater weights to misclassified samples in each iteration [45]. A stacked deep learning algorithm leverages ensemble methods to manage imbalanced data adeptly, integrating techniques such as Stacked CNNs (Convolutional Neural Networks) and Stacked RNNs (Recurrent Neural Networks) to identify intricate patterns and temporal interdependencies [4].

TABLE 2. Effectiveness and Limitations of Ensemble Methods
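Several imbalance-aware variants of these ensembles are implemented in imbalanced-learn; a sketch on toy data (parameters are illustrative):

```python
from sklearn.datasets import make_classification
from imblearn.ensemble import (
    BalancedRandomForestClassifier,
    EasyEnsembleClassifier,
    RUSBoostClassifier,
)

X, y = make_classification(n_samples=3000, weights=[0.9, 0.1], random_state=0)

# Random forest whose bootstrap sample for each tree is balanced by
# undersampling the majority class.
brf = BalancedRandomForestClassifier(n_estimators=100, random_state=0).fit(X, y)

# Boosting where each iteration trains on a randomly undersampled set.
rus = RUSBoostClassifier(n_estimators=50, random_state=0).fit(X, y)

# Bag of AdaBoost learners, each fit on a balanced subset (EasyEnsemble).
easy = EasyEnsembleClassifier(n_estimators=10, random_state=0).fit(X, y)
```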

C. Hybrid Approaches

Hybrid approaches amalgamate various strategies, frequently blending data-level and algorithm-level techniques, as shown in Figure 2. These methodologies provide a comprehensive solution to class imbalance challenges, enhancing machine learning models' resilience, effectiveness, and reliability. By integrating multiple methods, they aim to alleviate issues like overfitting, information loss, and poor performance on minority classes, thus enhancing the overall effectiveness of machine learning models in managing class disproportionality. Table 3 highlights the hybrid techniques employed in recent years. Common hybrid approaches include:

TABLE 3. Effectiveness and Limitations of Hybrid Approaches to Handle Imbalanced Data

FIGURE 2. Approaches to handle the imbalanced data problem.

1) Data Preprocessing with Algorithm Adjustments

Data preprocessing with algorithm adjustments involves using techniques like oversampling, undersampling, or SMOTE to prepare the data before applying algorithm-level adjustments. The Enhanced Generative Adversarial Network (E-GAN) technique amalgamates features of Generative Adversarial Networks (GANs) and Convolutional Neural Networks (CNNs), leveraging the data generation prowess of GANs and the classification capabilities of CNNs [11]. A recent study tackles the challenge of imbalanced spectral data in materials science through a GAN-based data augmentation method; it jointly optimizes the GAN's generator and a classifier, allowing for the creation of synthetic samples that are both realistic and effectively distinguishable across material phases [46]. By employing transfer learning and domain adaptation techniques, specifically Maximum Mean Discrepancy (MMD), another approach addresses imbalanced data in detecting Return-Oriented Programming (ROP) attacks: it leverages balanced data from a source domain to train a model while minimizing the MMD to align the distributions of the source and target domains [47]. The integration of a Genetic Algorithm (GA) with a Support Vector Machine (SVM) seeks to optimize SVM parameters while using targeted sampling techniques to manage class imbalance effectively [48].
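A generic sketch of this pattern, combining a data-level sampler with an algorithm-level adjustment in one imbalanced-learn pipeline (the components are illustrative, not those of the cited studies):

```python
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from imblearn.over_sampling import SMOTE
from imblearn.pipeline import Pipeline

X, y = make_classification(n_samples=3000, weights=[0.95, 0.05], random_state=0)

# Resample first (data level), then train a class-weighted learner
# (algorithm level). imblearn's Pipeline applies the sampler only
# during fit, never at prediction time.
hybrid = Pipeline(steps=[
    ("smote", SMOTE(random_state=0)),
    ("clf", LogisticRegression(class_weight="balanced", max_iter=1000)),
]).fit(X, y)
```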

2) Algorithmic Ensemble with Data-Level Techniques

Integrating algorithmic ensembles with data-level techniques involves generating multiple models from diverse resampled versions of the dataset and consolidating their predictions. SMOTEBoost and RUSBoost exemplify these techniques. SMOTEBoost combines the SMOTE algorithm with boosting to enhance predictions on imbalanced datasets; it overcomes the constraints of traditional boosting algorithms such as AdaBoost by creating synthetic examples for the minority class and adjusting the training distribution to focus on these instances [49]. RUSBoost combines Random UnderSampling (RUS) with boosting, undersampling the majority class and subsequently boosting the classifier on the balanced dataset [50]. Integrating DB-SLSMOTE with Random Forest tackles class imbalance by augmenting the minority class with synthetic samples generated from the density distribution; Random Forest then harnesses this balanced dataset to construct a resilient ensemble model, enhancing classification effectiveness [51]. Before applying the MCNN-LSTM model, the Tomek Links technique is utilized as an undersampling method to manage imbalanced data.

The MCNN-LSTM model is a hybrid framework in which Convolutional Neural Networks (CNNs) capture local features and patterns in text data, while Long Short-Term Memory (LSTM) networks are tailored for text classification tasks; the combination is instrumental in situations with imbalanced data, addressing the inherent challenges of imbalanced text classification [52]. Integrating oversampling techniques with ensemble deep learning models involves merging modified oversampling methods like Distance-based SMOTE (D-SMOTE) and Bi-phasic SMOTE (BP-SMOTE) with Stacked CNNs and Stacked RNNs, enhancing predictive accuracy and robustness in handling imbalanced datasets [4]. In [53], the authors combined Deep Convolutional Generative Adversarial Networks (DCGAN) for generating synthetic samples with Convolutional Neural Networks (CNN) for classification and feature extraction. The research in [54] integrates resampling techniques like SMOTE and undersampling (US) to rebalance class distributions; moreover, it employs Particle Swarm Optimization (PSO) for attribute selection to improve sensitivity and reduce data dimensionality, while MetaCost is utilized as an algorithm-level approach to address class imbalance effectively. Another investigation employs a blend of undersampling using Tomek Links, clustering via BIRCH, and oversampling through Borderline-SMOTE to address imbalance in credit card transaction datasets: by removing noise with Tomek Links, clustering the data with BIRCH, and generating synthetic instances for the underrepresented class through Borderline-SMOTE, this approach seeks to equalize the dataset effectively [7]. The ATOMIC approach represents an automated machine-learning method explicitly designed for imbalanced classification tasks; it addresses the challenges posed by imbalanced datasets through a combination of algorithmic ensembling, which optimizes the selection of learning algorithms, and data-level techniques, which optimize resampling strategies and hyperparameters [55]. A further study examines the challenges of classifying minority classes in imbalanced datasets, with a focus on cerebral stroke prediction and bankruptcy risk in financial data. By evaluating the performance of various machine learning algorithms, the research underscores the limitations of traditional resampling methods like SMOTE in clinical contexts and highlights the critical role of understanding dataset characteristics, as these factors greatly impact the effectiveness of predictive models [56].
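The common thread of these methods (train several learners on differently resampled views of the data and combine their outputs) can be sketched in a few lines; this is a simplified stand-in, not SMOTEBoost or RUSBoost themselves:

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.tree import DecisionTreeClassifier
from imblearn.under_sampling import RandomUnderSampler

X, y = make_classification(n_samples=3000, weights=[0.95, 0.05], random_state=0)

# Train one model per independently undersampled (balanced) view of the
# data, then average the predicted probabilities across the ensemble.
models = []
for seed in range(10):
    X_bal, y_bal = RandomUnderSampler(random_state=seed).fit_resample(X, y)
    models.append(DecisionTreeClassifier(random_state=seed).fit(X_bal, y_bal))

proba = np.mean([m.predict_proba(X)[:, 1] for m in models], axis=0)
pred = (proba >= 0.5).astype(int)
```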

3) Real-World Applications of Hybrid Approaches

Hybrid approaches have effectively addressed class imbalance across various real-world domains. Enhancing model performance and reliability has become essential in critical fields such as healthcare, fraud detection, cybersecurity, materials science, telecommunications, and others. Each application utilizes hybrid techniques to tackle the unique challenges posed by imbalanced datasets, improving predictive accuracy and decision-making capabilities. Table 4 shows the applications and datasets used in each study discussed.

TABLE 4. Applications of Hybrid Approaches

In healthcare, techniques such as D-SMOTE and BP-SMOTE with Stacked CNN and RNN enhance predictive analytics for the early diagnosis of critical conditions like cardiovascular diseases and breast cancer [4]. Similarly, E-GAN improves disease detection accuracy for conditions such as breast cancer, diabetes, and chronic kidney disease by addressing imbalanced datasets effectively [11]. The DB-SLSMOTE with Random Forest method proves particularly valuable in detecting rare diseases by enhancing classification accuracy in datasets with limited positive cases [51]. DCGAN and CNN generate synthetic samples for medical imaging to mitigate class imbalances, supporting the accurate diagnosis of diseases like malaria [53]. Additionally, the integrated approach of SMOTE, Undersampling, PSO, and MetaCost optimizes the classification of underrepresented medical data, aiding healthcare professionals in making more informed decisions [54].

Fraud detection leverages methods like DB-SLSMOTE with Random Forest, enhancing models’ reliability for identifying fraudulent financial transactions [51]. The Tomek Links + BIRCH Clustering + Borderline SMOTE technique further improves fraud detection accuracy by addressing imbalances in transaction datasets [7]. Additionally, the ATOMIC Method automates the creation of machine learning solutions designed for imbalanced data, facilitating the detection of fraudulent activities in financial systems [55].

In cybersecurity, Transfer Learning with Domain Adaptation using MMD is utilized to enhance the detection of Return-Oriented Programming (ROP) attacks, a complex exploit technique [47]. This approach leverages transfer learning to address dataset imbalances, improving the accuracy and reliability of deep learning models for identifying such attacks.

In materials science, the GAN-Based Data Augmentation Method tackles the classification of material phases in imbalanced spectral datasets. Generating synthetic samples supports experimental design and materials characterization, as demonstrated in case studies involving hydrogels such as Pluronic F-127 and Alpha-Cyclodextrin [46].

In automated text classification, the Tomek Links Before MCNN-LSTM method categorizes news articles into diverse topics such as Politics, Sports, and Lifestyle [52]. This approach enhances content organization and retrieval for media organizations, ensuring improved representation and handling of underrepresented categories.

For telecommunications, the Genetic Algorithm with SVM improves user classification in systems such as Non-Orthogonal Multiple Access (NOMA) networks. By addressing class imbalances, this method ensures efficient resource allocation and optimized communication management [48].

The ATOMIC Method is applicable across various domains, such as anomaly detection, healthcare diagnostics, fraud detection, and credit scoring. Automating model optimization for imbalanced data simplifies analytical processes, enhancing decision-making and resource allocation in these critical areas [55].

The study presenting a Hybrid Ensemble focuses on cerebral stroke prediction by enhancing the reliability of machine learning models and assessing the effectiveness of SMOTE in clinical datasets [56].

SECTION V.

Discussion of the Limitations

This section justifies the limitations of each technique, beginning at the data level. SMOTE-Tomek and SMOTE-ENN face difficulties with datasets featuring high class overlap or noise: SMOTE-generated synthetic samples may resemble majority instances, causing classifier confusion, and the cleaning steps can unintentionally remove valuable minority samples or miss noisy majority instances, reducing effectiveness [26]. Distance-based SMOTE (D-SMOTE) requires high computational resources for distance calculations in high-dimensional spaces, leading to longer processing times; the "curse of dimensionality" in such datasets can diminish the relevance of distance metrics, affecting synthetic sample quality and efficiency [4]. Bi-phasic SMOTE (BP-SMOTE) relies on an iterative process with multiple SMOTE applications and instance-selection phases, which can be resource-intensive; as dataset size grows, processing time and resource demands increase, reducing scalability for real-world applications [4]. The effectiveness of class-decomposition SMOTE (CDSMOTE) relies on accurately decomposing the majority class into subclasses; inaccurate decomposition can produce poorly defined subclasses, diminishing the impact of oversampling and potentially biasing the model toward the majority class [27]. The performance of Radius-SMOTE (RSMOTE) depends heavily on correctly tuning parameters like the radius distance used to define boundaries for synthetic sample generation; poor tuning can cause excessive overlap with majority samples or inadequate minority sample generation, and dataset size and complexity increase computational costs, limiting scalability [28]. Self-Inspected Adaptive SMOTE (SASMOTE) requires precise hyperparameter tuning for optimal results, and its design around specific case studies limits generalizability across healthcare applications, suggesting a need for adaptation to broader contexts [29]. Borderline-SMOTE generates synthetic samples near decision boundaries, which may overlap with the majority class and create ambiguous regions; this overlap can confuse the classifier and reduce generalization performance, particularly with poorly separated classes [30]. Oriented Oversampling with Spatial Information (OOSI) may face runtime challenges on complex datasets due to high dimensionality and intricate distributions, as its adaptive spatial partitioning requires intensive computation, affecting scalability and efficiency, especially with noisy datasets [31]. If temporal dependencies between samples are not accurately captured, the Synthetic and Dependent Wild Bootstrapped Oversampling method (SDWBOTE) may carry over noise and bias from the original dataset, potentially resulting in poor real-world classifier performance and misrepresentation of minority classes [8]. GAN-based data augmentation demands substantial computational resources for training due to the complexity of adversarial optimization between the generator and discriminator; additionally, GANs are susceptible to mode collapse, where the generator fails to cover the entire data space, limiting sample diversity and augmentation quality [6]. The limitations of the Active Balancing Mechanism (ABM) are the risk that traditional undersampling discards valuable minority-class information and the reliance on validation with a single dataset, which restricts its generalizability to diverse clinical scenarios [32].

At the algorithm level, for SVM with Multiple Kernel Learning (MKL), optimizing multiple kernel functions increases computational demands, potentially limiting scalability in real-time applications; extensive parameter tuning is also needed for optimal results, which can be time-consuming and may reduce model interpretability, a key factor in fields like healthcare [38]. AdaBoost's emphasis on misclassified instances can make it highly sensitive to noisy data, increasing the risk of overfitting: in the presence of outliers or noise, the algorithm may over-focus on these points, reducing its ability to generalize effectively to unseen data [42]. Bagging is computationally intensive, as it trains one model per bootstrap sample; additionally, it may not adequately address class imbalance on its own, since bootstrap sampling can preserve the original imbalance, resulting in poor performance on minority classes [43]. Random forests may show bias toward the majority class on imbalanced datasets, as training often prioritizes majority-class accuracy; the model also requires careful hyperparameter tuning to prevent increased bias or overfitting, making balanced performance across classes challenging to achieve [44]. With additional trees, Gradient Boosting Decision Trees (GBDT) can become overly fitted to the training data, capturing noise and outliers instead of actual patterns; effective regularization is crucial to prevent performance loss on unseen data [45]. Stacked deep learning models are complex due to the integration of multiple architectures, raising the risk of overfitting; this complexity may lead the model to capture noise instead of general patterns, resulting in poor generalization, particularly with smaller or less diverse datasets [4].

At the hybrid level, in D-SMOTE and BP-SMOTE with Stacked CNNs and RNNs, the complexity of combining multiple deep learning architectures increases the risk of overfitting and complicates generalization; interpretability is also reduced, making it challenging to understand model decisions, especially in sensitive fields like healthcare [4]. The combined E-GAN and CNN model demands substantial computational resources and time, particularly for large datasets, and synthetic samples may lead to overfitting if they do not accurately reflect the minority-class distribution [11]. In DB-SLSMOTE with Random Forest, generating synthetic samples adds to training complexity and the risk of overfitting, with Random Forest training being especially resource-intensive for large datasets [51]. For Tomek Links before MCNN-LSTM, the model's dependence on a specific dataset (Indonesian news) limits its generalizability, and the lack of transfer learning prevents it from utilizing larger datasets to enhance performance [52]. DCGAN with CNN requires substantial computational resources for adversarial training, and synthetic samples may lead to overfitting, affecting the model's generalization on unseen data [53]. In SMOTE + US + PSO + MetaCost, concentrating on specific methods may restrict broader insights into alternative approaches; furthermore, conclusions may lack generalizability if the datasets do not include diverse medical characteristics [54]. The complexity and parameter sensitivity of Tomek Links + BIRCH Clustering + Borderline-SMOTE necessitate careful tuning, and its sensitivity to noise may leave residual noise, impacting model accuracy [7]. The ATOMIC method (meta-learning) could improve its handling of imbalanced data through a broader exploration of hyperparameters and algorithms, enhancing its performance and adaptability across different datasets [55]. For the Genetic Algorithm with SVM, the iterative nature of genetic algorithms results in high computational costs and a risk of overfitting if hyperparameters, such as population size and mutation rate, are not carefully optimized [48]. For transfer learning with MMD, high-quality source data is crucial, and limited validation data can reduce detection effectiveness; careful model selection is essential to ensure consistent results [47]. GAN-based data augmentation with joint optimization is computationally intensive due to the dual optimization between the generator and classifier, and it struggles with high phase similarity, which limits distinct sample generation and effective class separation [46]. In the hybrid ensemble approach, classifiers face challenges in accurately predicting minority classes in medical datasets, and SMOTE's theoretical validation may fail to align with real-world clinical applications [56].

SECTION VI.

Evaluation Methods

When handling imbalanced datasets in machine learning, choosing the appropriate evaluation metrics is crucial for precisely assessing model performance. This section explores various evaluation techniques particularly beneficial for addressing imbalanced data. It highlights the advantages of each method and its effectiveness in evaluating the model’s performance, particularly concerning accurately predicting outcomes for the minority class.

Accuracy is a fundamental assessment criterion in machine learning and data mining. However, accuracy can be misleading when used with an imbalanced dataset: a model may still achieve high overall accuracy even if its classification performance on minority categories is poor, as long as it performs well on the majority categories. For example, if 99% of the testing data are negative samples, a model can achieve 99% accuracy simply by classifying all the testing data as negative. Accuracy therefore cannot be chosen as an evaluation index in imbalanced learning. The evaluation indicators relating to imbalanced learning are shown in Table 5.
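The 99% example can be reproduced in a few lines; a small sketch using scikit-learn metrics:

```python
import numpy as np
from sklearn.metrics import accuracy_score, recall_score

# 990 negative samples and 10 positive samples.
y_true = np.array([0] * 990 + [1] * 10)
y_pred = np.zeros(1000, dtype=int)  # a "model" that always predicts negative

print(accuracy_score(y_true, y_pred))  # 0.99 -> looks excellent
print(recall_score(y_true, y_pred))    # 0.0  -> every positive case is missed
```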

TABLE 5. Evaluation Metrics for Imbalance Classification

In Table 5, most evaluation metrics are derived from the confusion matrix (CM), a critical tool for visually representing an algorithm's performance. It is particularly important when dealing with imbalanced datasets, as it delineates the count of accurate and inaccurate predictions for each class. This detailed breakdown is crucial for understanding the model's effectiveness across both the predominant and underrepresented classes. The main components of the confusion matrix are:

  • True Positives (TP): positive observations accurately identified as positive.

  • True Negatives (TN): negative observations accurately identified as negative.

  • False Positives (FP): negative observations incorrectly identified as positive.

  • False Negatives (FN): positive observations incorrectly identified as negative.

Metrics like AUC and G-mean are commonly used because they remain largely unaffected by class distribution imbalances. AUC is based on the entire ROC curve, while the G-mean incorporates different parts of the confusion matrix, ensuring a more balanced evaluation of model performance. This makes them suitable for situations with large differences in the number of positive and negative class samples. The Receiver Operating Characteristic (ROC) curve and the Area Under the Curve (AUC) are valuable for assessing the quality of classifier outputs; they are particularly adept at evaluating performance across different threshold settings, offering robustness against class imbalance. The ROC curve graphs the True Positive Rate (TPR) against the False Positive Rate (FPR) across different threshold configurations, and the AUC condenses the ROC curve's insights into a single value: the likelihood that a classifier will rank a randomly chosen positive instance above a randomly chosen negative one.

The geometric mean (G-mean), computed as the square root of the product of recall and specificity, guarantees that enhancements in one class's efficiency do not adversely impact the other. This balance is crucial for effectively evaluating models where the minority class, often of higher interest in imbalanced datasets, must not be overlooked. For multiclass imbalance problems, the G-mean is often preferred, as it offers a unified measurement approach and eliminates the need to assess each class separately. For highly imbalanced Big Data, the area under the precision-recall curve (AUPRC) is a more effective metric for evaluating classifier performance: in such settings, the AUC metric fails to capture the information about precision scores and false-positive counts that the AUPRC reveals. The F1 Score denotes the harmonic mean of precision and recall, which is valuable in situations requiring a balance between the two and is common in datasets with imbalanced class distributions; it provides more context than accuracy when the class distribution is uneven. In these formulas, precision denotes the fraction of predicted positives that are truly positive [57], [58].
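The metrics above are all available in scikit-learn and imbalanced-learn; a short sketch computing them for a toy imbalanced problem (the dataset and classifier are illustrative):

```python
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import average_precision_score, f1_score, roc_auc_score
from sklearn.model_selection import train_test_split
from imblearn.metrics import geometric_mean_score

X, y = make_classification(n_samples=5000, weights=[0.95, 0.05], random_state=0)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, stratify=y, random_state=0)

clf = LogisticRegression(max_iter=1000).fit(X_tr, y_tr)
proba = clf.predict_proba(X_te)[:, 1]  # scores for threshold-free metrics
pred = clf.predict(X_te)               # hard labels for F1 and G-mean

print("F1     :", f1_score(y_te, pred))                  # harmonic mean of precision/recall
print("G-mean :", geometric_mean_score(y_te, pred))      # sqrt(recall * specificity)
print("AUROC  :", roc_auc_score(y_te, proba))            # area under the ROC curve
print("AUPRC  :", average_precision_score(y_te, proba))  # area under the PR curve
```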

Considering these evaluation methods ensures a thorough understanding of a model’s performance on imbalanced datasets, guiding the development of efficient and equitable models.

SECTION VII.

Conclusion

This paper highlights the crucial importance of addressing class imbalance in machine learning initiatives. It begins by discussing basic strategies for balancing class distribution, which paves the way for a comprehensive exploration of techniques categorized into data-level, algorithm-level, and hybrid strategies. Additionally, this work examines the limitations inherent in each technique, providing justifications for their shortcomings, to offer a nuanced understanding of their practical challenges and opportunities for improvement. Subsequently, it underscores the importance of evaluation methods in assessing the efficacy of these strategies under imbalanced data conditions, examining metrics like F1 Score, AUC, and G-mean, among others. These metrics are vital for evaluating how various techniques fare, especially in accurately predicting outcomes for the minority class.

Recent studies have identified several gaps in addressing imbalanced data across data-centric, algorithmic, and blended approaches. At the data level, there is a pressing need for scalable techniques that can manage large imbalanced datasets and maintain their effectiveness across different domains. Algorithm-level challenges revolve around strengthening the resilience of methods against the evolving threats inherent in imbalanced data scenarios. Meanwhile, hybrid approaches face scalability, generalizability, and robustness issues, highlighting the necessity for methods that can effectively scale, function across diverse settings, and resist evolving threats in imbalanced data contexts.

These challenges underline the continuous push to develop more effective and flexible imbalanced data classification methods. Moreover, the balance between model complexity and generalization remains a significant hurdle, emphasizing the need for ongoing research. Understanding the strengths and limitations of each approach, including essential evaluation methods, equips researchers to develop machine-learning models that are effective, robust, and ready for real-world application. These models are purposefully crafted to manage the intricacies linked with data imbalance adeptly.
