Synthetic Oversampling with the Majority Class: A New Perspective on Handling Extreme Imbalance

Abstract:

The class imbalance problem is a pervasive issue in many real-world domains. Oversampling methods that inflate the rare class by generating synthetic data are amongst the most popular techniques for resolving class imbalance. However, they concentrate on the characteristics of the minority class and use them to guide the oversampling process. By completely overlooking the majority class, they lose a global view on the classification problem and, while alleviating the class imbalance, may negatively impact learnability by generating borderline or overlapping instances. This becomes even more critical when facing extreme class imbalance, where the minority class is strongly underrepresented and on its own does not contain enough information to conduct the oversampling process. We propose a novel method for synthetic oversampling that uses the rich information inherent in the majority class to synthesize minority class data. This is done by generating synthetic data that is at the same Mahalanobis distance from the majority class as the known minority instances. We evaluate over 26 benchmark datasets, and show that our method offers a distinct performance improvement over the existing state-of-the-art in oversampling techniques.
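
The core idea in the abstract, placing synthetic minority points at the same Mahalanobis distance from the majority class as a known minority instance, can be illustrated with a short NumPy sketch. This is not the authors' algorithm, whose details the abstract does not give. It is a minimal illustration under stated assumptions: the majority class is summarized by its sample mean and covariance, and each synthetic point is obtained by perturbing a minority instance in whitened space and rescaling it back to that instance's original distance. The function names and the `noise` and `ridge` parameters are hypothetical.

```python
import numpy as np

def mahalanobis_to_majority(points, mu, chol):
    """Mahalanobis distance of each row of `points` from the majority class."""
    # Whitened coordinates z = L^{-1}(x - mu), where cov = L L^T.
    # The Euclidean norm of z equals the Mahalanobis distance of x.
    z = np.linalg.solve(chol, (points - mu).T).T
    return np.linalg.norm(z, axis=1)

def oversample_same_distance(majority, minority, n_new,
                             noise=0.25, ridge=1e-6, seed=0):
    """Illustrative sketch (assumed procedure, not the published method).

    Returns `n_new` synthetic points plus the index of the minority seed
    each one was derived from. Every synthetic point has the same
    Mahalanobis distance from the majority class as its seed.
    """
    rng = np.random.default_rng(seed)
    dim = majority.shape[1]

    # Majority-class statistics; the ridge term (an assumption) keeps the
    # covariance positive definite so the Cholesky factor exists.
    mu = majority.mean(axis=0)
    cov = np.cov(majority, rowvar=False) + ridge * np.eye(dim)
    chol = np.linalg.cholesky(cov)

    # Move the minority seeds into whitened space and record their radii.
    z_min = np.linalg.solve(chol, (minority - mu).T).T
    radii = np.linalg.norm(z_min, axis=1)

    # Pick a seed per synthetic point and jitter it (assumed Gaussian noise,
    # scaled by the seed's radius).
    idx = rng.integers(0, len(minority), size=n_new)
    z_new = z_min[idx] + noise * radii[idx, None] * rng.standard_normal((n_new, dim))

    # Rescale so each jittered point lies at its seed's original radius.
    # (A seed exactly at the majority mean would need special handling.)
    z_new *= (radii[idx] / np.linalg.norm(z_new, axis=1))[:, None]

    # Map back to the original feature space: x = mu + L z.
    synthetic = mu + z_new @ chol.T
    return synthetic, idx
```

Working in whitened space turns the constraint into a simple one: points at a fixed Mahalanobis distance from the majority class lie on a sphere around the origin, so rescaling the norm is enough to keep a perturbed point on the same density contour as its seed.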

Published in: 2018 IEEE International Conference on Data Mining (ICDM)

Date of Conference: 17-20 November 2018

Date Added to IEEE Xplore: 30 December 2018

DOI: 10.1109/ICDM.2018.00060

Conference Location: Singapore
