Abstract:
Driven by a plethora of real machine learning applications, there have been many attempts at improving the performance of a classifier applied to imbalanced dataset. In t...Show MoreMetadata
Abstract:
Driven by a plethora of real machine learning applications, there have been many attempts at improving the performance of a classifier applied to imbalanced dataset. In this paper we propose a maximum entropy machine (MEM) based hybrid algorithm to handle binary classification problems with high imbalance ratios and large numbers of features in the datasets. At the training stage, we combine an efficient MEM algorithm with the SMOTE algorithm to build a classifier in a batch manner. At the application stage, the different-cost strategy is incorporated into the MEM algorithm to handle the imbalance learning problem in an online manner. Experiments are conducted based on various real datasets (including one China Mobile dataset and several other standard test datasets) with different imbalance ratios and different numbers of features. The results show that the proposed algorithm outperforms the state-of-the-art algorithms significantly in terms of robustness and overall classification performance.
Date of Conference: 16-18 August 2018
Date Added to IEEE Xplore: 14 February 2019
ISBN Information:
Print on Demand(PoD) ISSN: 2377-8644