In this paper we present an experimental study of machine learning from imbalanced data sets applied to the difficult problem of astronomical object classification in multi-spectral wide-field images. The imbalanced data set problem is very common in several domains, and occurs when there are many more examples of some classes than others; therefore, classifiers perform poorly on these data sets. In order to improve the performance of machine learning algorithms over minority class examples, we propose to create new instances using a modification of the well-known SMOTE technique, but only of those misclassified examples given by an ensemble of classifiers. Our preliminary experimental results show that the proposed approach obtain above. 700 using recall, precision and f-measure as metrics for evaluation; using small data sets.
Published in:
Soft Computing and Pattern Recognition (SoCPaR), 2011 International Conference of
Date of Conference: 14-16 Oct. 2011