Skip to Main Content
Feature selection has become the cornerstone of many classification problems. It has been applied in many domains such as Web mining, text categorization, gene expression microarray analysis, image analysis, and combinatorial chemistry. One type of well-studied feature selection methodology is filtering, which is typically divided into ranking and subset evaluation. This work provides an empirical study regarding one type of feature ranking for which very limited research exists, namely wrapper-based feature ranking. Nine performance metrics are evaluated, and while these metrics are commonly used in data mining to evaluate classifier performance, they are rarely used as feature ranking techniques. Moreover, five different learners, 5-nearest neighbors (5NN), logistic regression (LR), multi layer perceptron (MLP), Naive Bayes (NB), and support vector machines (SVM) in conjunction with two different methodologies, 3-fold cross-validation (CV) and 3-fold cross-validation risk impact (CVR) are used in this study to evaluate feature relevancy and to determine ranking similarities among the different ranking techniques.