By Topic

Combining feature ranking for text classification

Sign In

Cookies must be enabled to login.After enabling cookies , please use refresh or reload or ctrl+f5 on the browser for the login options.

The purchase and pricing options are temporarily unavailable. Please try again later.
2 Author(s)
Makrehchi, M. ; Univ. of Waterloo, Waterloo ; Kamel, M.S.

Feature ranking is one of the dimensionality reduction methods. Because of its simplicity and low cost, it is widely used in text classification. One problem with feature ranking methods is their non-robust behavior when applied to different data sets. In other words, the feature ranking methods behave differently from one data set to the other. The problem is more complex when we consider that the performance of feature ranking methods is different when being used by different classifiers. In this paper, a new method based on combining feature rankings is proposed to find the best features among a set of feature rankings. Four preferential voting method are employed to combine feature rankings obtained by eight well-known ranking measures. According to the results, combining methods can offer reliable results that are very close to the best solution without the need to use a classifier. The proposed method is applied to the text classification problem and evaluated on three well-known data sets using SVM classifier.

Published in:

Systems, Man and Cybernetics, 2007. ISIC. IEEE International Conference on

Date of Conference:

7-10 Oct. 2007