I. Introduction
Classification learning, an important research field in data mining, has wide applications in areas such as computer vision, pattern recognition, and bioinformatics. In many of these applications, however, the data are high-dimensional, which increases the computational complexity of classification algorithms and degrades their learning performance. An effective way to tackle this problem is feature selection (FS), in which a subset of the original features is chosen so that the classifier achieves similar (or better) performance than with the full feature set [1].