Abstract:
Feature selection (FS) has received significant attention since the use of a well-selected subset of features may achieve better classification performance than that of f...Show MoreMetadata
Abstract:
Feature selection (FS) has received significant attention since the use of a well-selected subset of features may achieve better classification performance than that of full features in many real-world applications. It can be considered as a multiobjective optimization consisting of two objectives: 1) minimizing the number of selected features and 2) maximizing classification performance. Ant colony optimization (ACO) has shown its effectiveness in FS due to its problem-guided search operator and flexible graph representation. However, there lacks an effective ACO-based approach for multiobjective FS to handle the problematic characteristics originated from the feature interactions and highly discontinuous Pareto fronts. This article presents an Information-theory-based Nondominated Sorting ACO (called INSA) to solve the aforementioned difficulties. First, the probabilistic function in ACO is modified based on the information theory to identify the importance of features; second, a new ACO strategy is designed to construct solutions; and third, a novel pheromone updating strategy is devised to ensure the high diversity of tradeoff solutions. INSA’s performance is compared with four machine-learning-based methods, four representative single-objective evolutionary algorithms, and six state-of-the-art multiobjective ones on 13 benchmark classification datasets, which consist of both low and high-dimensional samples. The empirical results verify that INSA is able to obtain solutions with better classification performance using features whose count is similar to or less than those obtained by its peers.
Published in: IEEE Transactions on Cybernetics ( Volume: 53, Issue: 8, August 2023)
Funding Agency:
Keywords assist with retrieval of results and provide a means to discovering other relevant content. Learn more.
- IEEE Keywords
- Index Terms
- Ant Colony ,
- Ant Colony Optimization ,
- Non-dominated Sorting ,
- Feature Selection For Classification ,
- Multi-objective Feature Selection ,
- Important Characteristics ,
- Classification Performance ,
- Real-world Applications ,
- Information Theory ,
- Probability Function ,
- Multi-objective Optimization ,
- Feature Subset ,
- Variety Of Solutions ,
- Pareto Front ,
- Search Operations ,
- Computation Time ,
- Support Vector Machine ,
- Parameter Settings ,
- Large-scale Datasets ,
- Particle Swarm Optimization ,
- Multi-objective Evolutionary Algorithms ,
- Feature Selection Problem ,
- Heuristic Information ,
- Classification Error Rate ,
- Improve Classification Performance ,
- Urban Land Cover ,
- Relevant Indicators ,
- Optimal Subset ,
- Dataset Characteristics ,
- Multi-objective Algorithm
- Author Keywords
Keywords assist with retrieval of results and provide a means to discovering other relevant content. Learn more.
- IEEE Keywords
- Index Terms
- Ant Colony ,
- Ant Colony Optimization ,
- Non-dominated Sorting ,
- Feature Selection For Classification ,
- Multi-objective Feature Selection ,
- Important Characteristics ,
- Classification Performance ,
- Real-world Applications ,
- Information Theory ,
- Probability Function ,
- Multi-objective Optimization ,
- Feature Subset ,
- Variety Of Solutions ,
- Pareto Front ,
- Search Operations ,
- Computation Time ,
- Support Vector Machine ,
- Parameter Settings ,
- Large-scale Datasets ,
- Particle Swarm Optimization ,
- Multi-objective Evolutionary Algorithms ,
- Feature Selection Problem ,
- Heuristic Information ,
- Classification Error Rate ,
- Improve Classification Performance ,
- Urban Land Cover ,
- Relevant Indicators ,
- Optimal Subset ,
- Dataset Characteristics ,
- Multi-objective Algorithm
- Author Keywords