Abstract
Ensemble learning has attracted much attention of researchers studying variable selection due to its great power in improving selection accuracy and stabilizing selection results. In this paper, we present a novel ensemble pruning technique called Pruned-ST2E to obtain more effective variable selection ensembles. The order to aggregate the individuals generated by the ST2E algorithm (Xin and Zhu in J Comput Graph Stat 21(2):275–294, 2012) is rearranged. To estimate the importance of each candidate variable, only some members ranked ahead are remained. Experiments with simulated and real-world data show that the performance of Pruned-ST2E is comparable or superior to several other benchmark methods. Through analyzing the accuracy–diversity pattern in both ST2E and Pruned-ST2E, it is revealed that the inserted pruning step excludes less accurate members. The reserved members also become more concentrated on the true importance vector. Moreover, Pruned-ST2E is easy to implement. Therefore, Pruned-ST2E can be considered as an alternative for tackling variable selection tasks in practice.
| Original language | English |
|---|---|
| Pages (from-to) | 217-230 |
| Number of pages | 14 |
| Journal | International Journal of Machine Learning and Cybernetics |
| Volume | 11 |
| Issue number | 1 |
| DOIs | |
| State | Published - 1 Jan 2020 |
Keywords
- Aggregation order
- Ensemble pruning
- Selection accuracy
- Variable selection
- Variable selection ensemble