170 likes | 335 Views
A genetic algorithm-based method for feature subset selection. Feng Tan; Xuezheng Fu; Yanqing Zang; Anu G. Bourgeois Springer Soft Comput (2008) 12:111-120 Yi-Chia Lan. Outline. Introduction Feature selection methods Entropy-based feature ranking T-statistics
E N D
A genetic algorithm-based method for feature subset selection Feng Tan; Xuezheng Fu; Yanqing Zang; Anu G. Bourgeois Springer Soft Comput (2008) 12:111-120 Yi-Chia Lan
Outline • Introduction • Feature selection methods • Entropy-based feature ranking • T-statistics • SVM-RFE(Recursive Feature Elimination) • Framework of feature selection algorithm • Experiments and results
Introduction (cont.) Training data (sets) Classificatory accuracy Test data (sets)
Introduction • 1. Feature selection • Removing redundant irrelevant or noise features • Improve the predictive accuracy • 2. The experimental result demonstrate: • Higher classification accuracy • Minimize size of feature subsets
Feature selection methods (cont.) Entropy-based α : parameter : average distance among the instances : Euclidean distance between the two instances
Feature selection methods (cont.) T-statistics
Feature selection methods SVM-RFE At the optimum of J , the first order is neglected second order becomes
Framework of feature selection algorithm Fitness function : x : feature vector representing ; c(x) : classification accuracy w : parameter {0~1} ; s(x) : weighted size Crossover : Single-point crossover operator Mutation : 0.001