TY - JOUR
T1 - A maximum-margin genetic algorithm for misclassification cost minimizing feature selection problem
AU - Pendharkar, Parag C.
N1 - Copyright:
Copyright 2013 Elsevier B.V., All rights reserved.
PY - 2013/8
Y1 - 2013/8
N2 - We consider a feature selection problem where the decision-making objective is to minimize overall misclassification cost by selecting relevant features from a training dataset. We propose a two-stage solution approach for solving misclassification cost minimizing feature selection (MCMFS) problem. Additionally, we propose a maximum-margin genetic algorithm (MMGA) that maximizes margin of separation between classes by taking into account all examples as opposed to maximizing margin of separation using a few support vectors. Feature selection is carried out by either an exhaustive or a heuristic simulated annealing approach in the first stage and a cost sensitive classification using either MMGA or cost sensitive support vector machines (SVM) in the second stage. Using simulated and real-world data sets and different misclassification cost matrices, we test our two-stage approach for solving the MCMFS problem. Our results indicate that feature selection plays an important role when misclassification cost asymmetries increase and the MMGA shows equal or better performance than the SVM.
AB - We consider a feature selection problem where the decision-making objective is to minimize overall misclassification cost by selecting relevant features from a training dataset. We propose a two-stage solution approach for solving misclassification cost minimizing feature selection (MCMFS) problem. Additionally, we propose a maximum-margin genetic algorithm (MMGA) that maximizes margin of separation between classes by taking into account all examples as opposed to maximizing margin of separation using a few support vectors. Feature selection is carried out by either an exhaustive or a heuristic simulated annealing approach in the first stage and a cost sensitive classification using either MMGA or cost sensitive support vector machines (SVM) in the second stage. Using simulated and real-world data sets and different misclassification cost matrices, we test our two-stage approach for solving the MCMFS problem. Our results indicate that feature selection plays an important role when misclassification cost asymmetries increase and the MMGA shows equal or better performance than the SVM.
UR - http://www.scopus.com/inward/record.url?scp=84875363688&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84875363688&partnerID=8YFLogxK
U2 - 10.1016/j.eswa.2012.12.086
DO - 10.1016/j.eswa.2012.12.086
M3 - Article
AN - SCOPUS:84875363688
SN - 0957-4174
VL - 40
SP - 3918
EP - 3925
JO - Expert Systems With Applications
JF - Expert Systems With Applications
IS - 10
ER -