TY - GEN
T1 - Evaluating feature selection for stress identification
AU - Deng, Yong
AU - Wu, Zhonghai
AU - Chu, Chao Hsien
AU - Yang, Tao
PY - 2012
Y1 - 2012
N2 - In modern society, more and more people are suffering from stress. The accumulation of stress will result in poor health condition to people. Effectively detecting the stress of human being in time provides a helpful way for people to better manage their stress. Much work has been done on recognizing the stress level of people by extracting features from the bio-signals acquired by physiological sensors. However, little work has been focused on the feature selection. In this paper, we propose a feature selection method based on Principal Component Analysis (PCA). After the features are selected, their effectiveness in terms of correct rate and computational time are evaluated using five classification algorithms, Linear Discriminant Function, C4.5 induction tree, Support Vector Machine (SVM), Naïve Bayes and K Nearest Neighbor (KNN). We use the driver stress database contributed by MIT Media lab for our experiments. Leaving one out as well as 10-fold data preparation approach is implemented as the cross validation method for our evaluation. Paired t-test is then performed to analyze and compare the experimental results, as well as for their statistical significance. Our study demonstrates the importance of feature selection and the effectiveness of the methods used in accurately classifying stress levels.
AB - In modern society, more and more people are suffering from stress. The accumulation of stress will result in poor health condition to people. Effectively detecting the stress of human being in time provides a helpful way for people to better manage their stress. Much work has been done on recognizing the stress level of people by extracting features from the bio-signals acquired by physiological sensors. However, little work has been focused on the feature selection. In this paper, we propose a feature selection method based on Principal Component Analysis (PCA). After the features are selected, their effectiveness in terms of correct rate and computational time are evaluated using five classification algorithms, Linear Discriminant Function, C4.5 induction tree, Support Vector Machine (SVM), Naïve Bayes and K Nearest Neighbor (KNN). We use the driver stress database contributed by MIT Media lab for our experiments. Leaving one out as well as 10-fold data preparation approach is implemented as the cross validation method for our evaluation. Paired t-test is then performed to analyze and compare the experimental results, as well as for their statistical significance. Our study demonstrates the importance of feature selection and the effectiveness of the methods used in accurately classifying stress levels.
UR - http://www.scopus.com/inward/record.url?scp=84868325945&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84868325945&partnerID=8YFLogxK
U2 - 10.1109/IRI.2012.6303062
DO - 10.1109/IRI.2012.6303062
M3 - Conference contribution
AN - SCOPUS:84868325945
SN - 9781467322843
T3 - Proceedings of the 2012 IEEE 13th International Conference on Information Reuse and Integration, IRI 2012
SP - 584
EP - 591
BT - Proceedings of the 2012 IEEE 13th International Conference on Information Reuse and Integration, IRI 2012
T2 - 2012 IEEE 13th International Conference on Information Reuse and Integration, IRI 2012
Y2 - 8 August 2012 through 10 August 2012
ER -