TY - JOUR
T1 - On the discriminability of keystroke feature vectors used in fixed text keystroke authentication
AU - Balagani, Kiran S.
AU - Phoha, Vir V.
AU - Ray, Asok
AU - Phoha, Shashi
N1 - Funding Information:
This work is supported in part by the Louisiana BoR Grant No. LEQSF (2007-12)-ENH-PKSFI-PRS-03 and the USAF Contract No. FA8750-09-C-0191.
PY - 2011/5/1
Y1 - 2011/5/1
N2 - Heterogeneous and aggregate vectors are the two widely used feature vectors in fixed text keystroke authentication. In this paper, we address the question "Which vectors, heterogeneous, aggregate, or a combination of both, are more discriminative and why?" We accomplish this in three ways - (1) by providing an intuitive example to illustrate how aggregation of features inherently reduces discriminability; (2) by formulating " discriminability" as a non-parametric estimate of Bhattacharya distance, we show theoretically that the discriminability of a heterogeneous vector is higher than an aggregate vector; and (3) by conducting user recognition experiments using a dataset containing keystrokes from 33 users typing a 32-character reference text, we empirically validate our theoretical analysis. To compare the discriminability of heterogeneous and aggregate vectors with different combinations of keystroke features, we conduct feature selection analysis using three methods: (1) ReliefF, (2) correlation based feature selection, and (3) consistency based feature selection. Results of feature selection analysis reinforce the findings of our theoretical analysis.
AB - Heterogeneous and aggregate vectors are the two widely used feature vectors in fixed text keystroke authentication. In this paper, we address the question "Which vectors, heterogeneous, aggregate, or a combination of both, are more discriminative and why?" We accomplish this in three ways - (1) by providing an intuitive example to illustrate how aggregation of features inherently reduces discriminability; (2) by formulating " discriminability" as a non-parametric estimate of Bhattacharya distance, we show theoretically that the discriminability of a heterogeneous vector is higher than an aggregate vector; and (3) by conducting user recognition experiments using a dataset containing keystrokes from 33 users typing a 32-character reference text, we empirically validate our theoretical analysis. To compare the discriminability of heterogeneous and aggregate vectors with different combinations of keystroke features, we conduct feature selection analysis using three methods: (1) ReliefF, (2) correlation based feature selection, and (3) consistency based feature selection. Results of feature selection analysis reinforce the findings of our theoretical analysis.
UR - http://www.scopus.com/inward/record.url?scp=79952697017&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=79952697017&partnerID=8YFLogxK
U2 - 10.1016/j.patrec.2011.02.014
DO - 10.1016/j.patrec.2011.02.014
M3 - Article
AN - SCOPUS:79952697017
SN - 0167-8655
VL - 32
SP - 1070
EP - 1080
JO - Pattern Recognition Letters
JF - Pattern Recognition Letters
IS - 7
ER -