TY - GEN
T1 - Discriminatively trained markov model for sequence classification
AU - Yakhnenko, Oksana
AU - Silvescu, Adrian
AU - Honavar, Vasant
PY - 2005
Y1 - 2005
N2 - In this paper, we propose a discriminative counterpart of the directed Markov Models of order k - 1, or MM (k - 1) for sequence classification. MM(k - 1) models capture dependencies among neighboring elements of a sequence. The parameters of the classifiers are initialized to based on the maximum likelihood estimates for their generative counterparts. We derive gradient based update equations for the parameters of the sequence classifiers in order to maximize the conditional likelihood function. Results of our experiments with data sets drawn from biological sequence classification (specifically protein function and subcellular localization) and text classification applications show that the discriminatively trained sequence classifiers outperform their generative counterparts, confirming the benefits of discriminative training when the primary objective is classification. Our experiments also show that the discriminatively trained MM(k - 1) sequence classifiers are competitive with the computationally much more expensive Support Vector Machines trained using k-gram representations of sequences.
AB - In this paper, we propose a discriminative counterpart of the directed Markov Models of order k - 1, or MM (k - 1) for sequence classification. MM(k - 1) models capture dependencies among neighboring elements of a sequence. The parameters of the classifiers are initialized to based on the maximum likelihood estimates for their generative counterparts. We derive gradient based update equations for the parameters of the sequence classifiers in order to maximize the conditional likelihood function. Results of our experiments with data sets drawn from biological sequence classification (specifically protein function and subcellular localization) and text classification applications show that the discriminatively trained sequence classifiers outperform their generative counterparts, confirming the benefits of discriminative training when the primary objective is classification. Our experiments also show that the discriminatively trained MM(k - 1) sequence classifiers are competitive with the computationally much more expensive Support Vector Machines trained using k-gram representations of sequences.
UR - http://www.scopus.com/inward/record.url?scp=34548551161&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=34548551161&partnerID=8YFLogxK
U2 - 10.1109/ICDM.2005.52
DO - 10.1109/ICDM.2005.52
M3 - Conference contribution
AN - SCOPUS:34548551161
SN - 0769522785
SN - 9780769522784
T3 - Proceedings - IEEE International Conference on Data Mining, ICDM
SP - 498
EP - 505
BT - Proceedings - Fifth IEEE International Conference on Data Mining, ICDM 2005
T2 - 5th IEEE International Conference on Data Mining, ICDM 2005
Y2 - 27 November 2005 through 30 November 2005
ER -