TY - GEN
T1 - Comparing kernels for predicting protein binding sites from amino acid sequence
AU - Wu, Feihong
AU - Olson, Byron
AU - Dobbs, Drena
AU - Honavar, Vasant
PY - 2006
Y1 - 2006
N2 - The ability to identify protein binding sites and to detect specific amino acid residues that contribute to the specificity and affinity of protein interactions has important implications for problems ranging from rational drug design to analysis of metabolic and signal transduction networks. Support vector machines (SVM) and related kernel methods offer an attractive approach to predicting protein binding sites. An appropriate choice of the kernel function is critical to the performance of SVM. Kernel functions offer a way to incorporate domain-specific knowledge into the classifier. We compare the performance of three types of kernels functions: identity kernel, sequence-alignment kernel, and amino acid substitution matrix kernel in the case of SVM classifiers for predicting protein-protein, protein-DNA and protein-RNA binding sites. The results show that the identity kernel is quite effective in on all three tasks. The substitution kernel based on amino acid substitution matrices that take into account structural or evolutionary conservation or physicochemical properties of amino acids yields modest improvement.
AB - The ability to identify protein binding sites and to detect specific amino acid residues that contribute to the specificity and affinity of protein interactions has important implications for problems ranging from rational drug design to analysis of metabolic and signal transduction networks. Support vector machines (SVM) and related kernel methods offer an attractive approach to predicting protein binding sites. An appropriate choice of the kernel function is critical to the performance of SVM. Kernel functions offer a way to incorporate domain-specific knowledge into the classifier. We compare the performance of three types of kernels functions: identity kernel, sequence-alignment kernel, and amino acid substitution matrix kernel in the case of SVM classifiers for predicting protein-protein, protein-DNA and protein-RNA binding sites. The results show that the identity kernel is quite effective in on all three tasks. The substitution kernel based on amino acid substitution matrices that take into account structural or evolutionary conservation or physicochemical properties of amino acids yields modest improvement.
UR - http://www.scopus.com/inward/record.url?scp=40649099150&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=40649099150&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:40649099150
SN - 0780394909
SN - 9780780394902
T3 - IEEE International Conference on Neural Networks - Conference Proceedings
SP - 1612
EP - 1616
BT - International Joint Conference on Neural Networks 2006, IJCNN '06
T2 - International Joint Conference on Neural Networks 2006, IJCNN '06
Y2 - 16 July 2006 through 21 July 2006
ER -