TY - JOUR
T1 - Predicting MHC-II binding affinity using multiple instance regression
AU - El-Manzalawy, Yasser
AU - Dobbs, Drena
AU - Honavar, Vasant
N1 - Funding Information:
This work was supported in part by a doctoral fellowship from the Egyptian Government to Yasser EL-Manzalawy, a grant from the National Institutes of Health (GM066387) to Vasant Honavar and Drena Dobbs, and a postdoctoral fellowship to Yasser EL-Manzalawy from the Iowa State University Center for Computational Intelligence, Learning, and Discovery.
PY - 2011
Y1 - 2011
N2 - Reliably predicting the ability of antigen peptides to bind to major histocompatibility complex class II (MHC-II) molecules is an essential step in developing new vaccines. Uncovering the amino acid sequence correlates of the binding affinity of MHC-II binding peptides is important for understanding pathogenesis and immune response. The task of predicting MHC-II binding peptides is complicated by the significant variability in their length. Most existing computational methods for predicting MHC-II binding peptides focus on identifying a nine amino acids core region in each binding peptide. We formulate the problems of qualitatively and quantitatively predicting flexible length MHC-II peptides as multiple instance learning and multiple instance regression problems, respectively. Based on this formulation, we introduce MHCMIR, a novel method for predicting MHC-II binding affinity using multiple instance regression. We present results of experiments using several benchmark data sets that show that MHCMIR is competitive with the state-of-the-art methods for predicting MHC-II binding peptides. An online web server that implements the MHCMIR method for MHC-II binding affinity prediction is freely accessible at http://ailab.cs.iastate.edu/mhcmir.
AB - Reliably predicting the ability of antigen peptides to bind to major histocompatibility complex class II (MHC-II) molecules is an essential step in developing new vaccines. Uncovering the amino acid sequence correlates of the binding affinity of MHC-II binding peptides is important for understanding pathogenesis and immune response. The task of predicting MHC-II binding peptides is complicated by the significant variability in their length. Most existing computational methods for predicting MHC-II binding peptides focus on identifying a nine amino acids core region in each binding peptide. We formulate the problems of qualitatively and quantitatively predicting flexible length MHC-II peptides as multiple instance learning and multiple instance regression problems, respectively. Based on this formulation, we introduce MHCMIR, a novel method for predicting MHC-II binding affinity using multiple instance regression. We present results of experiments using several benchmark data sets that show that MHCMIR is competitive with the state-of-the-art methods for predicting MHC-II binding peptides. An online web server that implements the MHCMIR method for MHC-II binding affinity prediction is freely accessible at http://ailab.cs.iastate.edu/mhcmir.
UR - http://www.scopus.com/inward/record.url?scp=79957588907&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=79957588907&partnerID=8YFLogxK
U2 - 10.1109/TCBB.2010.94
DO - 10.1109/TCBB.2010.94
M3 - Article
C2 - 20855923
AN - SCOPUS:79957588907
SN - 1545-5963
VL - 8
SP - 1067
EP - 1079
JO - IEEE/ACM Transactions on Computational Biology and Bioinformatics
JF - IEEE/ACM Transactions on Computational Biology and Bioinformatics
IS - 4
M1 - 5582079
ER -