TY - JOUR
T1 - Inferring protein-DNA dependencies using motif alignments and mutual information
AU - Mahony, Shaun
AU - Auron, Philip E.
AU - Benos, Panayiotis V.
N1 - Funding Information:
The authors would like to thank Carlos Camacho for useful discussion and help with the crystal structures, and an anonymous reviewer for a number of useful insights and suggestions. This work was supported by NSF grant MCB0316255 and by NIH-NIAID contract no. N01-AI50018. P.V.B. was also supported by NIH grants 1R01LM007994-01 and RR014214 and by TATRC/DoD USAMRAA Prime Award W81XWH-05-2-0066. P.E.A. was supported by NIH grant CA06668544.
PY - 2007/7/1
Y1 - 2007/7/1
N2 - Motivation: Mutual information can be used to explore covarying positions in biological sequences. In the past, it has been successfully used to infer RNA secondary structure conformations from multiple sequence alignments. In this study, we show that the same principles allow the discovery of transcription factor amino acids that are coevolving with nucleotides in their DNA-binding targets. Results: Given an alignment of transcription factor binding domains, and a separate alignment of their DNA target motifs, we demonstrate that mutually covarying base-amino acid positions may indicate possible protein-DNA contacts. Examples explored in this study include C2H2 zinc finger, homeodomain and bHLH DNA-binding motif families, where a number of known base-amino acid contacting positions are identified. Mutual information analyses may aid the prediction of base-amino acid contacting pairs for particular transcription factor families, thereby yielding structural insights from sequence information alone. Such inference of protein-DNA contacting positions may guide future experimental studies of DNA recognition.
AB - Motivation: Mutual information can be used to explore covarying positions in biological sequences. In the past, it has been successfully used to infer RNA secondary structure conformations from multiple sequence alignments. In this study, we show that the same principles allow the discovery of transcription factor amino acids that are coevolving with nucleotides in their DNA-binding targets. Results: Given an alignment of transcription factor binding domains, and a separate alignment of their DNA target motifs, we demonstrate that mutually covarying base-amino acid positions may indicate possible protein-DNA contacts. Examples explored in this study include C2H2 zinc finger, homeodomain and bHLH DNA-binding motif families, where a number of known base-amino acid contacting positions are identified. Mutual information analyses may aid the prediction of base-amino acid contacting pairs for particular transcription factor families, thereby yielding structural insights from sequence information alone. Such inference of protein-DNA contacting positions may guide future experimental studies of DNA recognition.
UR - http://www.scopus.com/inward/record.url?scp=34547830863&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=34547830863&partnerID=8YFLogxK
U2 - 10.1093/bioinformatics/btm215
DO - 10.1093/bioinformatics/btm215
M3 - Article
C2 - 17646310
AN - SCOPUS:34547830863
SN - 1367-4803
VL - 23
SP - i297-i304
JO - Bioinformatics
JF - Bioinformatics
IS - 13
ER -