TY - JOUR
T1 - Ultraconserved coding regions outside the homeobox of mammalian Hox genes
AU - Lin, Zhenguo
AU - Ma, Hong
AU - Nei, Masatoshi
N1 - Funding Information:
We thank Masafumi Nozawa, Dimitra Chalkia, Saby Das, John Diller, Zhengjia Wang and three anonymous reviewers for critical reading and valuable comments on the manuscript. This work was supported by National Institutes of Health Grant GM020293 (to MN).
PY - 2008
Y1 - 2008
N2 - Background. All bilaterian animals share a general genetic framework that controls the formation of their body structures, although their forms are highly diversified. The Hox genes that encode transcription factors play a central role in this framework. All Hox proteins contain a highly conserved homeodomain encoded by the homeobox motif, but the other regions are generally assumed to be less conserved. In this study, we used comparative genomic methods to infer possible functional elements in the coding regions of mammalian Hox genes. Results. We identified a set of ultraconserved coding regions (UCRs) outside the homeobox of mammalian Hox genes. Here a UCR is defined as a region of at least 120 nucleotides without synonymous and nonsynonymous nucleotide substitutions among different orders of mammals. Further analysis has indicated that these UCRs occur only in placental mammals and they evolved apparently after the split of placental mammals from marsupials. Analysis of human SNP data suggests that these UCRs are maintained by strong purifying selection. Conclusion. Although mammalian genomes are known to contain ultraconserved non-coding elements (UNEs), this paper seems to be the first to report the UCRs in protein coding genes. The extremely high degree of sequence conservation in non-homeobox regions suggests that they might have important roles for the functions of Hox genes. We speculate that UCRs have some gene regulatory functions possibly in relation to the development of the intra-uterus child-bearing system.
AB - Background. All bilaterian animals share a general genetic framework that controls the formation of their body structures, although their forms are highly diversified. The Hox genes that encode transcription factors play a central role in this framework. All Hox proteins contain a highly conserved homeodomain encoded by the homeobox motif, but the other regions are generally assumed to be less conserved. In this study, we used comparative genomic methods to infer possible functional elements in the coding regions of mammalian Hox genes. Results. We identified a set of ultraconserved coding regions (UCRs) outside the homeobox of mammalian Hox genes. Here a UCR is defined as a region of at least 120 nucleotides without synonymous and nonsynonymous nucleotide substitutions among different orders of mammals. Further analysis has indicated that these UCRs occur only in placental mammals and they evolved apparently after the split of placental mammals from marsupials. Analysis of human SNP data suggests that these UCRs are maintained by strong purifying selection. Conclusion. Although mammalian genomes are known to contain ultraconserved non-coding elements (UNEs), this paper seems to be the first to report the UCRs in protein coding genes. The extremely high degree of sequence conservation in non-homeobox regions suggests that they might have important roles for the functions of Hox genes. We speculate that UCRs have some gene regulatory functions possibly in relation to the development of the intra-uterus child-bearing system.
UR - http://www.scopus.com/inward/record.url?scp=53849094809&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=53849094809&partnerID=8YFLogxK
U2 - 10.1186/1471-2148-8-260
DO - 10.1186/1471-2148-8-260
M3 - Article
C2 - 18816392
AN - SCOPUS:53849094809
SN - 1471-2148
VL - 8
JO - BMC Evolutionary Biology
JF - BMC Evolutionary Biology
IS - 1
M1 - 260
ER -