TY - JOUR
T1 - Distribution of base pair repeats in coding and noncoding DNA sequences
AU - Dokholyan, Nikolay V.
AU - Buldyrev, Sergey V.
AU - Havlin, Shlomo
AU - Stanley, H. Eugene
PY - 1997/1/1
Y1 - 1997/1/1
AB - We analyze the histograms for the lengths of the 16 possible distinct repeats of identical dimers, known as dimeric tandem repeats, in DNA sequences. For coding regions, the probability of finding a repetitive sequence of l copies of a particular dimer decreases exponentially as l increases. For the noncoding regions, the distribution functions for most of the 16 dimers have long tails and can be approximated by power-law functions, while for coding DNA, they can be well fit by a first-order Markov process. We propose a model, based on known biophysical processes, which leads to the observed probability distribution functions for noncoding DNA. We argue that this difference in the shape of the distribution functions between coding and noncoding DNA arises from the fact that noncoding DNA is more tolerant to evolutionary mutational alterations than coding DNA.
UR - http://www.scopus.com/inward/record.url?scp=0000952690&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=0000952690&partnerID=8YFLogxK
U2 - 10.1103/PhysRevLett.79.5182
DO - 10.1103/PhysRevLett.79.5182
M3 - Article
AN - SCOPUS:0000952690
SN - 0031-9007
VL - 79
SP - 5182
EP - 5185
JO - Physical Review Letters
JF - Physical Review Letters
IS - 25
ER -