TY - JOUR
T1 - Recharacterization of ancient DNA miscoding lesions
T2 - Insights in the era of sequencing-by-synthesis
AU - Gilbert, M. Thomas P.
AU - Binladen, Jonas
AU - Miller, Webb
AU - Wiuf, Carsten
AU - Willerslev, Eske
AU - Poinar, Hendrik
AU - Carlson, John E.
AU - Leebens-Mack, James H.
AU - Schuster, Stephan C.
N1 - Funding Information:
M.T.P.G. was supported by the Marie Curie FP6 Actions ‘FORMAPLEX’ grant. J.B. and E.W. were supported by the Wellcome Trust, UK, the Carlsberg Foundation, DK, and the National Science Foundation, DK. W.M. was supported by HIN grant HG002238. This project is funded, in part, under a grant with the Pennsylvania Department of Health using Tobacco Settlement Funds appropriated by the legislature. The Department specifically disclaims responsibility for any analyses, interpretations or conclusions. Funding to pay the Open Access charges for this article was provided by Marie Curies FP6 Actions ‘FORMAPLEX’ grant.
PY - 2007/1
Y1 - 2007/1
N2 - Although ancient DNA (aDNA) miscoding lesions have been studied since the earliest days of the field, their nature remains a source of debate. A variety of conflicting hypotheses exist about which miscoding lesions constitute true aDNA damage as opposed to PCR polymerase amplification error. Furthermore, considerable disagreement and speculation exists on which specific damage events underlie observed miscoding lesions. The root of the problem is that it has previously been difficult to assemble sufficient data to test the hypotheses, and near-impossible to accurately determine the specific strand of origin of observed damage events. With the advent of emulsion-based clonal amplification (emPCR) and the sequencing-by-synthesis technology this has changed. In this paper we demonstrate how data produced on the Roche GS20 genome sequencer can determine miscoding lesion strands of origin, and subsequently be interpreted to enable characterization of the aDNA damage behind the observed phenotypes. Through comparative analyses on 390 965 bp of modern chloroplast and 131 474 bp of ancient woolly mammoth GS20 sequence data we conclusively demonstrate that in this sample at least, a permafrost preserved specimen, Type 2 (cytosine→thymine/guanine→adenine) miscoding lesions represent the overwhelming majority of damage-derived miscoding lesions. Additionally, we show that an as yet unidentified guanine→adenine analogue modification, not the conventionally argued cytosine→uracil deamination, underpins a significant proportion of Type 2 damage. How widespread these implications are for aDNA will become apparent as future studies analyse data recovered from a wider range of substrates.
AB - Although ancient DNA (aDNA) miscoding lesions have been studied since the earliest days of the field, their nature remains a source of debate. A variety of conflicting hypotheses exist about which miscoding lesions constitute true aDNA damage as opposed to PCR polymerase amplification error. Furthermore, considerable disagreement and speculation exists on which specific damage events underlie observed miscoding lesions. The root of the problem is that it has previously been difficult to assemble sufficient data to test the hypotheses, and near-impossible to accurately determine the specific strand of origin of observed damage events. With the advent of emulsion-based clonal amplification (emPCR) and the sequencing-by-synthesis technology this has changed. In this paper we demonstrate how data produced on the Roche GS20 genome sequencer can determine miscoding lesion strands of origin, and subsequently be interpreted to enable characterization of the aDNA damage behind the observed phenotypes. Through comparative analyses on 390 965 bp of modern chloroplast and 131 474 bp of ancient woolly mammoth GS20 sequence data we conclusively demonstrate that in this sample at least, a permafrost preserved specimen, Type 2 (cytosine→thymine/guanine→adenine) miscoding lesions represent the overwhelming majority of damage-derived miscoding lesions. Additionally, we show that an as yet unidentified guanine→adenine analogue modification, not the conventionally argued cytosine→uracil deamination, underpins a significant proportion of Type 2 damage. How widespread these implications are for aDNA will become apparent as future studies analyse data recovered from a wider range of substrates.
UR - https://www.scopus.com/pages/publications/33846667604
UR - https://www.scopus.com/pages/publications/33846667604#tab=citedBy
U2 - 10.1093/nar/gkl483
DO - 10.1093/nar/gkl483
M3 - Article
C2 - 16920744
AN - SCOPUS:33846667604
SN - 0305-1048
VL - 35
SP - 1
EP - 10
JO - Nucleic acids research
JF - Nucleic acids research
IS - 1
ER -