TY - JOUR
T1 - Characterization of background noise in MiSeq MPS data when sequencing human mitochondrial DNA from various sample sources and library preparation methods
AU - McElhoe, Jennifer A.
AU - Holland, Mitchell M.
N1 - Publisher Copyright:
© 2020
PY - 2020/5
Y1 - 2020/5
N2 - Improved resolution of massively parallel sequencing (MPS) allows for the characterization of mitochondrial (mt) DNA heteroplasmy to levels previously unattainable with traditional sequencing approaches. An essential criterion for the reporting of heteroplasmy is the ability of the MPS method to distinguish minor sequence variants (MSVs) from system noise, or error. Therefore, an assessment of the background noise in the MPS method is desirable to identify the point at which reliable data can be reported. Substitution and sequence specific error (SSE) was evaluated for a variety of sample types and two library preparations. Substitution error rates ranged from 0.18 to 0.49 per 100 nucleotides with C positions generally having the highest rate of misincorporation. Comparison of error rates across sample types indicated a significant increase for samples with damaged DNA. The positions of error were varied across datasets (pairwise concordance 0–68%), but had greater consistency within the damaged samples (80–96%). The most commonly observed motif preceding error in forward reads was CCG, while GGT was most common in reverse reads, both consistent with previous findings. The findings illustrate that for datasets containing samples with damaged DNA, reporting thresholds for heteroplasmy may have to be modified and individual sites with error levels exceeding thresholds should be scrutinized. Collectively, the shifting error profiles observed across the various sample types and library preparation methods demonstrates the need for an assessment of error under these varying circumstances. Characterization of the applicable background noise will help to ensure that thresholds are reliably set for detection of true MSVs.
AB - Improved resolution of massively parallel sequencing (MPS) allows for the characterization of mitochondrial (mt) DNA heteroplasmy to levels previously unattainable with traditional sequencing approaches. An essential criterion for the reporting of heteroplasmy is the ability of the MPS method to distinguish minor sequence variants (MSVs) from system noise, or error. Therefore, an assessment of the background noise in the MPS method is desirable to identify the point at which reliable data can be reported. Substitution and sequence specific error (SSE) was evaluated for a variety of sample types and two library preparations. Substitution error rates ranged from 0.18 to 0.49 per 100 nucleotides with C positions generally having the highest rate of misincorporation. Comparison of error rates across sample types indicated a significant increase for samples with damaged DNA. The positions of error were varied across datasets (pairwise concordance 0–68%), but had greater consistency within the damaged samples (80–96%). The most commonly observed motif preceding error in forward reads was CCG, while GGT was most common in reverse reads, both consistent with previous findings. The findings illustrate that for datasets containing samples with damaged DNA, reporting thresholds for heteroplasmy may have to be modified and individual sites with error levels exceeding thresholds should be scrutinized. Collectively, the shifting error profiles observed across the various sample types and library preparation methods demonstrates the need for an assessment of error under these varying circumstances. Characterization of the applicable background noise will help to ensure that thresholds are reliably set for detection of true MSVs.
UR - http://www.scopus.com/inward/record.url?scp=85079880019&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85079880019&partnerID=8YFLogxK
U2 - 10.1016/j.mito.2020.02.005
DO - 10.1016/j.mito.2020.02.005
M3 - Article
C2 - 32068127
AN - SCOPUS:85079880019
SN - 1567-7249
VL - 52
SP - 40
EP - 55
JO - Mitochondrion
JF - Mitochondrion
ER -