TY - JOUR
T1 - Automated classification and cluster visualization of genotypes derived from high resolution melt curves
AU - Kanderian, Sami
AU - Jiang, Lingxia
AU - Knight, Ivor
N1 - Publisher Copyright:
© 2015 Kanderian et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
PY - 2015/11
Y1 - 2015/11
N2 - High Resolution Melting (HRM) following PCR has been used to identify DNA genotypes. Fluorescent dyes bounded to double strand DNA lose their fluorescence with increasing temperature, yielding different signatures for different genotypes. Recent software tools have been made available to aid in the distinction of different genotypes, but they are not fully automated, used only for research purposes, or require some level of interaction or confirmation from an analyst. Materials and Methods We describe a fully automated machine learning software algorithm that classifies unknown genotypes. Dynamic melt curves are transformed to multidimensional clusters of points whereby a training set is used to establish the distribution of genotype clusters. Subsequently, probabilistic and statistical methods were used to classify the genotypes of unknown DNA samples on 4 different assays (40 VKORC1, CYP2C9∗2, CYP2C9∗3 samples in triplicate, and 49 MTHFR c.665CT samples in triplicate) run on the Roche LC480. Melt curves of each of the triplicates were genotyped separately. Results Automated genotyping called 100% of VKORC1, CYP2C9∗3 and MTHFR c.665CT samples correctly. 97.5% of CYP2C9∗2 melt curves were genotyped correctly with the remaining 2.5% given a no call due to the inability to decipher 3 melt curves in close proximity as either homozygous mutant or wild-type with greater than 99.5% posterior probability. Conclusions We demonstrate the ability to fully automate DNA genotyping from HRM curves systematically and accurately without requiring any user interpretation or interaction with the data. Visualization of genotype clusters and quantification of the expected misclassification rate is also available to provide feedback to assay scientists and engineers as changes are made to the assay or instrument.
AB - High Resolution Melting (HRM) following PCR has been used to identify DNA genotypes. Fluorescent dyes bounded to double strand DNA lose their fluorescence with increasing temperature, yielding different signatures for different genotypes. Recent software tools have been made available to aid in the distinction of different genotypes, but they are not fully automated, used only for research purposes, or require some level of interaction or confirmation from an analyst. Materials and Methods We describe a fully automated machine learning software algorithm that classifies unknown genotypes. Dynamic melt curves are transformed to multidimensional clusters of points whereby a training set is used to establish the distribution of genotype clusters. Subsequently, probabilistic and statistical methods were used to classify the genotypes of unknown DNA samples on 4 different assays (40 VKORC1, CYP2C9∗2, CYP2C9∗3 samples in triplicate, and 49 MTHFR c.665CT samples in triplicate) run on the Roche LC480. Melt curves of each of the triplicates were genotyped separately. Results Automated genotyping called 100% of VKORC1, CYP2C9∗3 and MTHFR c.665CT samples correctly. 97.5% of CYP2C9∗2 melt curves were genotyped correctly with the remaining 2.5% given a no call due to the inability to decipher 3 melt curves in close proximity as either homozygous mutant or wild-type with greater than 99.5% posterior probability. Conclusions We demonstrate the ability to fully automate DNA genotyping from HRM curves systematically and accurately without requiring any user interpretation or interaction with the data. Visualization of genotype clusters and quantification of the expected misclassification rate is also available to provide feedback to assay scientists and engineers as changes are made to the assay or instrument.
UR - http://www.scopus.com/inward/record.url?scp=84960171823&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84960171823&partnerID=8YFLogxK
U2 - 10.1371/journal.pone.0143295
DO - 10.1371/journal.pone.0143295
M3 - Article
C2 - 26605797
AN - SCOPUS:84960171823
SN - 1932-6203
VL - 10
JO - PloS one
JF - PloS one
IS - 11
M1 - e0143295
ER -