TY - JOUR
T1 - Comparative analysis of mass spectral matching-based compound identification in gas chromatography-mass spectrometry
AU - Koo, Imhoi
AU - Kim, Seongho
AU - Zhang, Xiang
N1 - Funding Information:
This work was supported by National Institutes of Health (NIH) grant 1RO1GM087735 through the National Institute of General Medical Sciences (NIGMS) and R21ES021311 through the National Institute of Environmental Health Sciences (NIEHS).
PY - 2013/7/12
Y1 - 2013/7/12
N2 - Compound identification in gas chromatography-mass spectrometry (GC-MS) is usually achieved by matching query spectra to spectra present in a reference library. Although several spectral similarity measures have been developed and compared using a small reference library, it remains unknown how the relationship between the spectral similarity measure and the size of the reference library affects the identification accuracy and the optimal weight factor. We used three reference libraries to investigate the dependency among the optimal weight factor, the spectral similarity measure, and the size of the reference library. Our study demonstrated that the optimal weight factor depends not only on the spectral similarity measure but also on the size of the reference library. The mixture semi-partial correlation measure outperforms all existing spectral similarity measures in all tested reference libraries, despite its computational expense. Furthermore, the accuracy of compound identification with a larger future reference library is estimated by varying the size of the reference library. A simulation study indicates that the mixture semi-partial correlation measure will perform best as the reference library grows.
AB - Compound identification in gas chromatography-mass spectrometry (GC-MS) is usually achieved by matching query spectra to spectra present in a reference library. Although several spectral similarity measures have been developed and compared using a small reference library, it remains unknown how the relationship between the spectral similarity measure and the size of the reference library affects the identification accuracy and the optimal weight factor. We used three reference libraries to investigate the dependency among the optimal weight factor, the spectral similarity measure, and the size of the reference library. Our study demonstrated that the optimal weight factor depends not only on the spectral similarity measure but also on the size of the reference library. The mixture semi-partial correlation measure outperforms all existing spectral similarity measures in all tested reference libraries, despite its computational expense. Furthermore, the accuracy of compound identification with a larger future reference library is estimated by varying the size of the reference library. A simulation study indicates that the mixture semi-partial correlation measure will perform best as the reference library grows.
UR - http://www.scopus.com/inward/record.url?scp=84879155741&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84879155741&partnerID=8YFLogxK
U2 - 10.1016/j.chroma.2013.05.021
DO - 10.1016/j.chroma.2013.05.021
M3 - Article
C2 - 23726352
AN - SCOPUS:84879155741
SN - 0021-9673
VL - 1298
SP - 132
EP - 138
JO - Journal of Chromatography A
JF - Journal of Chromatography A
ER -