TY - JOUR
T1 - A survey of k-mer methods and applications in bioinformatics
AU - Moeckel, Camille
AU - Mareboina, Manvita
AU - Konnaris, Maxwell A.
AU - Chan, Candace S.Y.
AU - Mouratidis, Ioannis
AU - Montgomery, Austin
AU - Chantzi, Nikol
AU - Pavlopoulos, Georgios A.
AU - Georgakopoulos-Soares, Ilias
N1 - Publisher Copyright: © 2024 The Authors
PY - 2024/12
Y1 - 2024/12
N2 - The rapid progression of genomics and proteomics has been driven by the advent of advanced sequencing technologies, large, diverse, and readily available omics datasets, and the evolution of computational data processing capabilities. The vast amount of data generated by these advancements necessitates efficient algorithms to extract meaningful information. K-mers serve as a valuable tool when working with large sequencing datasets, offering several advantages in computational speed and memory efficiency and carrying the potential for intrinsic biological functionality. This review provides an overview of the methods, applications, and significance of k-mers in genomic and proteomic data analyses, as well as the utility of absent sequences, including nullomers and nullpeptides, in disease detection, vaccine development, therapeutics, and forensic science. Therefore, the review highlights the pivotal role of k-mers in addressing current genomic and proteomic problems and underscores their potential for future breakthroughs in research.
AB - The rapid progression of genomics and proteomics has been driven by the advent of advanced sequencing technologies, large, diverse, and readily available omics datasets, and the evolution of computational data processing capabilities. The vast amount of data generated by these advancements necessitates efficient algorithms to extract meaningful information. K-mers serve as a valuable tool when working with large sequencing datasets, offering several advantages in computational speed and memory efficiency and carrying the potential for intrinsic biological functionality. This review provides an overview of the methods, applications, and significance of k-mers in genomic and proteomic data analyses, as well as the utility of absent sequences, including nullomers and nullpeptides, in disease detection, vaccine development, therapeutics, and forensic science. Therefore, the review highlights the pivotal role of k-mers in addressing current genomic and proteomic problems and underscores their potential for future breakthroughs in research.
UR - http://www.scopus.com/inward/record.url?scp=85194267933&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85194267933&partnerID=8YFLogxK
U2 - 10.1016/j.csbj.2024.05.025
DO - 10.1016/j.csbj.2024.05.025
M3 - Review article
C2 - 38840832
AN - SCOPUS:85194267933
SN - 2001-0370
VL - 23
SP - 2289
EP - 2303
JO - Computational and Structural Biotechnology Journal
JF - Computational and Structural Biotechnology Journal
ER -