TY - JOUR
T1 - Informatics and machine learning to define the phenotype
AU - Basile, Anna Okula
AU - Ritchie, Marylyn De Riggi
N1 - Publisher Copyright:
© 2018 Informa UK Limited, trading as Taylor & Francis Group.
PY - 2018/3/4
Y1 - 2018/3/4
N2 - Introduction: For the past decade, the focus of complex disease research has been the genotype. From technological advancements to the development of analysis methods, great progress has been made. However, advances in our definition of the phenotype have remained stagnant. Phenotype characterization has recently emerged as an exciting area of informatics and machine learning. The copious amounts of diverse biomedical data that have been collected may be leveraged with data-driven approaches to elucidate trait-related features and patterns. Areas covered: In this review, the authors discuss the phenotype in traditional genetic associations and the challenges this has imposed.Approaches for phenotype refinement that can aid in more accurate characterization of traits are also discussed. Further, the authors highlight promising machine learning approaches for establishing a phenotype and the challenges of electronic health record (EHR)-derived data. Expert commentary: The authors hypothesize that through unsupervised machine learning, data-driven approaches can be used to define phenotypes rather than relying on expert clinician knowledge. Through the use of machine learning and an unbiased set of features extracted from clinical repositories, researchers will have the potential to further understand complex traits and identify patient subgroups. This knowledge may lead to more preventative and precise clinical care.
AB - Introduction: For the past decade, the focus of complex disease research has been the genotype. From technological advancements to the development of analysis methods, great progress has been made. However, advances in our definition of the phenotype have remained stagnant. Phenotype characterization has recently emerged as an exciting area of informatics and machine learning. The copious amounts of diverse biomedical data that have been collected may be leveraged with data-driven approaches to elucidate trait-related features and patterns. Areas covered: In this review, the authors discuss the phenotype in traditional genetic associations and the challenges this has imposed.Approaches for phenotype refinement that can aid in more accurate characterization of traits are also discussed. Further, the authors highlight promising machine learning approaches for establishing a phenotype and the challenges of electronic health record (EHR)-derived data. Expert commentary: The authors hypothesize that through unsupervised machine learning, data-driven approaches can be used to define phenotypes rather than relying on expert clinician knowledge. Through the use of machine learning and an unbiased set of features extracted from clinical repositories, researchers will have the potential to further understand complex traits and identify patient subgroups. This knowledge may lead to more preventative and precise clinical care.
UR - http://www.scopus.com/inward/record.url?scp=85044093149&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85044093149&partnerID=8YFLogxK
U2 - 10.1080/14737159.2018.1439380
DO - 10.1080/14737159.2018.1439380
M3 - Review article
C2 - 29431517
AN - SCOPUS:85044093149
SN - 1473-7159
VL - 18
SP - 219
EP - 226
JO - Expert Review of Molecular Diagnostics
JF - Expert Review of Molecular Diagnostics
IS - 3
ER -