TY - JOUR
T1 - Mechanistic machine learning
T2 - How data assimilation leverages physiologic knowledge using Bayesian inference to forecast the future, infer the present, and phenotype
AU - Albers, David J.
AU - Levine, Matthew E.
AU - Stuart, Andrew
AU - Mamykina, Lena
AU - Gluckman, Bruce
AU - Hripcsak, George
N1 - Publisher Copyright:
© © The Author(s) 2018. Published by Oxford University Press on behalf of the American Medical Informatics Association.
PY - 2018/10/1
Y1 - 2018/10/1
N2 - We introduce data assimilation as a computational method that uses machine learning to combine data with human knowledge in the form of mechanistic models in order to forecast future states, to impute missing data from the past by smoothing, and to infer measurable and unmeasurable quantities that represent clinically and scientifically important phenotypes. We demonstrate the advantages it affords in the context of type 2 diabetes by showing how data assimilation can be used to forecast future glucose values, to impute previously missing glucose values, and to infer type 2 diabetes phenotypes. At the heart of data assimilation is the mechanistic model, here an endocrine model. Such models can vary in complexity, contain testable hypotheses about important mechanics that govern the system (eg, nutrition's effect on glucose), and, as such, constrain the model space, allowing for accurate estimation using very little data.
AB - We introduce data assimilation as a computational method that uses machine learning to combine data with human knowledge in the form of mechanistic models in order to forecast future states, to impute missing data from the past by smoothing, and to infer measurable and unmeasurable quantities that represent clinically and scientifically important phenotypes. We demonstrate the advantages it affords in the context of type 2 diabetes by showing how data assimilation can be used to forecast future glucose values, to impute previously missing glucose values, and to infer type 2 diabetes phenotypes. At the heart of data assimilation is the mechanistic model, here an endocrine model. Such models can vary in complexity, contain testable hypotheses about important mechanics that govern the system (eg, nutrition's effect on glucose), and, as such, constrain the model space, allowing for accurate estimation using very little data.
UR - http://www.scopus.com/inward/record.url?scp=85054889195&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85054889195&partnerID=8YFLogxK
U2 - 10.1093/jamia/ocy106
DO - 10.1093/jamia/ocy106
M3 - Review article
C2 - 30312445
AN - SCOPUS:85054889195
SN - 1067-5027
VL - 25
SP - 1392
EP - 1401
JO - Journal of the American Medical Informatics Association
JF - Journal of the American Medical Informatics Association
IS - 10
ER -