TY - JOUR
T1 - Feeding the machine
T2 - Challenges to reproducible predictive modeling in resting-state connectomics
AU - Cwiek, Andrew
AU - Rajtmajer, Sarah M.
AU - Wyble, Bradley
AU - Honavar, Vasant
AU - Grossner, Emily
AU - Hillary, Frank G.
N1 - Publisher Copyright:
© 2021 Massachusetts Institute of Technology Published under a Creative Commons Attribution 4.0 International (CC BY 4.0) license
PY - 2022/2/1
Y1 - 2022/2/1
N2 - In this critical review, we examine the application of predictive models, for example, classifiers, trained using machine learning (ML) to assist in interpretation of functional neuroimaging data. Our primary goal is to summarize how ML is being applied and critically assess common practices. Our review covers 250 studies published using ML and resting-state functional MRI (fMRI) to infer various dimensions of the human functional connectome. Results for holdout (“lockbox”) performance was, on average, ~13% less accurate than performance measured through cross-validation alone, highlighting the importance of lockbox data, which was included in only 16% of the studies. There was also a concerning lack of transparency across the key steps in training and evaluating predictive models. The summary of this literature underscores the importance of the use of a lockbox and highlights several methodological pitfalls that can be addressed by the imaging community. We argue that, ideally, studies are motivated both by the reproducibility and generalizability of findings as well as the potential clinical significance of the insights. We offer recommendations for principled integration of machine learning into the clinical neurosciences with the goal of advancing imaging biomarkers of brain disorders, understanding causative determinants for health risks, and parsing heterogeneous patient outcomes.
AB - In this critical review, we examine the application of predictive models, for example, classifiers, trained using machine learning (ML) to assist in interpretation of functional neuroimaging data. Our primary goal is to summarize how ML is being applied and critically assess common practices. Our review covers 250 studies published using ML and resting-state functional MRI (fMRI) to infer various dimensions of the human functional connectome. Results for holdout (“lockbox”) performance was, on average, ~13% less accurate than performance measured through cross-validation alone, highlighting the importance of lockbox data, which was included in only 16% of the studies. There was also a concerning lack of transparency across the key steps in training and evaluating predictive models. The summary of this literature underscores the importance of the use of a lockbox and highlights several methodological pitfalls that can be addressed by the imaging community. We argue that, ideally, studies are motivated both by the reproducibility and generalizability of findings as well as the potential clinical significance of the insights. We offer recommendations for principled integration of machine learning into the clinical neurosciences with the goal of advancing imaging biomarkers of brain disorders, understanding causative determinants for health risks, and parsing heterogeneous patient outcomes.
UR - http://www.scopus.com/inward/record.url?scp=85124066434&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85124066434&partnerID=8YFLogxK
U2 - 10.1162/netn_a_00212
DO - 10.1162/netn_a_00212
M3 - Article
C2 - 35350584
AN - SCOPUS:85124066434
SN - 2472-1751
VL - 6
SP - 29
EP - 48
JO - Network Neuroscience
JF - Network Neuroscience
IS - 1
ER -