TY - GEN
T1 - Identifying Probable Neurological Disorders with Explainable Machine Learning Techniques
AU - Khatriya, Neha
AU - Chen, Tianjie
AU - Kabir, MdFaisal
AU - Brearly, Timothy
N1 - Publisher Copyright:
© 2024 IEEE.
PY - 2024
Y1 - 2024
N2 - As the global burden of neurological disorders increases, the need for new tools to diagnose brain disease has become increasingly critical. Several machine learning models have been developed to classify various brain disorders; however, ensuring that these models can provide clinically useful information is as important as good prediction performance. In this study, we evaluated the effectiveness of various machine learning models to accurately identify individuals with potential cerebral pathology using neuropsychological data collected during routine clinical care. Three machine learning models-Random Forest, XGBoost, and Graph Neural Network-were trained and evaluated. The models' performances were compared to identify the most effective approach. The model with the best performance was then used to generate explainability plots, offering insights into the key features that contribute to predictions. Our work shows that Graph Neural Network best suited for routine clinical data, where missing and imbalanced data is commonplace due to the prioritization of patient needs over data completeness.
AB - As the global burden of neurological disorders increases, the need for new tools to diagnose brain disease has become increasingly critical. Several machine learning models have been developed to classify various brain disorders; however, ensuring that these models can provide clinically useful information is as important as good prediction performance. In this study, we evaluated the effectiveness of various machine learning models to accurately identify individuals with potential cerebral pathology using neuropsychological data collected during routine clinical care. Three machine learning models-Random Forest, XGBoost, and Graph Neural Network-were trained and evaluated. The models' performances were compared to identify the most effective approach. The model with the best performance was then used to generate explainability plots, offering insights into the key features that contribute to predictions. Our work shows that Graph Neural Network best suited for routine clinical data, where missing and imbalanced data is commonplace due to the prioritization of patient needs over data completeness.
UR - http://www.scopus.com/inward/record.url?scp=85218071451&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85218071451&partnerID=8YFLogxK
U2 - 10.1109/BigData62323.2024.10826057
DO - 10.1109/BigData62323.2024.10826057
M3 - Conference contribution
AN - SCOPUS:85218071451
T3 - Proceedings - 2024 IEEE International Conference on Big Data, BigData 2024
SP - 5006
EP - 5013
BT - Proceedings - 2024 IEEE International Conference on Big Data, BigData 2024
A2 - Ding, Wei
A2 - Lu, Chang-Tien
A2 - Wang, Fusheng
A2 - Di, Liping
A2 - Wu, Kesheng
A2 - Huan, Jun
A2 - Nambiar, Raghu
A2 - Li, Jundong
A2 - Ilievski, Filip
A2 - Baeza-Yates, Ricardo
A2 - Hu, Xiaohua
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2024 IEEE International Conference on Big Data, BigData 2024
Y2 - 15 December 2024 through 18 December 2024
ER -