TY - JOUR
T1 - Predicting serious postoperative complications and evaluating racial fairness in machine learning algorithms for metabolic and bariatric surgery
AU - Kang, Dong Won
AU - Zhou, Shouhao
AU - Torres, Russell
AU - Chowdhury, Abhinandan
AU - Niranjan, Suman
AU - Rogers, Ann
AU - Shen, Chan
N1 - Publisher Copyright:
© 2024 American Society for Metabolic and Bariatric Surgery
PY - 2024/11
Y1 - 2024/11
N2 - Background: Predicting the risk of complications is critical in metabolic and bariatric surgery (MBS). Objectives: To develop machine learning (ML) models to predict serious postoperative complications of MBS and evaluate racial fairness of the models. Setting: Metabolic and Bariatric Surgery Accreditation and Quality Improvement Program (MBSAQIP) national database, United States. Methods: We developed logistic regression, random forest (RF), gradient-boosted tree (GBT), and XGBoost model using the MBSAQIP Participant Use Data File from 2016 to 2020. To address the class imbalance, we randomly undersampled the complication-negative class to match the complication-positive class. Model performance was evaluated using the area under the receiver operating characteristic curve (AUROC), precision, recall, and F1 score. Fairness across White and non-White patient groups was assessed using equal opportunity difference and disparate impact metrics. Results: A total of 40,858 patients were included after undersampling the complication-negative class. The XGBoost model was the best-performing model in terms of AUROC; however, the difference was not statistically significant. While the F1 score and precision did not vary significantly across models, the RF exhibited better recall compared to the logistic regression. Surgery type was the most important feature to predict complications, followed by operative time. The logistic regression model had the best fairness metrics for race. Conclusions: The XGBoost model achieved the highest AUROC, albeit without a statistically significant difference. The RF may be useful when recall is the primary concern. Undersampling of the privileged group may improve the fairness of boosted tree models.
AB - Background: Predicting the risk of complications is critical in metabolic and bariatric surgery (MBS). Objectives: To develop machine learning (ML) models to predict serious postoperative complications of MBS and evaluate racial fairness of the models. Setting: Metabolic and Bariatric Surgery Accreditation and Quality Improvement Program (MBSAQIP) national database, United States. Methods: We developed logistic regression, random forest (RF), gradient-boosted tree (GBT), and XGBoost model using the MBSAQIP Participant Use Data File from 2016 to 2020. To address the class imbalance, we randomly undersampled the complication-negative class to match the complication-positive class. Model performance was evaluated using the area under the receiver operating characteristic curve (AUROC), precision, recall, and F1 score. Fairness across White and non-White patient groups was assessed using equal opportunity difference and disparate impact metrics. Results: A total of 40,858 patients were included after undersampling the complication-negative class. The XGBoost model was the best-performing model in terms of AUROC; however, the difference was not statistically significant. While the F1 score and precision did not vary significantly across models, the RF exhibited better recall compared to the logistic regression. Surgery type was the most important feature to predict complications, followed by operative time. The logistic regression model had the best fairness metrics for race. Conclusions: The XGBoost model achieved the highest AUROC, albeit without a statistically significant difference. The RF may be useful when recall is the primary concern. Undersampling of the privileged group may improve the fairness of boosted tree models.
UR - http://www.scopus.com/inward/record.url?scp=85203004378&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85203004378&partnerID=8YFLogxK
U2 - 10.1016/j.soard.2024.08.008
DO - 10.1016/j.soard.2024.08.008
M3 - Article
C2 - 39232870
AN - SCOPUS:85203004378
SN - 1550-7289
VL - 20
SP - 1056
EP - 1064
JO - Surgery for Obesity and Related Diseases
JF - Surgery for Obesity and Related Diseases
IS - 11
ER -