TY - GEN
T1 - Improving the Prediction Accuracy of Data-Driven Fault Diagnosis for HVAC Systems by Applying the Synthetic Minority Oversampling Technique
AU - Shakerian, Shahrad
AU - Jebelli, Houtan
AU - Sitzabee, William E.
N1 - Publisher Copyright:
© 2021 Computing in Civil Engineering 2021 - Selected Papers from the ASCE International Conference on Computing in Civil Engineering 2021. All rights reserved.
PY - 2021
Y1 - 2021
N2 - Faulty operation of heating, ventilation, and air conditioning (HVAC) systems in a building can lead to thermal discomfort, wasted energy, and shorter equipment life. Early fault diagnosis in HVAC systems is critical to maintaining indoor environmental comfort, saving energy, and preventing further deterioration of the system. Data-driven predictive models have emerged as a popular approach to fault diagnosis of HVAC systems. For an HVAC system, failures are a small probability event. Therefore, the prediction accuracy of these data-driven approaches is affected by imbalanced data sets. The normal data far outweigh the fault data. Predictive models trained with these imbalanced data sets will perform poorly in fault diagnosis of an HVAC system. To address this limitation, this study aims to examine the potential of the synthetic minority oversampling technique (SMOTE) for sampling fault data points to improve the accuracy of the fault predictive model. To that end, six months of operational data (for example, humidity and temperature) were collected from embedded sensors in the HVAC system. A SMOTE algorithm was applied to increase the ratio of failure to normal data from 0.02 to 0.3. Both the original and improved data sets were used to train the fault predictive model based on different supervised learning algorithms. Results indicated that data sets improved through the SMOTE algorithm increased the accuracy of the predictive model by about 20%. This improvement can lay the groundwork for increasingly proactive facility maintenance.
AB - Faulty operation of heating, ventilation, and air conditioning (HVAC) systems in a building can lead to thermal discomfort, wasted energy, and shorter equipment life. Early fault diagnosis in HVAC systems is critical to maintaining indoor environmental comfort, saving energy, and preventing further deterioration of the system. Data-driven predictive models have emerged as a popular approach to fault diagnosis of HVAC systems. For an HVAC system, failures are a small probability event. Therefore, the prediction accuracy of these data-driven approaches is affected by imbalanced data sets. The normal data far outweigh the fault data. Predictive models trained with these imbalanced data sets will perform poorly in fault diagnosis of an HVAC system. To address this limitation, this study aims to examine the potential of the synthetic minority oversampling technique (SMOTE) for sampling fault data points to improve the accuracy of the fault predictive model. To that end, six months of operational data (for example, humidity and temperature) were collected from embedded sensors in the HVAC system. A SMOTE algorithm was applied to increase the ratio of failure to normal data from 0.02 to 0.3. Both the original and improved data sets were used to train the fault predictive model based on different supervised learning algorithms. Results indicated that data sets improved through the SMOTE algorithm increased the accuracy of the predictive model by about 20%. This improvement can lay the groundwork for increasingly proactive facility maintenance.
UR - http://www.scopus.com/inward/record.url?scp=85132545664&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85132545664&partnerID=8YFLogxK
U2 - 10.1061/9780784483893.012
DO - 10.1061/9780784483893.012
M3 - Conference contribution
AN - SCOPUS:85132545664
T3 - Computing in Civil Engineering 2021 - Selected Papers from the ASCE International Conference on Computing in Civil Engineering 2021
SP - 90
EP - 97
BT - Computing in Civil Engineering 2021 - Selected Papers from the ASCE International Conference on Computing in Civil Engineering 2021
A2 - Issa, R. Raymond A.
PB - American Society of Civil Engineers (ASCE)
T2 - 2021 International Conference on Computing in Civil Engineering, I3CE 2021
Y2 - 12 September 2021 through 14 September 2021
ER -