TY - JOUR
T1 - Integration of machine learning and statistical models for crash frequency modeling
AU - Zhou, Dongqin
AU - Gayah, Vikash V.
AU - Wood, Jonathan S.
N1 - Publisher Copyright:
© 2022 Informa UK Limited, trading as Taylor & Francis Group.
PY - 2023
Y1 - 2023
N2 - Crash frequency modeling has been an active research topic in traffic safety, for which various techniques have been proposed that can be loosely classified as either statistical models or machine learning (ML) methods. Statistical models are suitable for drawing inferences and producing relationships that are verifiable by domain experts. However, they generally suffer from low predictive performance due to built-in assumptions about the crash data and adherence to prespecified functional forms. On the other hand, ML methods are data-driven and free from pre-supposed conditions on the dataset, yet they are often not interpretable. In this paper, a combination scheme is proposed to leverage the advantages of both techniques, and it is evaluated using crash data collected from urban highways in the state of Washington. The results show that this combination scheme could significantly improve the predictive performance and model fitness of statistical models without adversely impacting their interpretability.
AB - Crash frequency modeling has been an active research topic in traffic safety, for which various techniques have been proposed that can be loosely classified as either statistical models or machine learning (ML) methods. Statistical models are suitable for drawing inferences and producing relationships that are verifiable by domain experts. However, they generally suffer from low predictive performance due to built-in assumptions about the crash data and adherence to prespecified functional forms. On the other hand, ML methods are data-driven and free from pre-supposed conditions on the dataset, yet they are often not interpretable. In this paper, a combination scheme is proposed to leverage the advantages of both techniques, and it is evaluated using crash data collected from urban highways in the state of Washington. The results show that this combination scheme could significantly improve the predictive performance and model fitness of statistical models without adversely impacting their interpretability.
UR - http://www.scopus.com/inward/record.url?scp=85144168287&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85144168287&partnerID=8YFLogxK
U2 - 10.1080/19427867.2022.2158257
DO - 10.1080/19427867.2022.2158257
M3 - Article
AN - SCOPUS:85144168287
SN - 1942-7867
VL - 15
SP - 1408
EP - 1419
JO - Transportation Letters
JF - Transportation Letters
IS - 10
ER -