TY - JOUR
T1 - Non-Stationary Model for Crime Rate Inference Using Modern Urban Data
AU - Wang, Hongjian
AU - Yao, Huaxiu
AU - Kifer, Daniel
AU - Graif, Corina
AU - Li, Zhenhui
N1 - Publisher Copyright:
© 2015 IEEE.
PY - 2019/6/1
Y1 - 2019/6/1
N2 - Crime is one of the most important social problems in the country, affecting public safety, children development, and adult socioeconomic status. Understanding what factors cause higher crime rate is critical for policy makers in their efforts to reduce crime and increase citizens' life quality. We tackle a fundamental problem in our paper: crime rate inference at the neighborhood level. Traditional approaches have used demographics and geographical influences to estimate crime rates in a region. With the fast development of positioning technology and prevalence of mobile devices, a large amount of modern urban data have been collected and such big data can provide new perspectives for understanding crime. In this paper, we use large-scale Point-Of-Interest data and taxi flow data in the city of Chicago, IL in the USA. We observe significantly improved performance in crime rate inference compared to using traditional features. Such an improvement is consistent over multiple years. We also show that these new features are significant in the feature importance analysis. The correlations between crime and various observed features are not constant over the whole city. In order to address this geospatial non-stationary property, we further employ the geographically weighted regression on top of negative binomial model (GWNBR). Experiments have shown that GWNBR outperforms the negative binomial model.
AB - Crime is one of the most important social problems in the country, affecting public safety, children development, and adult socioeconomic status. Understanding what factors cause higher crime rate is critical for policy makers in their efforts to reduce crime and increase citizens' life quality. We tackle a fundamental problem in our paper: crime rate inference at the neighborhood level. Traditional approaches have used demographics and geographical influences to estimate crime rates in a region. With the fast development of positioning technology and prevalence of mobile devices, a large amount of modern urban data have been collected and such big data can provide new perspectives for understanding crime. In this paper, we use large-scale Point-Of-Interest data and taxi flow data in the city of Chicago, IL in the USA. We observe significantly improved performance in crime rate inference compared to using traditional features. Such an improvement is consistent over multiple years. We also show that these new features are significant in the feature importance analysis. The correlations between crime and various observed features are not constant over the whole city. In order to address this geospatial non-stationary property, we further employ the geographically weighted regression on top of negative binomial model (GWNBR). Experiments have shown that GWNBR outperforms the negative binomial model.
UR - http://www.scopus.com/inward/record.url?scp=85140803902&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85140803902&partnerID=8YFLogxK
U2 - 10.1109/TBDATA.2017.2786405
DO - 10.1109/TBDATA.2017.2786405
M3 - Article
AN - SCOPUS:85140803902
SN - 2332-7790
VL - 5
SP - 180
EP - 194
JO - IEEE Transactions on Big Data
JF - IEEE Transactions on Big Data
IS - 2
M1 - 8234616
ER -