TY - JOUR
T1 - A latent variable representation of count data models to accommodate spatial and temporal dependence
T2 - Application to predicting crash frequency at intersections
AU - Castro, Marisol
AU - Paleti, Rajesh
AU - Bhat, Chandra R.
PY - 2012/1
Y1 - 2012/1
N2 - This paper proposes a reformulation of count models as a special case of generalized ordered-response models in which a single latent continuous variable is partitioned into mutually exclusive intervals. Using this equivalent latent variable-based generalized ordered response framework for count data models, we are then able to gainfully and efficiently introduce temporal and spatial dependencies through the latent continuous variables. Our formulation also allows handling excess zeros in correlated count data, a phenomenon that is commonly found in practice. A composite marginal likelihood inference approach is used to estimate model parameters. The modeling framework is applied to predict crash frequency at urban intersections in Arlington, Texas. The sample is drawn from the Texas Department of Transportation (TxDOT) crash incident files between 2003 and 2009, resulting in 1190 intersection-year observations. The results reveal the presence of intersection-specific time-invariant unobserved components influencing crash propensity and a spatial lag structure to characterize spatial dependence. Roadway configuration, approach roadway functional types, traffic control type, total daily entering traffic volumes and the split of volumes between approaches are all important variables in determining crash frequency at intersections.
AB - This paper proposes a reformulation of count models as a special case of generalized ordered-response models in which a single latent continuous variable is partitioned into mutually exclusive intervals. Using this equivalent latent variable-based generalized ordered response framework for count data models, we are then able to gainfully and efficiently introduce temporal and spatial dependencies through the latent continuous variables. Our formulation also allows handling excess zeros in correlated count data, a phenomenon that is commonly found in practice. A composite marginal likelihood inference approach is used to estimate model parameters. The modeling framework is applied to predict crash frequency at urban intersections in Arlington, Texas. The sample is drawn from the Texas Department of Transportation (TxDOT) crash incident files between 2003 and 2009, resulting in 1190 intersection-year observations. The results reveal the presence of intersection-specific time-invariant unobserved components influencing crash propensity and a spatial lag structure to characterize spatial dependence. Roadway configuration, approach roadway functional types, traffic control type, total daily entering traffic volumes and the split of volumes between approaches are all important variables in determining crash frequency at intersections.
UR - http://www.scopus.com/inward/record.url?scp=81055136593&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=81055136593&partnerID=8YFLogxK
U2 - 10.1016/j.trb.2011.09.007
DO - 10.1016/j.trb.2011.09.007
M3 - Article
AN - SCOPUS:81055136593
SN - 0191-2615
VL - 46
SP - 253
EP - 272
JO - Transportation Research Part B: Methodological
JF - Transportation Research Part B: Methodological
IS - 1
ER -