Abstract
This study addresses gaps in the statistical analysis of longitudinal zero-inflated count data by providing a comprehensive review and comparison of the hurdle and zero-inflated Poisson models in terms of their conceptual frameworks, computational advantages, and performance under a range of realistic data conditions. The simulations are designed to reflect the distinctive features of a well-known longitudinal study of alcoholism, so that the results generalize to the substance abuse field. When the hurdle model is the more natural choice under the conceptual framework of the data, the zero-inflated Poisson model tends to produce inaccurate estimates. Model performance improves with larger sample sizes, lower proportions of missing data, and lower correlations between covariates. The simulations also show that the computational advantage of the hurdle model disappears when random effects are included.
Original language | English (US) |
---|---|
Pages (from-to) | 4074-4086 |
Number of pages | 13 |
Journal | Statistics in Medicine |
Volume | 31 |
Issue number | 29 |
State | Published - Dec 20 2012 |
All Science Journal Classification (ASJC) codes
- Epidemiology
- Statistics and Probability