TY - JOUR
T1 - A Guide to Analyzing Large N, Large T Panel Data
AU - Thombs, Ryan P.
N1 - Publisher Copyright:
© The Author(s) 2022.
PY - 2022
Y1 - 2022
N2 - Fixed effects estimation of a static model with robust or panel corrected standard errors is commonly used to model large N, large T panel data. However, this approach is biased and inconsistent in the presence of dynamic misspecification, slope heterogeneity, and cross-sectional dependence. Common correlated effects estimation of a dynamic model has been advanced to address these issues but is rarely used in sociology. Here, I provide an overview of the large N, large T panel data literature, and I conduct an array of Monte Carlo experiments to compare the fixed effects estimator to the common correlated effects estimator regarding the aforementioned issues. I show that fixed effects estimation with robust or panel corrected standard errors do not address these problems, which is most evident with high levels of slope heterogeneity and lag misspecification, and its performance worsens as the time dimension expands. In contrast, the common correlated effects estimator produces superior estimates as T increases and is robust to slope heterogeneity and cross-sectional dependence. Following the experiments, I present an example by examining the drivers of fossil fuel consumption at the U.S. state level from 1960 to 2018, and I conclude by presenting a decision-making framework for researchers to use to make informed decisions when modeling large N, large T panel data.
AB - Fixed effects estimation of a static model with robust or panel corrected standard errors is commonly used to model large N, large T panel data. However, this approach is biased and inconsistent in the presence of dynamic misspecification, slope heterogeneity, and cross-sectional dependence. Common correlated effects estimation of a dynamic model has been advanced to address these issues but is rarely used in sociology. Here, I provide an overview of the large N, large T panel data literature, and I conduct an array of Monte Carlo experiments to compare the fixed effects estimator to the common correlated effects estimator regarding the aforementioned issues. I show that fixed effects estimation with robust or panel corrected standard errors do not address these problems, which is most evident with high levels of slope heterogeneity and lag misspecification, and its performance worsens as the time dimension expands. In contrast, the common correlated effects estimator produces superior estimates as T increases and is robust to slope heterogeneity and cross-sectional dependence. Following the experiments, I present an example by examining the drivers of fossil fuel consumption at the U.S. state level from 1960 to 2018, and I conclude by presenting a decision-making framework for researchers to use to make informed decisions when modeling large N, large T panel data.
UR - http://www.scopus.com/inward/record.url?scp=85136984629&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85136984629&partnerID=8YFLogxK
U2 - 10.1177/23780231221117645
DO - 10.1177/23780231221117645
M3 - Article
AN - SCOPUS:85136984629
SN - 2378-0231
VL - 8
JO - Socius
JF - Socius
ER -