Understanding selection bias, time-lags and measurement bias in secondary data sources: Putting the Encyclopedia of Associations database in broader context

Shaun Bevan, Frank R. Baumgartner, Erik W. Johnson, John D. McCarthy

Research output: Contribution to journalArticlepeer-review

22 Scopus citations

Abstract

Secondary data gathered for purposes other than research play an important role in the social sciences. A recent data release has made an important source of publicly available data on associational interests, the Encyclopedia of Associations (EA), readily accessible to scholars (www.policyagendas.org). In this paper we introduce these new data and systematically investigate issues of lag between events and subsequent reporting in the EA, as these have important but under-appreciated effects on time-series statistical models. We further analyze the accuracy and coverage of the database in numerous ways. Our study serves as a guide to potential users of this database, but we also reflect upon a number of issues that should concern all researchers who use secondary data such as newspaper records, IRS reports and FBI Uniform Crime Reports.

Original languageEnglish (US)
Pages (from-to)1750-1764
Number of pages15
JournalSocial Science Research
Volume42
Issue number6
DOIs
StatePublished - Nov 2013

All Science Journal Classification (ASJC) codes

  • Education
  • Sociology and Political Science

Fingerprint

Dive into the research topics of 'Understanding selection bias, time-lags and measurement bias in secondary data sources: Putting the Encyclopedia of Associations database in broader context'. Together they form a unique fingerprint.

Cite this