TY - JOUR
T1 - Correcting Sample Selection Bias of Historical Digital Trace Data
T2 - Inverse Probability Weighting (IPW) and Type II Tobit Model
AU - Pak, Chankyung
AU - Cotter, Kelley
AU - Thorson, Kjerstin
N1 - Publisher Copyright: © 2022 Taylor & Francis Group, LLC.
PY - 2022
Y1 - 2022
AB - Digital trace data have become one of the central pillars of media research methods. Despite the opportunities for better understanding individual users’ true behaviors in the personalized media environment, many scholars have pointed out the potential for bias in trace data collections, questioning the generalizability of findings based on them. In this study, we propose two statistical bias correction methods, Inverse Probability Weighting (IPW) and Type II Tobit, which are designed to remedy selection bias in inference from digital trace data donated by research participants. Applying these methods to Facebook take-out data, we demonstrate how the correction methods can change estimated effect sizes, which is important for the translation of academic findings into real-world impacts. We conduct two simulation studies, one under fully synthetic and another under partially simulated conditions, and find that Type II Tobit generally provides a more robust and cost-efficient correction method for digital trace data.
UR - http://www.scopus.com/inward/record.url?scp=85125361008&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85125361008&partnerID=8YFLogxK
U2 - 10.1080/19312458.2022.2037537
DO - 10.1080/19312458.2022.2037537
M3 - Article
AN - SCOPUS:85125361008
SN - 1931-2458
VL - 16
SP - 134
EP - 155
JO - Communication Methods and Measures
JF - Communication Methods and Measures
IS - 2
ER -