TY - JOUR
T1 - A Correlated Network Scale-Up Model
T2 - Finding the Connection Between Subpopulations
AU - Laga, Ian
AU - Bao, Le
AU - Niu, Xiaoyue
N1 - Publisher Copyright:
© 2023 American Statistical Association.
PY - 2023
Y1 - 2023
N2 - Aggregated Relational Data (ARD), formed from “How many X’s do you know?” questions, is a powerful tool for learning important network characteristics with incomplete network data. Compared to traditional survey methods, ARD is attractive as it does not require a sample from the target population and does not ask respondents to self-reveal their own status. This is helpful for studying hard-to-reach populations like female sex workers who may be hesitant to reveal their status. From December 2008 to February 2009, the Kiev International Institute of Sociology (KIIS) collected ARD from 10,866 respondents to estimate the size of HIV-related groups in Ukraine. To analyze this data, we propose a new ARD model which incorporates respondent and group covariates in a regression framework and includes a bias term that is correlated between groups. We also introduce a new scaling procedure using the correlation structure to further reduce biases. The resulting size estimates of those most-at-risk of HIV infection can improve the HIV response efficiency in Ukraine. Additionally, the proposed model allows us to better understand two network features without the full network data: (a) What characteristics affect who respondents know, and (b) How is knowing someone from one group related to knowing people from other groups. These features can allow researchers to better recruit marginalized individuals into the prevention and treatment programs. Our proposed model and several existing NSUM models are implemented in the networkscaleup R package. Supplementary materials for this article are available online.
AB - Aggregated Relational Data (ARD), formed from “How many X’s do you know?” questions, is a powerful tool for learning important network characteristics with incomplete network data. Compared to traditional survey methods, ARD is attractive as it does not require a sample from the target population and does not ask respondents to self-reveal their own status. This is helpful for studying hard-to-reach populations like female sex workers who may be hesitant to reveal their status. From December 2008 to February 2009, the Kiev International Institute of Sociology (KIIS) collected ARD from 10,866 respondents to estimate the size of HIV-related groups in Ukraine. To analyze this data, we propose a new ARD model which incorporates respondent and group covariates in a regression framework and includes a bias term that is correlated between groups. We also introduce a new scaling procedure using the correlation structure to further reduce biases. The resulting size estimates of those most-at-risk of HIV infection can improve the HIV response efficiency in Ukraine. Additionally, the proposed model allows us to better understand two network features without the full network data: (a) What characteristics affect who respondents know, and (b) How is knowing someone from one group related to knowing people from other groups. These features can allow researchers to better recruit marginalized individuals into the prevention and treatment programs. Our proposed model and several existing NSUM models are implemented in the networkscaleup R package. Supplementary materials for this article are available online.
UR - http://www.scopus.com/inward/record.url?scp=85149391289&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85149391289&partnerID=8YFLogxK
U2 - 10.1080/01621459.2023.2165929
DO - 10.1080/01621459.2023.2165929
M3 - Article
C2 - 37997574
AN - SCOPUS:85149391289
SN - 0162-1459
VL - 118
SP - 1515
EP - 1524
JO - Journal of the American Statistical Association
JF - Journal of the American Statistical Association
IS - 543
ER -