TY - JOUR
T1 - Building a social media-based HIV risk behavior index to inform the prediction of HIV new diagnosis
T2 - A feasibility study
AU - Li, Zhenlong
AU - Qiao, Shan
AU - Jiang, Yuqin
AU - Li, Xiaoming
N1 - Publisher Copyright:
© Copyright 2021 Wolters Kluwer Health, Inc. All rights reserved.
PY - 2021/5/1
Y1 - 2021/5/1
N2 - Objective:Analysis of geolocation-based social media Big Data provides unprecedented opportunities for a broad range of domains including health as health is intrinsically linked to the geographic characteristics of places. HIV infection is largely driven by HIV risk behaviors, such as unsafe sexual behavior and drug abuse/addiction. This study explores the feasibility of building a Social media-based HIV Risk Behavior (SRB) index at the United States county level for informing HIV surveillance and prevention, considering social determinants of health and geographic locations.Methods:The SRB index, defined as the proportion of risk behavior related Twitter users among all Twitter users, was calculated at the county level for each year. To evaluate the performance of the new SRB index, the relationships between the county-level SRB and rate of new HIV diagnoses from AIDSVu were analyzed using multivariate regression while simultaneously considering five socioeconomic status (SES) factors (percentage uninsured, median household income, Gini coefficient, percentage living in poverty, percentage high school graduates) in the model. Moran's I and geographically weighted regression analyses (GWR) were leveraged to examine spatial autocorrelations and reveal the potential spatial heterogeneity (geographical variability) of the associations.Results:County-level multivariate regression results revealed that SRB has the strongest association with new HIV diagnosis rate (r > 0.36; P < 0.0001) in both years compared with the five SES factors. Hierarchical regression analysis suggested that the SRB index explains significant additional variance in addition to the five SES factors. The results from GWR analysis not only greatly improved the model explanation power (bringing the adjusted r-square from 0.25 to 0.47 in 2016 and 0.26 to 0.55 in 2017) but also revealed SRB index is the most spatially consistent measurement compared with the five SES factors in terms of impact direction (negative or positive correlation).Conclusion:It is feasible to build a social media-based HIV risk behavior index (SRB) as a new indicator for HIV surveillance at county level. The SRB index improves the regression model explanation power of new HIV diagnosis by providing additional information beyond the traditional social determinant measures, such as SES indicators. SRB index will allow researchers to utilize data captured within existing social media platforms to better understand the geospatial patterns of HIV risk behavior and to inform population-based HIV surveillance and other efforts of HIV prevention and control.
AB - Objective:Analysis of geolocation-based social media Big Data provides unprecedented opportunities for a broad range of domains including health as health is intrinsically linked to the geographic characteristics of places. HIV infection is largely driven by HIV risk behaviors, such as unsafe sexual behavior and drug abuse/addiction. This study explores the feasibility of building a Social media-based HIV Risk Behavior (SRB) index at the United States county level for informing HIV surveillance and prevention, considering social determinants of health and geographic locations.Methods:The SRB index, defined as the proportion of risk behavior related Twitter users among all Twitter users, was calculated at the county level for each year. To evaluate the performance of the new SRB index, the relationships between the county-level SRB and rate of new HIV diagnoses from AIDSVu were analyzed using multivariate regression while simultaneously considering five socioeconomic status (SES) factors (percentage uninsured, median household income, Gini coefficient, percentage living in poverty, percentage high school graduates) in the model. Moran's I and geographically weighted regression analyses (GWR) were leveraged to examine spatial autocorrelations and reveal the potential spatial heterogeneity (geographical variability) of the associations.Results:County-level multivariate regression results revealed that SRB has the strongest association with new HIV diagnosis rate (r > 0.36; P < 0.0001) in both years compared with the five SES factors. Hierarchical regression analysis suggested that the SRB index explains significant additional variance in addition to the five SES factors. The results from GWR analysis not only greatly improved the model explanation power (bringing the adjusted r-square from 0.25 to 0.47 in 2016 and 0.26 to 0.55 in 2017) but also revealed SRB index is the most spatially consistent measurement compared with the five SES factors in terms of impact direction (negative or positive correlation).Conclusion:It is feasible to build a social media-based HIV risk behavior index (SRB) as a new indicator for HIV surveillance at county level. The SRB index improves the regression model explanation power of new HIV diagnosis by providing additional information beyond the traditional social determinant measures, such as SES indicators. SRB index will allow researchers to utilize data captured within existing social media platforms to better understand the geospatial patterns of HIV risk behavior and to inform population-based HIV surveillance and other efforts of HIV prevention and control.
UR - http://www.scopus.com/inward/record.url?scp=85105772142&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85105772142&partnerID=8YFLogxK
U2 - 10.1097/QAD.0000000000002787
DO - 10.1097/QAD.0000000000002787
M3 - Article
C2 - 33867492
AN - SCOPUS:85105772142
SN - 0269-9370
VL - 35
SP - S91-S99
JO - AIDS
JF - AIDS
ER -