TY - JOUR
T1 - Variable Selection for High-Dimensional Nodal Attributes in Social Networks with Degree Heterogeneity
AU - Wang, Jia
AU - Cai, Xizhen
AU - Niu, Xiaoyue
AU - Li, Runze
N1 - Publisher Copyright:
© 2023 American Statistical Association.
PY - 2024
Y1 - 2024
N2 - We consider a class of network models, in which the connection probability depends on ultrahigh-dimensional nodal covariates (homophily) and node-specific popularity (degree heterogeneity). A Bayesian method is proposed to select nodal features in both dense and sparse networks under a mild assumption on popularity parameters. The proposed approach is implemented via Gibbs sampling. To alleviate the computational burden for large sparse networks, we further develop a working model in which parameters are updated based on a dense sub-graph at each step. Model selection consistency is established for both models, in the sense that the probability of the true model being selected converges to one asymptotically, even when the dimension grows with the network size at an exponential rate. The performance of the proposed models and estimation procedures are illustrated through Monte Carlo studies and three real world examples. Supplementary materials for this article are available online.
AB - We consider a class of network models, in which the connection probability depends on ultrahigh-dimensional nodal covariates (homophily) and node-specific popularity (degree heterogeneity). A Bayesian method is proposed to select nodal features in both dense and sparse networks under a mild assumption on popularity parameters. The proposed approach is implemented via Gibbs sampling. To alleviate the computational burden for large sparse networks, we further develop a working model in which parameters are updated based on a dense sub-graph at each step. Model selection consistency is established for both models, in the sense that the probability of the true model being selected converges to one asymptotically, even when the dimension grows with the network size at an exponential rate. The performance of the proposed models and estimation procedures are illustrated through Monte Carlo studies and three real world examples. Supplementary materials for this article are available online.
UR - http://www.scopus.com/inward/record.url?scp=85152893772&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85152893772&partnerID=8YFLogxK
U2 - 10.1080/01621459.2023.2187815
DO - 10.1080/01621459.2023.2187815
M3 - Article
C2 - 39184838
AN - SCOPUS:85152893772
SN - 0162-1459
VL - 119
SP - 1322
EP - 1335
JO - Journal of the American Statistical Association
JF - Journal of the American Statistical Association
IS - 546
ER -