Variable Selection for High-Dimensional Nodal Attributes in Social Networks with Degree Heterogeneity

Jia Wang, Xizhen Cai, Xiaoyue Niu, Runze Li

Research output: Contribution to journalArticlepeer-review

Abstract

We consider a class of network models, in which the connection probability depends on ultrahigh-dimensional nodal covariates (homophily) and node-specific popularity (degree heterogeneity). A Bayesian method is proposed to select nodal features in both dense and sparse networks under a mild assumption on popularity parameters. The proposed approach is implemented via Gibbs sampling. To alleviate the computational burden for large sparse networks, we further develop a working model in which parameters are updated based on a dense sub-graph at each step. Model selection consistency is established for both models, in the sense that the probability of the true model being selected converges to one asymptotically, even when the dimension grows with the network size at an exponential rate. The performance of the proposed models and estimation procedures are illustrated through Monte Carlo studies and three real world examples. Supplementary materials for this article are available online.

Original languageEnglish (US)
Pages (from-to)1322-1335
Number of pages14
JournalJournal of the American Statistical Association
Volume119
Issue number546
DOIs
StatePublished - 2024

All Science Journal Classification (ASJC) codes

  • Statistics and Probability
  • Statistics, Probability and Uncertainty

Fingerprint

Dive into the research topics of 'Variable Selection for High-Dimensional Nodal Attributes in Social Networks with Degree Heterogeneity'. Together they form a unique fingerprint.

Cite this