TY - GEN
T1 - Sparse topic models by parameter sharing
AU - Soleimani, Hossein
AU - Miller, David J.
N1 - Publisher Copyright:
© 2014 IEEE.
PY - 2014/11/14
Y1 - 2014/11/14
N2 - We propose a sparse Bayesian topic model, based on parameter sharing, for modeling text corpora. In Latent Dirichlet Allocation (LDA), each topic models all words, even though many words are not topic-specific, i.e. have similar occurrence frequencies across different topics. We propose a sparser approach by introducing a universal shared model, used by each topic to model the subset of words that are not topic-specific. A Bernoulli random variable is associated with each word under every topic, determining whether that word is modeled topic-specifically, with a free parameter, or by the shared model, with a common parameter. Results of our experiments show that our model achieves sparser topic presence in documents and higher test likelihood than LDA.
AB - We propose a sparse Bayesian topic model, based on parameter sharing, for modeling text corpora. In Latent Dirichlet Allocation (LDA), each topic models all words, even though many words are not topic-specific, i.e. have similar occurrence frequencies across different topics. We propose a sparser approach by introducing a universal shared model, used by each topic to model the subset of words that are not topic-specific. A Bernoulli random variable is associated with each word under every topic, determining whether that word is modeled topic-specifically, with a free parameter, or by the shared model, with a common parameter. Results of our experiments show that our model achieves sparser topic presence in documents and higher test likelihood than LDA.
UR - http://www.scopus.com/inward/record.url?scp=84912553417&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84912553417&partnerID=8YFLogxK
U2 - 10.1109/MLSP.2014.6958911
DO - 10.1109/MLSP.2014.6958911
M3 - Conference contribution
AN - SCOPUS:84912553417
T3 - IEEE International Workshop on Machine Learning for Signal Processing, MLSP
BT - IEEE International Workshop on Machine Learning for Signal Processing, MLSP
A2 - Mboup, Mamadou
A2 - Adali, Tulay
A2 - Moreau, Eric
A2 - Larsen, Jan
PB - IEEE Computer Society
T2 - 2014 24th IEEE International Workshop on Machine Learning for Signal Processing, MLSP 2014
Y2 - 21 September 2014 through 24 September 2014
ER -