TY - GEN
T1 - JNet
T2 - 13th ACM International Conference on Web Search and Data Mining, WSDM 2020
AU - Gong, Lin
AU - Lin, Lu
AU - Song, Weihao
AU - Wang, Hongning
N1 - Publisher Copyright:
© 2020 Association for Computing Machinery.
PY - 2020/1/20
Y1 - 2020/1/20
N2 - User representation learning is vital to capture diverse user preferences, while it is also challenging as user intents are latent and scattered among complex and different modalities of user-generated data, thus, not directly measurable. Inspired by the concept of user schema in social psychology, we take a new perspective to perform user representation learning by constructing a shared latent space to capture the dependency among different modalities of user-generated data. Both users and topics are embedded to the same space to encode users’ social connections and text content, to facilitate joint modeling of different modalities, via a probabilistic generative framework. We evaluated the proposed solution on large collections of Yelp reviews and StackOverflow discussion posts, with their associated network structures. The proposed model outperformed several state-of-the-art topic modeling based user models with better predictive power in unseen documents, and state-of-the-art network embedding based user models with improved link prediction quality in unseen nodes. The learnt user representations are also proved to be useful in content recommendation, e.g., expert finding in StackOverflow.
AB - User representation learning is vital to capture diverse user preferences, while it is also challenging as user intents are latent and scattered among complex and different modalities of user-generated data, thus, not directly measurable. Inspired by the concept of user schema in social psychology, we take a new perspective to perform user representation learning by constructing a shared latent space to capture the dependency among different modalities of user-generated data. Both users and topics are embedded to the same space to encode users’ social connections and text content, to facilitate joint modeling of different modalities, via a probabilistic generative framework. We evaluated the proposed solution on large collections of Yelp reviews and StackOverflow discussion posts, with their associated network structures. The proposed model outperformed several state-of-the-art topic modeling based user models with better predictive power in unseen documents, and state-of-the-art network embedding based user models with improved link prediction quality in unseen nodes. The learnt user representations are also proved to be useful in content recommendation, e.g., expert finding in StackOverflow.
UR - http://www.scopus.com/inward/record.url?scp=85079547450&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85079547450&partnerID=8YFLogxK
U2 - 10.1145/3336191.3371770
DO - 10.1145/3336191.3371770
M3 - Conference contribution
AN - SCOPUS:85079547450
T3 - WSDM 2020 - Proceedings of the 13th International Conference on Web Search and Data Mining
SP - 205
EP - 213
BT - WSDM 2020 - Proceedings of the 13th International Conference on Web Search and Data Mining
PB - Association for Computing Machinery, Inc
Y2 - 3 February 2020 through 7 February 2020
ER -