TY - GEN
T1 - Measuring opinion relevance in latent topic space
AU - Cheng, Wei
AU - Ni, Xiaochuan
AU - Sun, Jian Tao
AU - Jin, Xiaoming
AU - Kum, Hye Chung
AU - Zhang, Xiang
AU - Wang, Wei
N1 - Copyright:
Copyright 2012 Elsevier B.V., All rights reserved.
PY - 2011
Y1 - 2011
N2 - Opinion retrieval engines aim to retrieve documents containing user opinions towards a given search query. Different from traditional IR engines which rank documents by their topic relevance to the search query, opinion retrieval engines also consider opinion relevance. The result documents should contain user opinions which should be relevant to the search query. In previous opinion retrieval algorithms, opinion relevance scores are usually calculated by using very straightforward approaches, e.g., the distance between search query and opinion-carrying words. These approaches may cause two problems: 1) opinions in the returned result documents are irrelevant to the search query; 2) opinions related to the search query are not well identified. In this paper, we propose a new approach to deal with this topicopinion mismatch problem. We leverage the idea of Probabilistic Latent Semantic Analysis. Both queries and documents are represented in a latent topic space, and then opinion relevance is calculated semantically in this topic space. Experiments on the TREC blog datasets indicate that our approach is effective in measuring opinion relevance and the opinion retrieval system based on our algorithm yields significant improvements compared with most state-of-the-art methods.
AB - Opinion retrieval engines aim to retrieve documents containing user opinions towards a given search query. Different from traditional IR engines which rank documents by their topic relevance to the search query, opinion retrieval engines also consider opinion relevance. The result documents should contain user opinions which should be relevant to the search query. In previous opinion retrieval algorithms, opinion relevance scores are usually calculated by using very straightforward approaches, e.g., the distance between search query and opinion-carrying words. These approaches may cause two problems: 1) opinions in the returned result documents are irrelevant to the search query; 2) opinions related to the search query are not well identified. In this paper, we propose a new approach to deal with this topicopinion mismatch problem. We leverage the idea of Probabilistic Latent Semantic Analysis. Both queries and documents are represented in a latent topic space, and then opinion relevance is calculated semantically in this topic space. Experiments on the TREC blog datasets indicate that our approach is effective in measuring opinion relevance and the opinion retrieval system based on our algorithm yields significant improvements compared with most state-of-the-art methods.
UR - http://www.scopus.com/inward/record.url?scp=84862954953&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84862954953&partnerID=8YFLogxK
U2 - 10.1109/PASSAT/SocialCom.2011.45
DO - 10.1109/PASSAT/SocialCom.2011.45
M3 - Conference contribution
AN - SCOPUS:84862954953
SN - 9780769545783
T3 - Proceedings - 2011 IEEE International Conference on Privacy, Security, Risk and Trust and IEEE International Conference on Social Computing, PASSAT/SocialCom 2011
SP - 323
EP - 330
BT - Proceedings - 2011 IEEE International Conference on Privacy, Security, Risk and Trust and IEEE International Conference on Social Computing, PASSAT/SocialCom 2011
T2 - 2011 IEEE International Conference on Privacy, Security, Risk and Trust, PASSAT 2011 and 2011 IEEE International Conference on Social Computing, SocialCom 2011
Y2 - 9 October 2011 through 11 October 2011
ER -