TY - GEN
T1 - Sentiment and topic analysis on social media
T2 - 3rd Annual ACM Web Science Conference, WebSci 2013
AU - Huang, Shu
AU - Peng, Wei
AU - Li, Jingxuan
AU - Lee, Dongwon
PY - 2013
Y1 - 2013
N2 - Both sentiment analysis and topic classification are frequently used in customer care and marketing. They can help people understand the brand perception and customer opinions from social media, such as online posts, tweets, forums, and blogs. As such, in recent years, many solutions have been proposed for both tasks. However, we believe that the following two problems have not been addressed adequately: (1) Conventional solutions usually treat the two tasks in isolation. When the two tasks are closely related (e.g., posts about "customer care" often have a "negative" tone), exploring their correlation may yield a better accuracy; (2) Each post is usually assigned with only one sentiment label and one topic label. Since social media is, compared to traditional document corpus, more noisy, ambiguous, and sparser, single label classification may not be able to capture the post classes accurately. To address these two problems, in this paper, we propose a multi-task multi-label (MTML) classification model that performs classification of both sentiments and topics concurrently. It incorporates results of each task from prior steps to promote and reinforce the other iteratively. For each task, the model is trained with multiple labels so that they can help address class ambiguity. In the empirical validation, we compare the accuracy of MTML model against four competing methods in two different settings. Results show that MTML produces a much higher accuracy of both sentiment and topic classifications.
AB - Both sentiment analysis and topic classification are frequently used in customer care and marketing. They can help people understand the brand perception and customer opinions from social media, such as online posts, tweets, forums, and blogs. As such, in recent years, many solutions have been proposed for both tasks. However, we believe that the following two problems have not been addressed adequately: (1) Conventional solutions usually treat the two tasks in isolation. When the two tasks are closely related (e.g., posts about "customer care" often have a "negative" tone), exploring their correlation may yield a better accuracy; (2) Each post is usually assigned with only one sentiment label and one topic label. Since social media is, compared to traditional document corpus, more noisy, ambiguous, and sparser, single label classification may not be able to capture the post classes accurately. To address these two problems, in this paper, we propose a multi-task multi-label (MTML) classification model that performs classification of both sentiments and topics concurrently. It incorporates results of each task from prior steps to promote and reinforce the other iteratively. For each task, the model is trained with multiple labels so that they can help address class ambiguity. In the empirical validation, we compare the accuracy of MTML model against four competing methods in two different settings. Results show that MTML produces a much higher accuracy of both sentiment and topic classifications.
UR - http://www.scopus.com/inward/record.url?scp=84883117557&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84883117557&partnerID=8YFLogxK
U2 - 10.1145/2464464.2464512
DO - 10.1145/2464464.2464512
M3 - Conference contribution
AN - SCOPUS:84883117557
SN - 9781450318891
T3 - Proceedings of the 5th Annual ACM Web Science Conference, WebSci'13
SP - 172
EP - 181
BT - Proceedings of the 5th Annual ACM Web Science Conference, WebSci'13
PB - Association for Computing Machinery
Y2 - 2 May 2013 through 4 May 2013
ER -