Prediction of likes and retweets using text information retrieval

Ishita Daga, Anchal Gupta, Raj Vardhan, Partha Mukherjee

Research output: Contribution to journalConference articlepeer-review

18 Scopus citations


Twitter is one of the major social media platforms today to study human behaviours by analysing their interactions. To ensure popularity of the tweet, the focus should be on the content of the tweet that results in numerous followings of that message with sufficient number of likes and retweets. The high quality of tweets, increases the online reputation of the users who post it. If a user can get the prediction of likes and retweets on his text before posting it on the internet, it would improve the popularity of the tweet from information sharing perspective. In this paper we employed different machine learning classifiers like SVM, Naïve Bayes, Logistic Regression, Random Forest, and Neural Network, on top of two different text processing approaches used in NLP (natural language processing), namely bag-of-words (TFIDF) and word embeddings (Doc2Vec), to check how many likes and retweets can a tweet generate. The results obtained indicate that all the models performed 10-15% better with the bagof-word technique.

Original languageEnglish (US)
Pages (from-to)123-128
Number of pages6
JournalProcedia Computer Science
StatePublished - 2020
Event2020 Complex Adaptive Systems Conference, CAS 2019 - Malvern, United States
Duration: Nov 13 2019Nov 15 2019

All Science Journal Classification (ASJC) codes

  • General Computer Science


Dive into the research topics of 'Prediction of likes and retweets using text information retrieval'. Together they form a unique fingerprint.

Cite this