Campus Sentiment Analysis with GAN-based Data Augmentation

Yu Shang, Xiaohui Su, Zhifeng Xiao, Zidong Chen

Research output: Chapter in Book/Report/Conference proceedingConference contribution

2 Scopus citations

Abstract

Recent advances have seen the rapid development of online media, which offers various communication channels for people to express opinions. The fast-growing social media platforms have generated tremendous data that can be transformed into business and social value through modern machine learning algorithms. One of the crucial learning tasks is sentiment analysis, which refers to identifying the tendency of subjective information in an expression. Prior efforts in sentiment classification have explored a broad spectrum of methods that have achieved impressive performance gains from the predictive modeling perspective. However, the potential of data augmentation has not been sufficiently explored in this task, and we aim to fill this gap. This study proposes a novel sentiment analysis framework powered by two performance boosters, including a data augmentation method based on a transformer-based generative adversarial network (GAN) and RoBERTa, a robustly optimized Bidirectional Encoder Representations from Transformers (BERT) pretraining model, to improve the prediction accuracy from both the data and the model side. We conduct extensive experiments on a campus sentiment classification dataset. We show that the GAN-based data augmentation method can generate high-quality synthetic samples to increase the size and diversity of the training set. Compared with other model options and augmentation methods, the RoBERTa model enhanced by transformer-based GAN (TransGAN) presents a superior performance in prediction accuracy, validating the efficacy of the proposed framework.

Original languageEnglish (US)
Title of host publication2021 13th International Conference on Advanced Infocomm Technology, ICAIT 2021
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages209-214
Number of pages6
ISBN (Electronic)9781665431880
DOIs
StatePublished - 2021
Event13th International Conference on Advanced Infocomm Technology, ICAIT 2021 - Yanji, China
Duration: Oct 15 2021Oct 18 2021

Publication series

Name2021 13th International Conference on Advanced Infocomm Technology, ICAIT 2021

Conference

Conference13th International Conference on Advanced Infocomm Technology, ICAIT 2021
Country/TerritoryChina
CityYanji
Period10/15/2110/18/21

All Science Journal Classification (ASJC) codes

  • Artificial Intelligence
  • Computer Networks and Communications
  • Computer Science Applications
  • Signal Processing
  • Health Informatics
  • Instrumentation

Fingerprint

Dive into the research topics of 'Campus Sentiment Analysis with GAN-based Data Augmentation'. Together they form a unique fingerprint.

Cite this