Dictionary-based Sentiment Analysis at Subword Level

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

While deep learning can offer a promising approach to sentiment analysis, it often presents challenges in complexity and explainability. Alternatively, sentiment analysis based on dictionaries has been explored. In this paper, subword-level dictionaries in English are considered to address performance degradation resulting from domain mismatches. To work at subword level, a framework based on naïve bayes machine learning algorithm is exploited. Furthermore, stopwords at the subword level have been proposed to remove additional interference intrinsic to subword tokenization. Numerical experiments demonstrate that the proposed method achieves higher accuracy and F1 scores compared to the conventional dictionary-based method when there is a mismatch between the dictionary and the corpus of documents while performing marginally worse than state-of-the-art deep learning methods when applied to datasets from the same domain.

Original languageEnglish (US)
Title of host publicationProceedings of the 2024 IEEE International Conference on Industry 4.0, Artificial Intelligence, and Communications Technology, IAICT 2024
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages8-13
Number of pages6
ISBN (Electronic)9798350353464
DOIs
StatePublished - 2024
Event2024 IEEE International Conference on Industry 4.0, Artificial Intelligence, and Communications Technology, IAICT 2024 - Hybrid, Bali, Indonesia
Duration: Jul 4 2024Jul 6 2024

Publication series

NameProceedings of the 2024 IEEE International Conference on Industry 4.0, Artificial Intelligence, and Communications Technology, IAICT 2024

Conference

Conference2024 IEEE International Conference on Industry 4.0, Artificial Intelligence, and Communications Technology, IAICT 2024
Country/TerritoryIndonesia
CityHybrid, Bali
Period7/4/247/6/24

All Science Journal Classification (ASJC) codes

  • Artificial Intelligence
  • Computer Networks and Communications
  • Computer Vision and Pattern Recognition
  • Information Systems and Management
  • Safety, Risk, Reliability and Quality
  • Control and Optimization

Fingerprint

Dive into the research topics of 'Dictionary-based Sentiment Analysis at Subword Level'. Together they form a unique fingerprint.

Cite this