TY - GEN
T1 - Topic modeling and sentiment analysis of social media data to drive experiential redesign
AU - Song, Binyang
AU - Meinzer, Emmett
AU - Agrawal, Akash
AU - McComb, Christopher
N1 - Funding Information:
The authors are grateful for the support of the Penn State CERI REU program. This work was also supported by the Defense Advanced Research Projects Agency through cooperative agreement N66001-17-4064. Any opinions, findings, conclusions, or recommendations expressed in this paper are those of the authors and do not necessarily reflect the views of the sponsors.
Publisher Copyright:
© 2020 American Society of Mechanical Engineers (ASME). All rights reserved.
PY - 2020
Y1 - 2020
N2 - The elicitation of customer pain points is a crucial early step in the design or redesign of successful products and services. Online, user-generated data contains rich, real-time information about customer experience, requirements, and preferences. However, it is a nontrivial task to retrieve useful information from these sources because of the sheer amount of data, often unstructured. In this work, we build on previous efforts that used natural language processing techniques to extract meaning from online data and facilitate experiential redesign and extend them by integrating a sentiment analysis. As a use case, we explore the airline industry. A considerable portion of potential passengers opt out of traveling by airplane due to aviophobia, a fear of flying. This causes a market loss to the industry and inconvenience for those who experience aviophobia. The potential contributors to aviophobia are complex and diverse, involving physical, psychological and emotional reactions to the air travel experience. A methodology that is capable of accommodating the complexity and diversity of the commercial airline industry user-generated data is necessary to effectively mine customer pain points. To address the demand, we propose a novel methodology in this study. Using passenger commentary data posted on Reddit, the method implements topic modeling to extract common themes from the commentaries and employs sentiment analysis to elicit and interpret the salient information contained in the extracted themes. This paper ends by providing specific recommendations that are germane to the use case as well as suggesting future research directions.
AB - The elicitation of customer pain points is a crucial early step in the design or redesign of successful products and services. Online, user-generated data contains rich, real-time information about customer experience, requirements, and preferences. However, it is a nontrivial task to retrieve useful information from these sources because of the sheer amount of data, often unstructured. In this work, we build on previous efforts that used natural language processing techniques to extract meaning from online data and facilitate experiential redesign and extend them by integrating a sentiment analysis. As a use case, we explore the airline industry. A considerable portion of potential passengers opt out of traveling by airplane due to aviophobia, a fear of flying. This causes a market loss to the industry and inconvenience for those who experience aviophobia. The potential contributors to aviophobia are complex and diverse, involving physical, psychological and emotional reactions to the air travel experience. A methodology that is capable of accommodating the complexity and diversity of the commercial airline industry user-generated data is necessary to effectively mine customer pain points. To address the demand, we propose a novel methodology in this study. Using passenger commentary data posted on Reddit, the method implements topic modeling to extract common themes from the commentaries and employs sentiment analysis to elicit and interpret the salient information contained in the extracted themes. This paper ends by providing specific recommendations that are germane to the use case as well as suggesting future research directions.
UR - http://www.scopus.com/inward/record.url?scp=85096311303&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85096311303&partnerID=8YFLogxK
U2 - 10.1115/DETC2020-22567
DO - 10.1115/DETC2020-22567
M3 - Conference contribution
AN - SCOPUS:85096311303
T3 - Proceedings of the ASME Design Engineering Technical Conference
BT - 46th Design Automation Conference (DAC)
PB - American Society of Mechanical Engineers (ASME)
T2 - ASME 2020 International Design Engineering Technical Conferences and Computers and Information in Engineering Conference, IDETC-CIE 2020
Y2 - 17 August 2020 through 19 August 2020
ER -