TY - JOUR
T1 - Using social media user profiles to identify visitor demographics and origins in Yellowstone national park
AU - Liang, Yun
AU - Yin, Junjun
AU - Park, Soyoung
AU - Pan, Bing
AU - Chi, Guangqing
AU - Miller, Zachary
N1 - Funding Information:
The Center for Social Data Analytics (C-SoDA) and the Institute for Computational and Data Sciences (ICDS) in Penn State University provided seed funding for this project. This work is also supported in part by the National Science Foundation (Award # SES-1823633 ), the USDA National Institute of Food and Agriculture and Multistate Research Project # PEN04623 (Accession #1013257), and the Eunice Kennedy Shriver National Institute of Child Health and Human Development (Award # P2C HD041025 ).
Funding Information:
The Center for Social Data Analytics (C-SoDA) and the Institute for Computational and Data Sciences (ICDS) in Penn State University provided seed funding for this project. This work is also supported in part by the National Science Foundation (Award # SES-1823633), the USDA National Institute of Food and Agriculture and Multistate Research Project #PEN04623 (Accession #1013257), and the Eunice Kennedy Shriver National Institute of Child Health and Human Development (Award #P2C HD041025).
Publisher Copyright:
© 2023 Elsevier Ltd
PY - 2023
Y1 - 2023
N2 - Despite the growing body of studies on mining visitor perceptions and attitudes of national park visitors using social media data, few research investigated user demographics and its representative issues. This study assessed visitor demographics, including gender, age, racial groups, and origins of visitors in a U.S. national park through their Twitter user profiles, and compared the results to a traditional visitor survey. The results showed similar percentages of gender groups between Twitter user profiles and the traditional survey. However, significant differences existed across all age groups and all racial groups between the two data sources. Compared to the survey, the visitors identified from social media data were younger and from more diverse race groups. The lists of the top 10 states and countries of residency of visitors from the two data sources overlapped but had different orders. The findings indicated that social media data could only be a complementary data source due to its representative issues. The results allow researchers to explore social media users’ demographics by advanced social data analytics. However, this study suggests that analyzing Twitter profile information, such as self-reported names and profile photos, requires special attention from researchers even if the data were publicly available. The authors recommend that future research should attend to the representative and private issues of social media data. Management implications: • Social media user profiles can be utilized for predicting users’ demographics, such as gender, age, and racial groups. • Social media data can only be a conplementary data source to understand visitor demographics in future research. • The ethical issues of social media data, including private domain and machine learning algorithms, need further discussion.
AB - Despite the growing body of studies on mining visitor perceptions and attitudes of national park visitors using social media data, few research investigated user demographics and its representative issues. This study assessed visitor demographics, including gender, age, racial groups, and origins of visitors in a U.S. national park through their Twitter user profiles, and compared the results to a traditional visitor survey. The results showed similar percentages of gender groups between Twitter user profiles and the traditional survey. However, significant differences existed across all age groups and all racial groups between the two data sources. Compared to the survey, the visitors identified from social media data were younger and from more diverse race groups. The lists of the top 10 states and countries of residency of visitors from the two data sources overlapped but had different orders. The findings indicated that social media data could only be a complementary data source due to its representative issues. The results allow researchers to explore social media users’ demographics by advanced social data analytics. However, this study suggests that analyzing Twitter profile information, such as self-reported names and profile photos, requires special attention from researchers even if the data were publicly available. The authors recommend that future research should attend to the representative and private issues of social media data. Management implications: • Social media user profiles can be utilized for predicting users’ demographics, such as gender, age, and racial groups. • Social media data can only be a conplementary data source to understand visitor demographics in future research. • The ethical issues of social media data, including private domain and machine learning algorithms, need further discussion.
UR - http://www.scopus.com/inward/record.url?scp=85149338772&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85149338772&partnerID=8YFLogxK
U2 - 10.1016/j.jort.2023.100620
DO - 10.1016/j.jort.2023.100620
M3 - Article
AN - SCOPUS:85149338772
SN - 2213-0780
JO - Journal of Outdoor Recreation and Tourism
JF - Journal of Outdoor Recreation and Tourism
M1 - 100620
ER -