TY - JOUR
T1 - Geographic context-aware text mining
T2 - enhance social media message classification for situational awareness by integrating spatial and temporal features
AU - Scheele, Christopher
AU - Yu, Manzhu
AU - Huang, Qunying
N1 - Publisher Copyright:
© 2021 Informa UK Limited, trading as Taylor & Francis Group. The International Journal of Digital Earth is an Official Journal of the International Society for Digital Earth.
PY - 2021
Y1 - 2021
N2 - To find disaster relevant social media messages, current approaches utilize natural language processing methods or machine learning algorithms relying on text only, which have not been perfected due to the variability and uncertainty in the language used on social media and ignoring the geographic context of the messages when posted. Meanwhile, a disaster relevant social media message is highly sensitive to its posting location and time. However, limited studies exist to explore what spatial features and the extent of how temporal, and especially spatial features can aid text classification. This paper proposes a geographic context-aware text mining method to incorporate spatial and temporal information derived from social media and authoritative datasets, along with the text information, for classifying disaster relevant social media posts. This work designed and demonstrated how diverse types of spatial and temporal features can be derived from spatial data, and then used to enhance text mining. The deep learning-based method and commonly used machine learning algorithms, assessed the accuracy of the enhanced text-mining method. The performance results of different classification models generated by various combinations of textual, spatial, and temporal features indicate that additional spatial and temporal features help improve the overall accuracy of the classification.
AB - To find disaster relevant social media messages, current approaches utilize natural language processing methods or machine learning algorithms relying on text only, which have not been perfected due to the variability and uncertainty in the language used on social media and ignoring the geographic context of the messages when posted. Meanwhile, a disaster relevant social media message is highly sensitive to its posting location and time. However, limited studies exist to explore what spatial features and the extent of how temporal, and especially spatial features can aid text classification. This paper proposes a geographic context-aware text mining method to incorporate spatial and temporal information derived from social media and authoritative datasets, along with the text information, for classifying disaster relevant social media posts. This work designed and demonstrated how diverse types of spatial and temporal features can be derived from spatial data, and then used to enhance text mining. The deep learning-based method and commonly used machine learning algorithms, assessed the accuracy of the enhanced text-mining method. The performance results of different classification models generated by various combinations of textual, spatial, and temporal features indicate that additional spatial and temporal features help improve the overall accuracy of the classification.
UR - http://www.scopus.com/inward/record.url?scp=85113750319&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85113750319&partnerID=8YFLogxK
U2 - 10.1080/17538947.2021.1968048
DO - 10.1080/17538947.2021.1968048
M3 - Article
AN - SCOPUS:85113750319
SN - 1753-8947
VL - 14
SP - 1721
EP - 1743
JO - International Journal of Digital Earth
JF - International Journal of Digital Earth
IS - 11
ER -