TY - GEN
T1 - Time-series analysis of blog and metaphor dynamics for event detection
AU - Goode, Brian J.
AU - Reyes, Juan Ignacio M.
AU - Pardo-Yepez, Daniela R.
AU - Canale, Gabriel L.
AU - Tong, Richard M.
AU - Mares, David
AU - Roan, Michael
AU - Ramakrishnan, Naren
N1 - Publisher Copyright:
© Springer International Publishing Switzerland 2017.
PY - 2017
Y1 - 2017
N2 - Open source indicators (OSI) like social media are useful for detecting and forecasting the onset and progression of political events and mass movements such as elections and civil unrest. Recent work has led us to analyze metaphor usage in Latin American blogs to model such events. In addition to being rich in metaphorical usage, these data sources are heterogeneous with respect to their time-series behavior in terms of publication frequency and metaphor occurrence that make relative comparisons across sources difficult. We hypothesize that understanding these non-normal behaviors is a compulsory step toward improving analysis and forecasting ability. In this work, we discuss our blog data set in detail, and dissect the data along several key characteristics such as blog publication frequency, length, and metaphor usage. In particular, we focus on occurrence clustering: modeling variations in the incidence of both metaphors and blogs over time. We describe these variations in terms of the shape parameters of distributions estimated using maximum likelihood methods. We conclude that although there may be no “characteristic” behavior in the heterogeneity of the sources, we can form groups of blogs with similar behaviors to improve detection ability.
AB - Open source indicators (OSI) like social media are useful for detecting and forecasting the onset and progression of political events and mass movements such as elections and civil unrest. Recent work has led us to analyze metaphor usage in Latin American blogs to model such events. In addition to being rich in metaphorical usage, these data sources are heterogeneous with respect to their time-series behavior in terms of publication frequency and metaphor occurrence that make relative comparisons across sources difficult. We hypothesize that understanding these non-normal behaviors is a compulsory step toward improving analysis and forecasting ability. In this work, we discuss our blog data set in detail, and dissect the data along several key characteristics such as blog publication frequency, length, and metaphor usage. In particular, we focus on occurrence clustering: modeling variations in the incidence of both metaphors and blogs over time. We describe these variations in terms of the shape parameters of distributions estimated using maximum likelihood methods. We conclude that although there may be no “characteristic” behavior in the heterogeneity of the sources, we can form groups of blogs with similar behaviors to improve detection ability.
UR - https://www.scopus.com/pages/publications/84986186183
UR - https://www.scopus.com/inward/citedby.url?scp=84986186183&partnerID=8YFLogxK
U2 - 10.1007/978-3-319-41636-6_2
DO - 10.1007/978-3-319-41636-6_2
M3 - Conference contribution
AN - SCOPUS:84986186183
SN - 9783319416359
T3 - Advances in Intelligent Systems and Computing
SP - 17
EP - 27
BT - Advances in Cross-Cultural Decision Making - Proceedings of the AHFE International Conference on Cross-Cultural Decision Making, CCDM 2016
A2 - Hoffman, Mark
A2 - Schatz, Sae
PB - Springer Verlag
T2 - International Conference on Cross Cultural Decision Making, CCDM 2016
Y2 - 27 July 2016 through 31 July 2016
ER -