TY - GEN
T1 - Construction and first analysis of a corpus for the evaluation and training of microblog/twitter geoparsers
AU - Wallgrün, Jan Oliver
AU - Hardisty, Frank
AU - MacEachren, Alan M.
AU - Karimzadeh, Morteza
AU - Ju, Yiting
AU - Pezanowski, Scott
N1 - Publisher Copyright: Copyright 2014 ACM.
PY - 2014/11/4
Y1 - 2014/11/4
N2 - This article presents an approach to place reference corpus building and application of the approach to a Geo-Microblog Corpus that will foster research and development in the areas of microblog/Twitter geoparsing and geographic information retrieval. Our corpus currently consists of 6000 tweets with identified and georeferenced place names. Of these tweets, 30% contain at least one place name. The corpus is intended to support the evaluation, comparison, and training of geoparsers. We introduce our corpus building framework, which is developed to be generally applicable beyond microblogs, and explain how we use crowdsourcing and geovisual analytics technology to support the construction of relatively large corpora. We then report on the corpus building work and present an analysis of causes of disagreement between the lay persons performing place identification in our crowdsourcing approach.
AB - This article presents an approach to place reference corpus building and application of the approach to a Geo-Microblog Corpus that will foster research and development in the areas of microblog/Twitter geoparsing and geographic information retrieval. Our corpus currently consists of 6000 tweets with identified and georeferenced place names. Of these tweets, 30% contain at least one place name. The corpus is intended to support the evaluation, comparison, and training of geoparsers. We introduce our corpus building framework, which is developed to be generally applicable beyond microblogs, and explain how we use crowdsourcing and geovisual analytics technology to support the construction of relatively large corpora. We then report on the corpus building work and present an analysis of causes of disagreement between the lay persons performing place identification in our crowdsourcing approach.
UR - http://www.scopus.com/inward/record.url?scp=84942429201&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84942429201&partnerID=8YFLogxK
U2 - 10.1145/2675354.2675701
DO - 10.1145/2675354.2675701
M3 - Conference contribution
AN - SCOPUS:84942429201
T3 - Proceedings of the 8th Workshop on Geographic Information Retrieval, GIR 2014
BT - Proceedings of the 8th Workshop on Geographic Information Retrieval, GIR 2014
A2 - Purves, Ross S.
A2 - Jones, Christopher B.
PB - Association for Computing Machinery
T2 - 8th Workshop on Geographic Information Retrieval, GIR 2014
Y2 - 4 November 2014 through 7 November 2014
ER -