TY - GEN
T1 - AckSeer
T2 - 12th ACM/IEEE-CS Joint Conference on Digital Libraries, JCDL '12
AU - Khabsa, Madian
AU - Treeratpituk, Pucktada
AU - Giles, C. Lee
PY - 2012
Y1 - 2012
N2 - Acknowledgments are widely used in scientific articles to express gratitude and credit collaborators. Despite suggestions that indexing acknowledgments automatically will give interesting insights, there is currently, to the best of our knowledge, no such system to track acknowledgments and index them. In this paper we introduce AckSeer, a search engine and a repository for automatically extracted acknowledgments in digital libraries. AckSeer is a fully automated system that scans items in digital libraries including conference papers, journals, and books extracting acknowledgment sections and identifying acknowledged entities mentioned within. We describe the architecture of AckSeer and discuss the extraction algorithms that achieve a F1 measure above 83%. We use multiple Named Entity Recognition (NER) tools and propose a method for merging the outcome from different recognizers. The resulting entities are stored in a database then made searchable by adding them to the AckSeer index along with the metadata of the containing paper/book. We build AckSeer on top of the documents in CiteSeerx digital library yielding more than 500,000 acknowledgments and more than 4 million mentioned entities.
AB - Acknowledgments are widely used in scientific articles to express gratitude and credit collaborators. Despite suggestions that indexing acknowledgments automatically will give interesting insights, there is currently, to the best of our knowledge, no such system to track acknowledgments and index them. In this paper we introduce AckSeer, a search engine and a repository for automatically extracted acknowledgments in digital libraries. AckSeer is a fully automated system that scans items in digital libraries including conference papers, journals, and books extracting acknowledgment sections and identifying acknowledged entities mentioned within. We describe the architecture of AckSeer and discuss the extraction algorithms that achieve a F1 measure above 83%. We use multiple Named Entity Recognition (NER) tools and propose a method for merging the outcome from different recognizers. The resulting entities are stored in a database then made searchable by adding them to the AckSeer index along with the metadata of the containing paper/book. We build AckSeer on top of the documents in CiteSeerx digital library yielding more than 500,000 acknowledgments and more than 4 million mentioned entities.
UR - http://www.scopus.com/inward/record.url?scp=84863541932&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84863541932&partnerID=8YFLogxK
U2 - 10.1145/2232817.2232852
DO - 10.1145/2232817.2232852
M3 - Conference contribution
AN - SCOPUS:84863541932
SN - 9781450311540
T3 - Proceedings of the ACM/IEEE Joint Conference on Digital Libraries
SP - 185
EP - 194
BT - JCDL '12 - Proceedings of the 12th ACM/IEEE-CS Joint Conference on Digital Libraries
Y2 - 10 June 2012 through 14 June 2012
ER -