TY - GEN
T1 - Search engine driven author disambiguation
AU - Tan, Yee Fan
AU - Kan, Min Yen
AU - Lee, Dongwon
PY - 2006
Y1 - 2006
N2 - In scholarly digital libraries, author disambiguation is an important task that attributes a scholarly work to specific authors. This is critical when individuals share the same name. We present an approach to this task that analyzes the results of automatically crafted web searches. A key observation is that pages from rare web sites are a stronger source of evidence than pages from common web sites, which we model as Inverse Host Frequency (IHF). Our system is able to achieve an average accuracy of 0.836.
AB - In scholarly digital libraries, author disambiguation is an important task that attributes a scholarly work to specific authors. This is critical when individuals share the same name. We present an approach to this task that analyzes the results of automatically crafted web searches. A key observation is that pages from rare web sites are a stronger source of evidence than pages from common web sites, which we model as Inverse Host Frequency (IHF). Our system is able to achieve an average accuracy of 0.836.
UR - http://www.scopus.com/inward/record.url?scp=34247201022&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=34247201022&partnerID=8YFLogxK
U2 - 10.1145/1141753.1141826
DO - 10.1145/1141753.1141826
M3 - Conference contribution
AN - SCOPUS:34247201022
SN - 1595933549
SN - 9781595933546
T3 - Proceedings of the ACM/IEEE Joint Conference on Digital Libraries
SP - 314
EP - 315
BT - 6th ACM/IEEE-CS Joint Conference on Digital Libraries 2006
T2 - 6th ACM/IEEE-CS Joint Conference on Digital Libraries 2006: Opening Information Horizons, JCDL '06
Y2 - 11 June 2006 through 15 June 2006
ER -