TY - GEN
T1 - Automatic searching of tables in digital libraries
AU - Liu, Ying
AU - Bai, Kun
AU - Mitra, Prasenjit
AU - Giles, C. Lee
PY - 2007
Y1 - 2007
N2 - Tables are ubiquitous. Unfortunately, no search engine supports table search. In this paper, we propose a novel table specific searching engine, TableSeer, to facilitate the table extracting, indexing, searching, and sharing. In addition, we propose an extensive set of medium-independent metadata to precisely present tables. Given a query, TableSeer ranks the returned results using an innovative ranking algorithm - TableRank with a tailored vector space model and a novel term weighting scheme. Experimental results show that TableSeer outperforms existing search engines on table search. In addition, incorporating multiple weighting factors can significantly improve the ranking results.
AB - Tables are ubiquitous. Unfortunately, no search engine supports table search. In this paper, we propose a novel table specific searching engine, TableSeer, to facilitate the table extracting, indexing, searching, and sharing. In addition, we propose an extensive set of medium-independent metadata to precisely present tables. Given a query, TableSeer ranks the returned results using an innovative ranking algorithm - TableRank with a tailored vector space model and a novel term weighting scheme. Experimental results show that TableSeer outperforms existing search engines on table search. In addition, incorporating multiple weighting factors can significantly improve the ranking results.
UR - http://www.scopus.com/inward/record.url?scp=35348895037&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=35348895037&partnerID=8YFLogxK
U2 - 10.1145/1242572.1242732
DO - 10.1145/1242572.1242732
M3 - Conference contribution
AN - SCOPUS:35348895037
SN - 1595936548
SN - 9781595936547
T3 - 16th International World Wide Web Conference, WWW2007
SP - 1135
EP - 1136
BT - 16th International World Wide Web Conference, WWW2007
T2 - 16th International World Wide Web Conference, WWW2007
Y2 - 8 May 2007 through 12 May 2007
ER -