TY - GEN
T1 - TableRank
T2 - AAAI-07/IAAI-07 Proceedings: 22nd AAAI Conference on Artificial Intelligence and the 19th Innovative Applications of Artificial Intelligence Conference
AU - Liu, Ying
AU - Bai, Kun
AU - Mitra, Prasenjit
AU - Giles, C. Lee
PY - 2007
Y1 - 2007
N2 - Tables are ubiquitous in web pages and scientific documents. With the explosive development of the web, tables have become a valuable information repository. Therefore, effectively and efficiently searching tables becomes a challenge. Existing search engines do not provide satisfactory search results largely because the current ranking schemes are inadequate for table search and automatic table understanding and extraction are rather difficult in general. In this work, we design and evaluate a novel table ranking algorithm - TableRank to improve the performance of our table search engine TableSeer. Given a keyword based table query, TableRank facilities TableSeer to return the most relevant tables by tailoring the classic vector space model. TableRank adopts an innovative term weighting scheme by aggregating multiple weighting factors from three levels: term, table and document. The experimental results show that our table search engine outperforms existing search engines on table search. In addition, incorporating multiple weighting factors can significantly improve the ranking results.
AB - Tables are ubiquitous in web pages and scientific documents. With the explosive development of the web, tables have become a valuable information repository. Therefore, effectively and efficiently searching tables becomes a challenge. Existing search engines do not provide satisfactory search results largely because the current ranking schemes are inadequate for table search and automatic table understanding and extraction are rather difficult in general. In this work, we design and evaluate a novel table ranking algorithm - TableRank to improve the performance of our table search engine TableSeer. Given a keyword based table query, TableRank facilities TableSeer to return the most relevant tables by tailoring the classic vector space model. TableRank adopts an innovative term weighting scheme by aggregating multiple weighting factors from three levels: term, table and document. The experimental results show that our table search engine outperforms existing search engines on table search. In addition, incorporating multiple weighting factors can significantly improve the ranking results.
UR - http://www.scopus.com/inward/record.url?scp=36349024712&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=36349024712&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:36349024712
SN - 1577353234
SN - 9781577353232
T3 - Proceedings of the National Conference on Artificial Intelligence
SP - 317
EP - 322
BT - AAAI-07/IAAI-07 Proceedings
Y2 - 22 July 2007 through 26 July 2007
ER -