TY - GEN
T1 - Searching for tables in digital documents
AU - Liu, Ying
AU - Bai, Kun
AU - Mitra, Prasenjit
AU - Lee Giles, C.
PY - 2007
Y1 - 2007
N2 - Tables are ubiquitous. In scientific documents, tables are widely used to present experimental results or statistical data in a condensed fashion. Current search engines do not allow the end-user to search for relevant tables. In this paper, we describe TableSeer, an automatic table extraction and search engine system. TableSeer crawls scientific documents, identifies documents with tables, extracts tables from documents, indexes them and enables end-users to search for tables. We also propose an extensive set of mediumindependent metadata for tables representation. Given a query, TableSeer ranks the returned results using an innovative ranking algorithm - TableRank. Our results show that TableSeer outperforms popular search engines, such as Google Scholar when the end-user seeks for tables.
AB - Tables are ubiquitous. In scientific documents, tables are widely used to present experimental results or statistical data in a condensed fashion. Current search engines do not allow the end-user to search for relevant tables. In this paper, we describe TableSeer, an automatic table extraction and search engine system. TableSeer crawls scientific documents, identifies documents with tables, extracts tables from documents, indexes them and enables end-users to search for tables. We also propose an extensive set of mediumindependent metadata for tables representation. Given a query, TableSeer ranks the returned results using an innovative ranking algorithm - TableRank. Our results show that TableSeer outperforms popular search engines, such as Google Scholar when the end-user seeks for tables.
UR - http://www.scopus.com/inward/record.url?scp=51149113056&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=51149113056&partnerID=8YFLogxK
U2 - 10.1109/ICDAR.2007.4377052
DO - 10.1109/ICDAR.2007.4377052
M3 - Conference contribution
AN - SCOPUS:51149113056
SN - 0769528228
SN - 9780769528229
T3 - Proceedings of the International Conference on Document Analysis and Recognition, ICDAR
SP - 934
EP - 938
BT - Proceedings - 9th International Conference on Document Analysis and Recognition, ICDAR 2007
T2 - 9th International Conference on Document Analysis and Recognition, ICDAR 2007
Y2 - 23 September 2007 through 26 September 2007
ER -