TY - GEN
T1 - Automatic extraction of table metadata from digital documents
AU - Liu, Ying
AU - Mitra, Prasenjit
AU - Giles, C. Lee
AU - Bai, Kun
PY - 2006
Y1 - 2006
N2 - Tables are used to present, list, summarize, and structure important data in documents. In scholarly articles, they are often used to present the relationships among data and highlight a collection of results obtained from experiments and scientific analysis. In digital libraries, extracting this data automatically and understanding the structure and content of tables are very important to many applications. Automatic identification, extraction, and search for the contents of tables can be made more precise with the help of metadata. In this paper, we propose a set of medium-independent table metadata to facilitate the table indexing, searching, and exchanging. To extract the contents of tables and their metadata, an automatic table metadata extraction algorithm is designed and tested on PDF documents.
AB - Tables are used to present, list, summarize, and structure important data in documents. In scholarly articles, they are often used to present the relationships among data and highlight a collection of results obtained from experiments and scientific analysis. In digital libraries, extracting this data automatically and understanding the structure and content of tables are very important to many applications. Automatic identification, extraction, and search for the contents of tables can be made more precise with the help of metadata. In this paper, we propose a set of medium-independent table metadata to facilitate the table indexing, searching, and exchanging. To extract the contents of tables and their metadata, an automatic table metadata extraction algorithm is designed and tested on PDF documents.
UR - http://www.scopus.com/inward/record.url?scp=34247230999&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=34247230999&partnerID=8YFLogxK
U2 - 10.1145/1141753.1141835
DO - 10.1145/1141753.1141835
M3 - Conference contribution
AN - SCOPUS:34247230999
SN - 1595933549
SN - 9781595933546
T3 - Proceedings of the ACM/IEEE Joint Conference on Digital Libraries
SP - 339
EP - 340
BT - 6th ACM/IEEE-CS Joint Conference on Digital Libraries 2006
T2 - 6th ACM/IEEE-CS Joint Conference on Digital Libraries 2006: Opening Information Horizons, JCDL '06
Y2 - 11 June 2006 through 15 June 2006
ER -