TY - GEN
T1 - Automatic extraction of data from 2-D plots in documents
AU - Lu, Xiaonan
AU - Wang, James Z.
AU - Mitra, Prasenjit
AU - Giles, C. Lee
PY - 2007
Y1 - 2007
N2 - Two-dimensional (2-D) plots in digital documents contain important information. Often, the results of scientific experiments and performance of businesses are summarized using plots. Although 2-D plots are easily understood by human users, current search engines rarely utilize the information contained in the plots to enhance the results returned in response to queries posed by endusers. We propose an automated algorithm for extracting information from line curves in 2-D plots. The extracted information can be stored in a database and indexed to answer end-user queries and enhance search results. We have collected 2-D plot images from a variety of resources and tested our extraction algorithms. Experimental evaluation has demonstrated that our method can produce results suitable for real world use.
AB - Two-dimensional (2-D) plots in digital documents contain important information. Often, the results of scientific experiments and performance of businesses are summarized using plots. Although 2-D plots are easily understood by human users, current search engines rarely utilize the information contained in the plots to enhance the results returned in response to queries posed by endusers. We propose an automated algorithm for extracting information from line curves in 2-D plots. The extracted information can be stored in a database and indexed to answer end-user queries and enhance search results. We have collected 2-D plot images from a variety of resources and tested our extraction algorithms. Experimental evaluation has demonstrated that our method can produce results suitable for real world use.
UR - http://www.scopus.com/inward/record.url?scp=51149119523&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=51149119523&partnerID=8YFLogxK
U2 - 10.1109/ICDAR.2007.4378701
DO - 10.1109/ICDAR.2007.4378701
M3 - Conference contribution
AN - SCOPUS:51149119523
SN - 0769528228
SN - 9780769528229
T3 - Proceedings of the International Conference on Document Analysis and Recognition, ICDAR
SP - 188
EP - 192
BT - Proceedings - 9th International Conference on Document Analysis and Recognition, ICDAR 2007
T2 - 9th International Conference on Document Analysis and Recognition, ICDAR 2007
Y2 - 23 September 2007 through 26 September 2007
ER -