TY - GEN
T1 - Adaptive removal of background and white space from document images using seam categorization
AU - Fillion, Claude
AU - Fan, Zhigang
AU - Monga, Vishal
PY - 2011
Y1 - 2011
N2 - Document images are obtained regularly by rasterization of document content and as scans of printed documents. Resizing via background and white space removal is often desired for better consumption of these images, whether on displays or in print. While white space and background are easy to identify in images, existing methods such as naïve removal and content aware resizing (seam carving) each have limitations that can lead to undesirable artifacts, such as uneven spacing between lines of text or poor arrangement of content. An adaptive method based on image content is hence needed. In this paper we propose an adaptive method to intelligently remove white space and background content from document images. Document images are different from pictorial images in structure. They typically contain objects (text letters, pictures and graphics) separated by uniform background, which include both white paper space and other uniform color background. Pixels in uniform background regions are excellent candidates for deletion if resizing is required, as they introduce less change in document content and style, compared with deletion of object pixels. We propose a background deletion method that exploits both local and global context. The method aims to retain the document structural information and image quality.
AB - Document images are obtained regularly by rasterization of document content and as scans of printed documents. Resizing via background and white space removal is often desired for better consumption of these images, whether on displays or in print. While white space and background are easy to identify in images, existing methods such as naïve removal and content aware resizing (seam carving) each have limitations that can lead to undesirable artifacts, such as uneven spacing between lines of text or poor arrangement of content. An adaptive method based on image content is hence needed. In this paper we propose an adaptive method to intelligently remove white space and background content from document images. Document images are different from pictorial images in structure. They typically contain objects (text letters, pictures and graphics) separated by uniform background, which include both white paper space and other uniform color background. Pixels in uniform background regions are excellent candidates for deletion if resizing is required, as they introduce less change in document content and style, compared with deletion of object pixels. We propose a background deletion method that exploits both local and global context. The method aims to retain the document structural information and image quality.
UR - http://www.scopus.com/inward/record.url?scp=79953019352&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=79953019352&partnerID=8YFLogxK
U2 - 10.1117/12.877266
DO - 10.1117/12.877266
M3 - Conference contribution
AN - SCOPUS:79953019352
SN - 9780819484161
T3 - Proceedings of SPIE - The International Society for Optical Engineering
BT - Imaging and Printing in a Web 2.0 World II
T2 - Imaging and Printing in a Web 2.0 World II
Y2 - 26 January 2011 through 27 January 2011
ER -