Not registered? - Request an account here

Segmentation and Classification of Document Images

A. Antonacopoulos, R.T. Ritchings

Digest of the IEE Colloquium on Document Image Processing and Multimedia Environments, The Institution of Electrical Engineers, November 1995, pp. 16/1-16/7, ISSN 0963-3308

Abstract

There is a significant and growing need to convert documents from printed paper to an electronic form. Document image analysis is concerned with the segmentation of the document image into regions of interest, their description, and the classification of the regions according to the type of their contents. A new unified approach to page segmentation and classification, based on the description of the background with tiles, is presented. The segmentation method is flexible to successfully analyse and describe regions in complicated layouts where other methods fail. Images with severe skew are handled equally well with no additional computations. The classification is based on textural features which are derived by simple calculations from the representation of space in the regions, produced during the segmentation process. This is a considerable advantage over previous methods where extra image accesses and lengthy computations are necessary. Overall, the whole approach of segmentation and classification by white tiles is fast and efficient as no time-consuming processes are required.

Citation

A. Antonacopoulos, R.T. Ritchings , "Segmentation and Classification of Document Images", Digest of the IEE Colloquium on Document Image Processing and Multimedia Environments, The Institution of Electrical Engineers, November 1995, pp. 16/1-16/7, ISSN 0963-3308

DOI

10.1049/ic:19951197

Full Paper

Download PDF