Open Access Open Access  Restricted Access Subscription Access
Open Access Open Access Open Access  Restricted Access Restricted Access Subscription Access

Multiscale Segmentation for Mixed Raster Content Applicable to Document Coding


Affiliations
1 Department of CSE, Mepco Schlenk Engineering College, Sivakasi, India
     

   Subscribe/Renew Journal


Compound document images contain graphic or textual content along with pictures. They are found in magazines, brochures, web-sites, etc in a document format. The goal is to compress an image containing the mixed raster content (MRC) using multi-layer approach. The proposed methodology segments the image into regions such as text, pictures and background. The key to MRC compression is the separation of the document into foreground and background layers, represented as a binary mask. The compression quality depends on the segmentation algorithm used to compute the binary mask.
The proposed multi-scale segmentation algorithm models the complex aspects of both local and global contextual behavior. The proposed algorithm finds the block-wise segmentation of the raster image in a global cost optimization framework. Then the initial segmentation is refined by classifying feature vectors of connected components using a Markov random field (MRF) model. Then hybrid procedures of the previous steps are then incorporated into a multi-scale framework in order to improve the segmentation accuracy of text with varying size. It is shown that the proposed methodology achieves greater accuracy of text detection but with a lower false detection rate of non-text features. This segmentation algorithm can improve the quality of decoded documents while simultaneously lowering the bit rate. It is also shown that execution time can be greatly reduced by the use of features that are not computationally intensive.

Keywords

Muliscale Image Analysis, Mixed Raster Content, Document Image Segmentation, MRC Compression, Markov Random Fields, Document Coding.
User
Subscription Login to verify subscription
Notifications
Font Size

Abstract Views: 222

PDF Views: 2




  • Multiscale Segmentation for Mixed Raster Content Applicable to Document Coding

Abstract Views: 222  |  PDF Views: 2

Authors

S. Amutha
Department of CSE, Mepco Schlenk Engineering College, Sivakasi, India
V. Ponraj
Department of CSE, Mepco Schlenk Engineering College, Sivakasi, India

Abstract


Compound document images contain graphic or textual content along with pictures. They are found in magazines, brochures, web-sites, etc in a document format. The goal is to compress an image containing the mixed raster content (MRC) using multi-layer approach. The proposed methodology segments the image into regions such as text, pictures and background. The key to MRC compression is the separation of the document into foreground and background layers, represented as a binary mask. The compression quality depends on the segmentation algorithm used to compute the binary mask.
The proposed multi-scale segmentation algorithm models the complex aspects of both local and global contextual behavior. The proposed algorithm finds the block-wise segmentation of the raster image in a global cost optimization framework. Then the initial segmentation is refined by classifying feature vectors of connected components using a Markov random field (MRF) model. Then hybrid procedures of the previous steps are then incorporated into a multi-scale framework in order to improve the segmentation accuracy of text with varying size. It is shown that the proposed methodology achieves greater accuracy of text detection but with a lower false detection rate of non-text features. This segmentation algorithm can improve the quality of decoded documents while simultaneously lowering the bit rate. It is also shown that execution time can be greatly reduced by the use of features that are not computationally intensive.

Keywords


Muliscale Image Analysis, Mixed Raster Content, Document Image Segmentation, MRC Compression, Markov Random Fields, Document Coding.