Open Access Open Access  Restricted Access Subscription Access

Segmentation of Broken Characters of Handwritten Gurmukhi Script


Affiliations
1 Department of Computer Engineering, Yadavindra College of Engineering, Talwandi Sabo (Bathinda), India
2 Department of Computer Engineering, Yadavindra College of Engineering, Talwandi Sabo (Bathinda), India
 

Character Segmentation of Handwritten Documents has been an active area of research and due to its diverse applicable environment; it continues to be a challenging research topic. The desire to edit scanned text document forces the researchers to think about the optical character recognition (OCR). OCR is the process of recognizing a segmented part of the scanned image as a character. OCR process consists of three major sub processes - pre processing, segmentation and then recognition. Out of these three, the segmentation process is the most important phase of the overall OCR process. Different problems in the characters segmentation of handwritten text is due to the different writing style of different people because the size and shape is not fixed while we write any text. In this work, we formulate an algorithm to segment the scanned document image as a character. According to proposed algorithm, broken characters in Gurmukhi script, we used the segmentation of these characters that can become easily identify how many characters are in one word. To develop the algorithm to segment the characters from a word we are using combinations of two approaches which are Horizontal Profile Projection and Vertical Profile Projection. And get the accuracy is 93%.

Keywords

Gurmukhi Script, OCR, Segmentation, Handwritten Document, Horizontal Profile Projection, Vertical Profile Projection.
User
Notifications
Font Size

Abstract Views: 202

PDF Views: 0




  • Segmentation of Broken Characters of Handwritten Gurmukhi Script

Abstract Views: 202  |  PDF Views: 0

Authors

Bharti Mehta
Department of Computer Engineering, Yadavindra College of Engineering, Talwandi Sabo (Bathinda), India
Simpel Rani
Department of Computer Engineering, Yadavindra College of Engineering, Talwandi Sabo (Bathinda), India

Abstract


Character Segmentation of Handwritten Documents has been an active area of research and due to its diverse applicable environment; it continues to be a challenging research topic. The desire to edit scanned text document forces the researchers to think about the optical character recognition (OCR). OCR is the process of recognizing a segmented part of the scanned image as a character. OCR process consists of three major sub processes - pre processing, segmentation and then recognition. Out of these three, the segmentation process is the most important phase of the overall OCR process. Different problems in the characters segmentation of handwritten text is due to the different writing style of different people because the size and shape is not fixed while we write any text. In this work, we formulate an algorithm to segment the scanned document image as a character. According to proposed algorithm, broken characters in Gurmukhi script, we used the segmentation of these characters that can become easily identify how many characters are in one word. To develop the algorithm to segment the characters from a word we are using combinations of two approaches which are Horizontal Profile Projection and Vertical Profile Projection. And get the accuracy is 93%.

Keywords


Gurmukhi Script, OCR, Segmentation, Handwritten Document, Horizontal Profile Projection, Vertical Profile Projection.