
Segmentation of Printed Meitei/Meetei Script Documents


Affiliations
1 Department of Computer Science, Manipur University, Imphal, India
     



An Optical Character Recognition (OCR) system has three main processes: pre-processing, segmentation, and recognition. Character segmentation is one of the most crucial steps in developing an OCR system for any language, because the accuracy of the system depends on how well individual characters are segmented. Segmentation extracts the lines, words, and individual characters from the document image. The Meitei/Meetei script is not widely used in India, but the language is a scheduled Indian language of Tibeto-Burman origin and is highly agglutinative. Character segmentation of Meitei/Meetei script is difficult because adjacent characters overlap. In this paper we propose a methodology in which individual text lines and words are segmented using the projection profile technique, and individual characters are segmented using connected component analysis. The proposed method was tested and achieved a segmentation accuracy of 95.6%.
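The two techniques named in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation, which the abstract does not show. It assumes a binarized page stored as a list of rows with 1 for ink and 0 for background, treats a blank row or column as a separator, and uses 8-connectivity for components. The function names, the `min_gap` parameter, and the toy `page` are hypothetical.

```python
from collections import deque

def runs(profile):
    """Return (start, end) pairs, end exclusive, for each run of non-zero entries."""
    spans, start = [], None
    for i, value in enumerate(profile):
        if value and start is None:
            start = i
        elif not value and start is not None:
            spans.append((start, i))
            start = None
    if start is not None:
        spans.append((start, len(profile)))
    return spans

def segment_lines(img):
    """Horizontal projection profile: ink count per row; blank rows split text lines."""
    return runs([sum(row) for row in img])

def segment_columns(img, top, bottom, min_gap=1):
    """Vertical projection profile inside rows top..bottom (bottom exclusive).
    Runs separated by fewer than min_gap blank columns are merged (assumed rule)."""
    width = len(img[0])
    profile = [sum(img[r][c] for r in range(top, bottom)) for c in range(width)]
    merged = []
    for start, end in runs(profile):
        if merged and start - merged[-1][1] < min_gap:
            merged[-1] = (merged[-1][0], end)
        else:
            merged.append((start, end))
    return merged

def connected_components(img):
    """8-connected components found by breadth-first search.
    Returns bounding boxes (top, left, bottom, right), ends exclusive, sorted by left edge."""
    height, width = len(img), len(img[0])
    seen = [[False] * width for _ in range(height)]
    boxes = []
    for r in range(height):
        for c in range(width):
            if not img[r][c] or seen[r][c]:
                continue
            seen[r][c] = True
            queue = deque([(r, c)])
            top, left, bottom, right = r, c, r, c
            while queue:
                y, x = queue.popleft()
                top, bottom = min(top, y), max(bottom, y)
                left, right = min(left, x), max(right, x)
                for dy in (-1, 0, 1):
                    for dx in (-1, 0, 1):
                        ny, nx = y + dy, x + dx
                        if (0 <= ny < height and 0 <= nx < width
                                and img[ny][nx] and not seen[ny][nx]):
                            seen[ny][nx] = True
                            queue.append((ny, nx))
            boxes.append((top, left, bottom + 1, right + 1))
    return sorted(boxes, key=lambda box: box[1])

# Toy page: rows 0-2 hold two glyphs whose column ranges overlap; row 4 is a second line.
page = [
    [1, 1, 1, 0, 0],
    [1, 0, 0, 0, 1],
    [0, 0, 1, 1, 1],
    [0, 0, 0, 0, 0],
    [1, 0, 0, 0, 1],
]

lines = segment_lines(page)                        # two text lines
first_line_columns = segment_columns(page, 0, 3)   # one run: the glyphs overlap in column 2
first_line_glyphs = connected_components(page[0:3])  # two separate bounding boxes
```

On the toy page, the vertical profile of the first line has no blank column, so it cannot separate the two overlapping glyphs, while component labelling returns one box per glyph. That is the kind of case in which a component-based step is useful after projection-based line and word splitting.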


Keywords

Character Segmentation, Connected Component Analysis, Meitei/Meetei Script, OCR, Projection Profile.

Authors

Y. Loijing Khomba Khuman
Department of Computer Science, Manipur University, Imphal, India
H. Mamata Devi
Department of Computer Science, Manipur University, Imphal, India
Ksh. Nareshkumar Singh
Department of Computer Science, Manipur University, Imphal, India
S. Poireiton Meitei
Department of Computer Science, Manipur University, Imphal, India
N. Ajith Singh
Department of Computer Science, Manipur University, Imphal, India
