Open Access Open Access  Restricted Access Subscription Access
Open Access Open Access Open Access  Restricted Access Restricted Access Subscription Access

Discrete Cosine Transform for Script Identification and Character Recognition


Affiliations
1 Veer Narmad South Gujarat University, India
     

   Subscribe/Renew Journal


Optical Character Recognition (OCR) in printed multi-script documents is still challenge due to the script dependence of OCR. Identification of script is important phase in design of multi-script OCR system for processing of multi-script documents. Most of the script identification work reported is on document, paragraph/block, and word level. This research article presents character level script identification and character recognition using Discrete Cosine Transforms (DCT) feature in bilingual Gujarati-English text. DCT is employed to extract the features based on energy coefficients analysis. The proposed method has two phases: Classification and Recognition. In classification, performance of KNN and SVM classifiers is studied separately and compared. The same DCT features are used in recognition phase. Experiments and results show that, presented method is robust for character level printed bilingual script identification and character recognition.

Keywords

Bilingual, DCT, Classification, Recognition.
Subscription Login to verify subscription
User
Notifications
Font Size


Abstract Views: 228

PDF Views: 0




  • Discrete Cosine Transform for Script Identification and Character Recognition

Abstract Views: 228  |  PDF Views: 0

Authors

Shailesh Chaudhari
Veer Narmad South Gujarat University, India

Abstract


Optical Character Recognition (OCR) in printed multi-script documents is still challenge due to the script dependence of OCR. Identification of script is important phase in design of multi-script OCR system for processing of multi-script documents. Most of the script identification work reported is on document, paragraph/block, and word level. This research article presents character level script identification and character recognition using Discrete Cosine Transforms (DCT) feature in bilingual Gujarati-English text. DCT is employed to extract the features based on energy coefficients analysis. The proposed method has two phases: Classification and Recognition. In classification, performance of KNN and SVM classifiers is studied separately and compared. The same DCT features are used in recognition phase. Experiments and results show that, presented method is robust for character level printed bilingual script identification and character recognition.

Keywords


Bilingual, DCT, Classification, Recognition.