Open Access Open Access  Restricted Access Subscription Access

Determine Characters by Mathematical Model for Segmentation Arabic Words by Voronoi Diagrams


Affiliations
1 School of Computer Science, Faculty of Information Science and Technology, Centre of Artificial Intelligence Universiti Kebangsaan Malaysia, Bangi - 43200, Selangor, Malaysia
 

Objectives: The objectives are to use a mathematical model to define a region-based segmentation method. This study determines whether the Connected Component (CC) is one or more than one character. Method: Whereas the other methods they tend to ignore the solid foundation of describing characters and connection points. This proposed method adopts on many stages for adaptive the mathematic in segmentation characters process are: i) peak detection from vertical histogram for (CC), and ii) enhancement of the model using a mathematical model to improve the segmentation method based on the Voronoi Diagram (VD) Through a number of peaks. Findings: Whereas characters, such as س and ص, are confusing to segmentation methods; these errors include separating connection strokes from both sides to produce a separated one. Other errors must be handled at a later stage, such as segmenting the character ح at an acute angle. Whereas the mathematical model is depending on peaks, numbers, direction, and length of CC. This model is tested on segmentation using five Arabic datasets as: AHDB, IFN-ENIT, AHDB-FTR, APTI, Zeki and Al Hamad DB datasets. The Preliminary results show that the application of the EDMS feature with multi perceptron-NN classifier it’s preferable. Its accuracy when compared with Zeki method is 96.81% for the ACTOR printed dataset and the rate of this method is 85.81% for Zeki dataset and also compared with Al Hamad method is 95.09%, and 89.10% for ACDARhandwrittendataset. Whereas the others datasets accuracies are 95.09% for IFN-ENIT, 98.27% for APTI, 91.63% for AHDB, and 90.69% for AHDB-FTR on same feature (EDMS) and classifier (MLP_NN). Novelty: Adapt Mathematics with segmentation process to determine whether the CC is one or more than one character. Using a mathematical model based on the VD to avoid over segmentation .

Keywords

Mathematical Model, Arabic words, More than one character, Segmentation, Voronoi Diagrams.
User

Abstract Views: 160

PDF Views: 0




  • Determine Characters by Mathematical Model for Segmentation Arabic Words by Voronoi Diagrams

Abstract Views: 160  |  PDF Views: 0

Authors

Jabril Ramdan
School of Computer Science, Faculty of Information Science and Technology, Centre of Artificial Intelligence Universiti Kebangsaan Malaysia, Bangi - 43200, Selangor, Malaysia
Khairuddin Omar
School of Computer Science, Faculty of Information Science and Technology, Centre of Artificial Intelligence Universiti Kebangsaan Malaysia, Bangi - 43200, Selangor, Malaysia
Mohammad Faidzul
School of Computer Science, Faculty of Information Science and Technology, Centre of Artificial Intelligence Universiti Kebangsaan Malaysia, Bangi - 43200, Selangor, Malaysia

Abstract


Objectives: The objectives are to use a mathematical model to define a region-based segmentation method. This study determines whether the Connected Component (CC) is one or more than one character. Method: Whereas the other methods they tend to ignore the solid foundation of describing characters and connection points. This proposed method adopts on many stages for adaptive the mathematic in segmentation characters process are: i) peak detection from vertical histogram for (CC), and ii) enhancement of the model using a mathematical model to improve the segmentation method based on the Voronoi Diagram (VD) Through a number of peaks. Findings: Whereas characters, such as س and ص, are confusing to segmentation methods; these errors include separating connection strokes from both sides to produce a separated one. Other errors must be handled at a later stage, such as segmenting the character ح at an acute angle. Whereas the mathematical model is depending on peaks, numbers, direction, and length of CC. This model is tested on segmentation using five Arabic datasets as: AHDB, IFN-ENIT, AHDB-FTR, APTI, Zeki and Al Hamad DB datasets. The Preliminary results show that the application of the EDMS feature with multi perceptron-NN classifier it’s preferable. Its accuracy when compared with Zeki method is 96.81% for the ACTOR printed dataset and the rate of this method is 85.81% for Zeki dataset and also compared with Al Hamad method is 95.09%, and 89.10% for ACDARhandwrittendataset. Whereas the others datasets accuracies are 95.09% for IFN-ENIT, 98.27% for APTI, 91.63% for AHDB, and 90.69% for AHDB-FTR on same feature (EDMS) and classifier (MLP_NN). Novelty: Adapt Mathematics with segmentation process to determine whether the CC is one or more than one character. Using a mathematical model based on the VD to avoid over segmentation .

Keywords


Mathematical Model, Arabic words, More than one character, Segmentation, Voronoi Diagrams.



DOI: https://doi.org/10.17485/ijst%2F2016%2Fv9i40%2F126510