Open Access Open Access  Restricted Access Subscription Access
Open Access Open Access Open Access  Restricted Access Restricted Access Subscription Access

A Partial Ratio and Ratio Based Fuzzy-Wuzzy Procedure for Characteristic Mining of Mathematical Formulas from Documents


Affiliations
1 Department of Computer Science and Engineering, GITAM Institute of Technology, India
2 Department of Information Technology, Anil Neerukonda Institute of Technology and Sciences, India
3 Department of Computer Science and Systems Engineering, Andhra University, India
     

   Subscribe/Renew Journal


Retrieval of mathematical text from data is a key predicament in present circumstances. To achieve this, we have considered three different algorithms viz., Sequence matcher, Levenshtein Distance and Fuzzy-Wuzzy. Two different variants of Fuzzy-Wuzzy are found applicable to this study out of four variants. Performance of these variants in retrieving mathematical texts, is calculated using efficiency measure, sensitivity analysis and time series exploration. Fuzzy-Wuzzy partial ratio algorithm scored better over the other variants on efficiency measure and sensitivity analysis.

Keywords

Sequence Matcher, Levenshtein Distance, Fuzzy-Wuzzy, Partial Ratio.
Subscription Login to verify subscription
User
Notifications
Font Size


  • A Partial Ratio and Ratio Based Fuzzy-Wuzzy Procedure for Characteristic Mining of Mathematical Formulas from Documents

Abstract Views: 468  |  PDF Views: 2

Authors

G. Appa Rao
Department of Computer Science and Engineering, GITAM Institute of Technology, India
G. Srinivas
Department of Information Technology, Anil Neerukonda Institute of Technology and Sciences, India
K. Venkata Rao
Department of Computer Science and Systems Engineering, Andhra University, India
P. V. G. D. Prasad Reddy
Department of Computer Science and Systems Engineering, Andhra University, India

Abstract


Retrieval of mathematical text from data is a key predicament in present circumstances. To achieve this, we have considered three different algorithms viz., Sequence matcher, Levenshtein Distance and Fuzzy-Wuzzy. Two different variants of Fuzzy-Wuzzy are found applicable to this study out of four variants. Performance of these variants in retrieving mathematical texts, is calculated using efficiency measure, sensitivity analysis and time series exploration. Fuzzy-Wuzzy partial ratio algorithm scored better over the other variants on efficiency measure and sensitivity analysis.

Keywords


Sequence Matcher, Levenshtein Distance, Fuzzy-Wuzzy, Partial Ratio.

References