
An Evolutionary and a Rule-Based Approach to String Transformation


Affiliations
1 CSE Department, Vigana’s Institute of Engg. for Women, Ranga Reddy, Telangana, India
     



Natural language processing focuses on analyzing and processing human languages using digital computers. String transformation is an important area of research in the field of natural language processing. It maps a source string to a desired form and underlies applications such as spelling error correction, query reformulation, top-k related candidate generation, and word stemming. Although various traditional approaches to string transformation are available, they cannot be considered optimal with respect to accuracy and efficiency, the basic parameters to optimise. This paper proposes a novel model for string transformation which is up to 99% accurate, with an improved F-measure and G-measure. The model uses an evolutionary approach, which involves methods to search the population of keywords, an algorithm to find the distances between strings, and finally transformation of the strings with and without a dictionary. Our paper mainly focuses on the following: (1) spelling error correction, which detects a wrong spelling and provides a correct suggestion, (2) top-k candidate generation, which provides the most related suggestions for a keyword, (3) query reformulation, which transforms a short form of a query into an elaborate form, and (4) word stemming, which identifies the stem of a word when it is concatenated with grammatical affixes. In short, it identifies redundant queries. It is a rule-based system implemented without a dictionary. Graphical comparisons are shown for the candidates generated by the existing and the proposed systems. Experimental results on large-scale data show that the proposed model is accurate and improves on the traditional approach.
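
To make the dictionary-based part of this pipeline concrete, the following is a minimal sketch of top-k candidate generation driven by a string distance. The abstract does not say which distance the authors use, so Levenshtein edit distance is assumed here, and the function names and the sample vocabulary are hypothetical.

```python
import heapq


def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance (assumed metric; the abstract names none)."""
    # Keep the shorter string in the inner loop so each row stays small.
    if len(a) < len(b):
        a, b = b, a
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def top_k_candidates(query: str, vocabulary: list[str], k: int = 3) -> list[str]:
    """Return the k vocabulary words closest to the query.

    Ties on distance are broken alphabetically so the result is deterministic.
    """
    return heapq.nsmallest(k, vocabulary,
                           key=lambda w: (edit_distance(query, w), w))


# Hypothetical vocabulary, for illustration only.
vocabulary = ["spelling", "spewing", "sibling", "apple"]
suggestions = top_k_candidates("speling", vocabulary, k=2)
```

With k=1 the same function acts as a simple spelling corrector. It scans the whole vocabulary on every query, which is acceptable for a sketch but would need an index for large-scale data.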

Keywords

Natural Language Processing, String Transformation, Spelling Error Correction, Top K Candidate Generation, Query Reformulation, Stemming, F-Measure, G-Measure.
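
The F-measure and G-measure listed above can be computed from counts of correct and incorrect suggestions. The sketch below assumes the common definitions (F-measure as the harmonic mean and G-measure as the geometric mean of precision and recall). The source does not state its formulas, and the counts used are invented for illustration, not results from the paper.

```python
import math


def precision_recall(tp: int, fp: int, fn: int) -> tuple[float, float]:
    """Precision and recall from true-positive, false-positive and false-negative counts."""
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    return precision, recall


def f_measure(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall (assumed definition)."""
    total = precision + recall
    return 2 * precision * recall / total if total else 0.0


def g_measure(precision: float, recall: float) -> float:
    """Geometric mean of precision and recall (assumed definition)."""
    return math.sqrt(precision * recall)


# Illustrative counts only: 8 correct suggestions, 2 spurious, 2 missed.
p, r = precision_recall(tp=8, fp=2, fn=2)
f = f_measure(p, r)
g = g_measure(p, r)
```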

Authors

Nandita Bhanja Chaudhuri
CSE Department, Vigana’s Institute of Engg. for Women, Ranga Reddy, Telangana, India
D. Kamal Kumari
CSE Department, Vigana’s Institute of Engg. for Women, Ranga Reddy, Telangana, India
S. Ram Prasad Reddy
CSE Department, Vigana’s Institute of Engg. for Women, Ranga Reddy, Telangana, India
