Comparison of String Similarity Algorithms to Measure Lexical Similarity

Sagar J. Gandhi ¹, Mihirraj M. Thakor ¹, Jikitsha Sheth ², Hariom I. Pandit ¹, Hemin S. Patel ¹

Affiliations
1 Shrimad Rajchandra Institute of Management and Computer Applications, UTU, Bardoli, India
2 Shrimad Rajchandra Inst. of Management & Comp. Appl., UTU, Bardoli, India

String similarity represents the lexical similarity between two words, and it can be further exploited to identify similarity between questions. Several string similarity algorithms exist in the literature. In this paper, the authors implement five of them: Dice coefficient, Jaccard similarity, Levenshtein distance, Jaro distance, and cosine similarity. The results of these algorithms are then compared with the ratings of human judges to determine which algorithm most closely resembles the way humans distinguish between the given strings. The experiments were conducted on 1000 English word pairs.
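For readers unfamiliar with the five measures named in the abstract, the following is a minimal Python sketch of each. It is not the authors' implementation, which the abstract does not describe. Several choices here are assumptions made for illustration: Dice, Jaccard, and cosine are computed over character bigrams, the Jaro match window follows the common textbook definition, and `levenshtein_similarity` is a hypothetical helper that normalizes the edit distance into the range 0 to 1 so it can be compared with the other scores.

```python
from collections import Counter
from math import sqrt


def bigrams(s):
    """Character bigrams of s (tokenization is an assumption of this sketch)."""
    return [s[i:i + 2] for i in range(len(s) - 1)]


def dice(a, b):
    """Dice coefficient over sets of character bigrams."""
    x, y = set(bigrams(a)), set(bigrams(b))
    if not x and not y:
        return 1.0 if a == b else 0.0
    return 2 * len(x & y) / (len(x) + len(y))


def jaccard(a, b):
    """Jaccard similarity over sets of character bigrams."""
    x, y = set(bigrams(a)), set(bigrams(b))
    union = x | y
    if not union:
        return 1.0 if a == b else 0.0
    return len(x & y) / len(union)


def levenshtein(a, b):
    """Edit distance: minimum insertions, deletions, and substitutions."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        cur = [i]
        for j, cb in enumerate(b, 1):
            cur.append(min(prev[j] + 1,                # deletion
                           cur[j - 1] + 1,             # insertion
                           prev[j - 1] + (ca != cb)))  # substitution
        prev = cur
    return prev[-1]


def levenshtein_similarity(a, b):
    """Hypothetical normalization of the edit distance to [0, 1]."""
    longest = max(len(a), len(b))
    return 1.0 if longest == 0 else 1 - levenshtein(a, b) / longest


def jaro(a, b):
    """Jaro similarity using the usual match window of max(len) // 2 - 1."""
    if a == b:
        return 1.0
    if not a or not b:
        return 0.0
    window = max(max(len(a), len(b)) // 2 - 1, 0)
    a_hit, b_hit = [False] * len(a), [False] * len(b)
    m = 0
    for i, ca in enumerate(a):
        for j in range(max(0, i - window), min(len(b), i + window + 1)):
            if not b_hit[j] and b[j] == ca:
                a_hit[i] = b_hit[j] = True
                m += 1
                break
    if m == 0:
        return 0.0
    a_matched = [c for c, hit in zip(a, a_hit) if hit]
    b_matched = [c for c, hit in zip(b, b_hit) if hit]
    # Half the number of matched characters that appear in a different order.
    t = sum(x != y for x, y in zip(a_matched, b_matched)) // 2
    return (m / len(a) + m / len(b) + (m - t) / m) / 3


def cosine(a, b):
    """Cosine similarity between bigram count vectors."""
    x, y = Counter(bigrams(a)), Counter(bigrams(b))
    dot = sum(x[g] * y[g] for g in x)
    norm = sqrt(sum(v * v for v in x.values())) * sqrt(sum(v * v for v in y.values()))
    return dot / norm if norm else 0.0
```

With these definitions, a word pair such as "night" and "nacht" shares only the bigram "ht", so the set-based and vector-based measures score it low, while the edit-based measures see two substitutions in five characters. Differences of this kind are what a comparison against human judgments would need to weigh.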

National Journal of System and Information Technology
