Garg, Urvashi
- Effect of Stop Word Removal on Document Similarity for Hindi Text
Authors
Urvashi Garg 1, Vishal Goyal 2
Affiliations
1 Haryana College of Technology and Management, Kaithal, IN
2 Punjabi University, Patiala, IN
Source
Research Cell: An International Journal of Engineering Sciences, Vol 13 (2014), Pagination: 161-163
Abstract
Stop word removal is an important NLP technique, since stop words are very common in any document. In this paper, we create a list of stop words for Hindi text based on the frequency of words in documents. Hindi documents from the EMILLE corpus, in UTF-8 encoding, are used to identify the stop words. The percentage of stop words in each document is determined and analyzed experimentally. The paper discusses the effect of stop word removal on the similarity of two documents containing Hindi text. The Hoad and Zobel approach is used to compute the similarity of documents containing Hindi text.
Keywords
Stop Words, Removal, Text, Hindi, List, Frequency.
- Plagiarism and Detection Tools: An Overview
Authors
Affiliations
1 HCTM, Kaithal, IN