Open Access Open Access  Restricted Access Subscription Access
Open Access Open Access Open Access  Restricted Access Restricted Access Subscription Access

Efficient Keyword Based Document Clustering Using Fuzzy C-Means Algorithm


Affiliations
1 Erode Arts and Science College, Erode-638009, Tamil Nadu, India
     

   Subscribe/Renew Journal


Clustering is an useful technique in the field of textual data mining. Cluster analysis divides objects into meaningful groups based on similarity between objects. The existing clustering approaches face the issues like practical applicability, very less accuracy, more classification time etc. In recent times, inclusion of fuzzy logic in clustering results in better clustering results. In order to further improve the performance of clustering, the  Fuzzy C-Means (FCMA) Algorithm is used. The keywords are extracted from the documents using LSA based document extraction. The Fuzzy partition matrix is created for the clustering process and the performance of the document clustering is greater based on the keyword when compared to the Existing K-Means Clustering and EM Algorithm. The proposed technique will be highly useful in the text mining process to increase the accuracy and performance of the text extraction process.

Keywords

Document Clustering, Fuzzy Cluster, Fuzzy C-Means, K-Means Clustering.
User
Subscription Login to verify subscription
Notifications
Font Size

Abstract Views: 249

PDF Views: 3




  • Efficient Keyword Based Document Clustering Using Fuzzy C-Means Algorithm

Abstract Views: 249  |  PDF Views: 3

Authors

K. Prabha
Erode Arts and Science College, Erode-638009, Tamil Nadu, India
K. Vivekanandan
Erode Arts and Science College, Erode-638009, Tamil Nadu, India
S. Sukumaran
Erode Arts and Science College, Erode-638009, Tamil Nadu, India

Abstract


Clustering is an useful technique in the field of textual data mining. Cluster analysis divides objects into meaningful groups based on similarity between objects. The existing clustering approaches face the issues like practical applicability, very less accuracy, more classification time etc. In recent times, inclusion of fuzzy logic in clustering results in better clustering results. In order to further improve the performance of clustering, the  Fuzzy C-Means (FCMA) Algorithm is used. The keywords are extracted from the documents using LSA based document extraction. The Fuzzy partition matrix is created for the clustering process and the performance of the document clustering is greater based on the keyword when compared to the Existing K-Means Clustering and EM Algorithm. The proposed technique will be highly useful in the text mining process to increase the accuracy and performance of the text extraction process.

Keywords


Document Clustering, Fuzzy Cluster, Fuzzy C-Means, K-Means Clustering.