Open Access Open Access  Restricted Access Subscription Access

Document Clustering using a New Similarity Measure based on Energy of a Bipartite Graph


Affiliations
1 Department of Mathematics, School of Advanced Sciences, VIT University, Chennai - 600127, Tamil Nadu, India
 

Objectives: This paper aims at clustering documents using a new similarity measure based on energy of a bipartite graph. Methods/Statistical Analysis: We have made use of bipartite representation of documents and clustered them. The proposed algorithm has been illustrated for a small document set. The documents have been clustered using the new similarity measure based on energy of a bipartite graph introduced by us. Findings: Our proposed algorithm gives a better clustering quality comparing with the k means clustering algorithm. Application/Improvements: This proposed algorithm can be further extended and applied to cluster large document sets.

Keywords

Bipartite Graph, Cluster Quality, Document Clustering, Energy, Similarity Measure.
User

Abstract Views: 191

PDF Views: 0




  • Document Clustering using a New Similarity Measure based on Energy of a Bipartite Graph

Abstract Views: 191  |  PDF Views: 0

Authors

G. Hannah Grace
Department of Mathematics, School of Advanced Sciences, VIT University, Chennai - 600127, Tamil Nadu, India
Kalyani Desikan
Department of Mathematics, School of Advanced Sciences, VIT University, Chennai - 600127, Tamil Nadu, India

Abstract


Objectives: This paper aims at clustering documents using a new similarity measure based on energy of a bipartite graph. Methods/Statistical Analysis: We have made use of bipartite representation of documents and clustered them. The proposed algorithm has been illustrated for a small document set. The documents have been clustered using the new similarity measure based on energy of a bipartite graph introduced by us. Findings: Our proposed algorithm gives a better clustering quality comparing with the k means clustering algorithm. Application/Improvements: This proposed algorithm can be further extended and applied to cluster large document sets.

Keywords


Bipartite Graph, Cluster Quality, Document Clustering, Energy, Similarity Measure.



DOI: https://doi.org/10.17485/ijst%2F2016%2Fv9i40%2F126272