Survey on Clustering Algorithms for Text Mining

Dawlat A. Sayed; Sohair R. Fahmy

Survey on Clustering Algorithms for Text Mining

Dawlat A. Sayed , Sohair R. Fahmy

Affiliations
1 Department of Information Science & Engineering, University of Paris, France

Subscribe/Renew Journal

Abstract
References
Article Metrics
Refbacks

Clustering is the process of combining groups of similar data objects in the same group based on similarity criteria (i.e. based on property groups). Typically, this cluster of documents is considered a centralized process. The application of this document cluster is done in two ways: online or offline. Of the two types, online cluster applications are generally more limited due to availability issues than offline applications. With this document clustering, you can complete a variety of tasks such as grouping domain-based documents, analyzing customer feedback, and finding meaningful hidden topics across all documents. The data used for clustering is used for normalization. In terms of efficiency and accuracy, the K-means produces better results compared to other algorithms.