Open Access Open Access  Restricted Access Subscription Access
Open Access Open Access Open Access  Restricted Access Restricted Access Subscription Access

Towards Semantically Sensitive Text Clustering: A Feature Space Modeling Technology Based on Dimension Extension


Affiliations
1 Department of Computer Science, GATE College, Tirupati, Andhra Pradesh, India
     

   Subscribe/Renew Journal


Content bunching is a large use of knowledge mining. It’s concerned about gathering related content archives together. Proper now paper, a number of models are worked to bunch capstone venture archives using three grouping systems: okay-implies, ok-implies rapid, and k-medoids. Our dataset is acquired from the library of the University Pc and Information Sciences, King Saud tuition, Riyadh. Three closeness measure are tried: Cosine likeness, Jacquard similitude, and Correlation Coefficient. The nature of the got models is assessed and checked out. The results display that the great execution is comprehensive utilizing k-implies and okay-medoids joined with cosine similitude. We watch style in the nature of bunching based on the assessment measure utilized. Additionally, as the estimation of okay builds, the character of the next crew improves. At long last, we find the classifications of commencement ventures provided in the information technological know-how division for female understudies.

Keywords

Clustering, Cosine Similarity, Data Mining, K-Means, K-Medoids, Text Mining.
Subscription Login to verify subscription
User
Notifications
Font Size



  • Towards Semantically Sensitive Text Clustering: A Feature Space Modeling Technology Based on Dimension Extension

Abstract Views: 308  |  PDF Views: 0

Authors

Chitti Babukalapati
Department of Computer Science, GATE College, Tirupati, Andhra Pradesh, India

Abstract


Content bunching is a large use of knowledge mining. It’s concerned about gathering related content archives together. Proper now paper, a number of models are worked to bunch capstone venture archives using three grouping systems: okay-implies, ok-implies rapid, and k-medoids. Our dataset is acquired from the library of the University Pc and Information Sciences, King Saud tuition, Riyadh. Three closeness measure are tried: Cosine likeness, Jacquard similitude, and Correlation Coefficient. The nature of the got models is assessed and checked out. The results display that the great execution is comprehensive utilizing k-implies and okay-medoids joined with cosine similitude. We watch style in the nature of bunching based on the assessment measure utilized. Additionally, as the estimation of okay builds, the character of the next crew improves. At long last, we find the classifications of commencement ventures provided in the information technological know-how division for female understudies.

Keywords


Clustering, Cosine Similarity, Data Mining, K-Means, K-Medoids, Text Mining.

References