Open Access Open Access  Restricted Access Subscription Access
Open Access Open Access Open Access  Restricted Access Restricted Access Subscription Access

Enhanced Neighborhood Normalized Pointwise Mutual Information Algorithm for Constraint Aware Data Clustering


Affiliations
1 Department of Computer Science and Engineering, University Visvesvaraya College of Engineering, India
     

   Subscribe/Renew Journal


Clustering of similar data items is an important technique in mining useful patterns. To enhance the performance of Clustering, training or learning is an important task. A constraint learning semi-supervised methodology is proposed which incorporates SVM and Normalized Point wise Mutual Information Computation Strategy to increase the relevance as well as the performance efficiency of clustering. The SVM Classifier is of Hard Margin Type to roughly classify the initial set. A recursive re-clustering approach is proposed for achieving higher degree of relevance in the final clustered set by incorporating ENNPI algorithm. An overall enriched F-Measure value of 94.09% is achieved as compared to existing algorithms.

Keywords

Clustering, Constraint Learning, Normalized Pointwise Mutual Information, Recursive Re-Clustering, SVM.
Subscription Login to verify subscription
User
Notifications
Font Size

Abstract Views: 268

PDF Views: 2




  • Enhanced Neighborhood Normalized Pointwise Mutual Information Algorithm for Constraint Aware Data Clustering

Abstract Views: 268  |  PDF Views: 2

Authors

C. N. Pushpa
Department of Computer Science and Engineering, University Visvesvaraya College of Engineering, India
Gerard Deepak
Department of Computer Science and Engineering, University Visvesvaraya College of Engineering, India
Mohammed Zakir
Department of Computer Science and Engineering, University Visvesvaraya College of Engineering, India
J. Thriveni
Department of Computer Science and Engineering, University Visvesvaraya College of Engineering, India
K. R. Venugopal
Department of Computer Science and Engineering, University Visvesvaraya College of Engineering, India

Abstract


Clustering of similar data items is an important technique in mining useful patterns. To enhance the performance of Clustering, training or learning is an important task. A constraint learning semi-supervised methodology is proposed which incorporates SVM and Normalized Point wise Mutual Information Computation Strategy to increase the relevance as well as the performance efficiency of clustering. The SVM Classifier is of Hard Margin Type to roughly classify the initial set. A recursive re-clustering approach is proposed for achieving higher degree of relevance in the final clustered set by incorporating ENNPI algorithm. An overall enriched F-Measure value of 94.09% is achieved as compared to existing algorithms.

Keywords


Clustering, Constraint Learning, Normalized Pointwise Mutual Information, Recursive Re-Clustering, SVM.