Open Access Open Access  Restricted Access Subscription Access
Open Access Open Access Open Access  Restricted Access Restricted Access Subscription Access

An Enhanced Projected Clustering Algorithm for High Dimensional Space


Affiliations
1 Department of Computer Science, Sri Ramakrishna College of Arts and Science for Women, Coimbatore, India
2 Department of Computer Science Dr.SNS College of Arts and Science, Coimbatore, India
3 Department of Computer Science and Engineering, Park College of Engineering & Technology, Coimbatore, India
     

   Subscribe/Renew Journal


Clustering is a data mining technique for identifying groups in the data set based on some similarity measure. Clustering high dimensional data has been a major challenge due to the inherent sparsity of the points. Most existing clustering algorithms become substantially inefficient if the required similarity measure is computed between data points in the full dimensional space. A number of projected clustering algorithms have been proposed to overcome the above issue. This led to the development of a robust partitional distance based projected clustering algorithm based on K-means algorithm with the computation of distance restricted to subsets of attributes with dense object values. The algorithm is capable of detecting projected clusters of low dimensionality embedded in a high-dimensional space and avoids the computation of the distance in full-dimensional space. The algorithm has been demonstrated using synthetic and real datasets.

Keywords

Clustering, High Dimensional Data, Projected Cluster, K-Means Clustering, Subspace Clustering.
User
Subscription Login to verify subscription
Notifications
Font Size

Abstract Views: 199

PDF Views: 1




  • An Enhanced Projected Clustering Algorithm for High Dimensional Space

Abstract Views: 199  |  PDF Views: 1

Authors

B. Shanmugapriya
Department of Computer Science, Sri Ramakrishna College of Arts and Science for Women, Coimbatore, India
M. Punithavalli
Department of Computer Science Dr.SNS College of Arts and Science, Coimbatore, India
G. Selvavinayagam
Department of Computer Science and Engineering, Park College of Engineering & Technology, Coimbatore, India

Abstract


Clustering is a data mining technique for identifying groups in the data set based on some similarity measure. Clustering high dimensional data has been a major challenge due to the inherent sparsity of the points. Most existing clustering algorithms become substantially inefficient if the required similarity measure is computed between data points in the full dimensional space. A number of projected clustering algorithms have been proposed to overcome the above issue. This led to the development of a robust partitional distance based projected clustering algorithm based on K-means algorithm with the computation of distance restricted to subsets of attributes with dense object values. The algorithm is capable of detecting projected clusters of low dimensionality embedded in a high-dimensional space and avoids the computation of the distance in full-dimensional space. The algorithm has been demonstrated using synthetic and real datasets.

Keywords


Clustering, High Dimensional Data, Projected Cluster, K-Means Clustering, Subspace Clustering.