Open Access Open Access  Restricted Access Subscription Access
Open Access Open Access Open Access  Restricted Access Restricted Access Subscription Access

A Survey on Partitioning and Parallel Partitioning Clustering Algorithms


Affiliations
1 Department of Computer Applications, S'O'A University, Bhubaneswar-30, India
     

   Subscribe/Renew Journal


Learning is the process of generating useful information from a huge volume of data. Learning can be classified as supervised learning and unsupervised learning. Clustering is a kind of unsupervised learning. A pattern representing a common behaviour or characteristics that exist among each item can be generated. This paper gives an overview of partition clustering and parallel implementation of this clustering algorithm. It describes about the general working behaviour, the methodologies followed on these approaches and the parameters which affects the performance of these algorithms. Clustering is grouping input data sets into subsets, called 'clusters' within which the elements are somewhat similar. In general, clustering is an unsupervised learning task as very little or no prior knowledge is given except the input data sets. The tasks have been used in many fields and therefore various clustering algorithms have been developed. Clustering task is, however, computationally expensive as many of the algorithms require iterative or recursive procedures and most of real-life data is high dimensional. Therefore, the parallelization of clustering algorithms is inevitable, and various parallel clustering algorithms have been implemented and applied to many applications. In this paper, we review a partition clustering algorithms and their parallel versions as well. Although the parallel clustering algorithms have been used for many applications, the clustering tasks are applied as pre processing steps for parallelization of other algorithms too.

Keywords

Clustering, Supervised Learning, Unsupervised Learning, Parallel Clustering.
User
Subscription Login to verify subscription
Notifications
Font Size

Abstract Views: 278

PDF Views: 2




  • A Survey on Partitioning and Parallel Partitioning Clustering Algorithms

Abstract Views: 278  |  PDF Views: 2

Authors

Sushreeta Tripathy
Department of Computer Applications, S'O'A University, Bhubaneswar-30, India
Sarbeswara Hota
Department of Computer Applications, S'O'A University, Bhubaneswar-30, India

Abstract


Learning is the process of generating useful information from a huge volume of data. Learning can be classified as supervised learning and unsupervised learning. Clustering is a kind of unsupervised learning. A pattern representing a common behaviour or characteristics that exist among each item can be generated. This paper gives an overview of partition clustering and parallel implementation of this clustering algorithm. It describes about the general working behaviour, the methodologies followed on these approaches and the parameters which affects the performance of these algorithms. Clustering is grouping input data sets into subsets, called 'clusters' within which the elements are somewhat similar. In general, clustering is an unsupervised learning task as very little or no prior knowledge is given except the input data sets. The tasks have been used in many fields and therefore various clustering algorithms have been developed. Clustering task is, however, computationally expensive as many of the algorithms require iterative or recursive procedures and most of real-life data is high dimensional. Therefore, the parallelization of clustering algorithms is inevitable, and various parallel clustering algorithms have been implemented and applied to many applications. In this paper, we review a partition clustering algorithms and their parallel versions as well. Although the parallel clustering algorithms have been used for many applications, the clustering tasks are applied as pre processing steps for parallelization of other algorithms too.

Keywords


Clustering, Supervised Learning, Unsupervised Learning, Parallel Clustering.