
An Efficient Clustering Method in Unlabeled Data Sets Using KMBA Algorithm


Affiliations
1 Department of Computer Science and Engineering, Bannari Amman Institute of Technology, Sathyamangalam, Tamilnadu, India
2 Department of Information Technology, Bannari Amman Institute of Technology, Sathyamangalam, Tamilnadu, India
     



Abstract

Cluster analysis is one of the primary data analysis methods, and the K-means (KM) algorithm is well known for its efficiency in clustering large data sets. KM is a popular unsupervised clustering algorithm, but it is sensitive to the choice of initial cluster centroids, and choosing the value of K is itself a problem because the number of clusters in the data is often hard to predict in advance. There is no efficient, universal method for selecting K; until now it has been chosen as a random value. In this paper, we propose KMBA, a new metaheuristic method that combines KM with the Bat Algorithm (BA), which is based on the echolocation behavior of bats, to identify the initial values and so overcome these KM issues. The algorithm does not require the user to specify the number of clusters or the cluster centres in advance. The method first finds the cluster centres using BA and then forms the clusters using KM. Combining KM and BA yields efficient clustering with minimal computational resources and time. The experimental results show that the proposed algorithm performs better than the existing algorithms.
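
The two-stage idea described above (BA searches for cluster centres, KM then forms the clusters) can be illustrated with a minimal Python sketch. This is not the authors' implementation. The function names (`bat_seed`, `kmeans`, `kmba_sketch`), the sum-of-squared-errors objective, the constant loudness and pulse rate, and the step sizes are all assumptions made for illustration. The sketch also takes `k` as an input for simplicity, whereas the abstract states that KMBA does not require it in advance; how KMBA determines the number of clusters is not described here, so that part is left out.

```python
import random


def sq_dist(a, b):
    """Squared Euclidean distance between two points."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def sse(points, centres):
    """Sum of squared distances from each point to its nearest centre."""
    return sum(min(sq_dist(p, c) for c in centres) for p in points)


def bat_seed(points, k, n_bats=15, n_iter=40, seed=0):
    """Search for k starting centres with a simplified bat algorithm."""
    rng = random.Random(seed)
    dim = len(points[0])
    lo = [min(p[d] for p in points) for d in range(dim)]
    hi = [max(p[d] for p in points) for d in range(dim)]
    # Each bat is one candidate solution: a list of k centres.
    bats = [[list(rng.choice(points)) for _ in range(k)] for _ in range(n_bats)]
    vel = [[[0.0] * dim for _ in range(k)] for _ in range(n_bats)]
    fit = [sse(points, b) for b in bats]
    i_best = min(range(n_bats), key=lambda i: fit[i])
    best, best_fit = [c[:] for c in bats[i_best]], fit[i_best]
    loudness, pulse_rate = 0.9, 0.5  # assumed values, held constant for brevity

    for _ in range(n_iter):
        for i in range(n_bats):
            freq = rng.random()  # pulse frequency drawn from [0, 1)
            local_walk = rng.random() > pulse_rate
            cand = []
            for j in range(k):
                centre = []
                for d in range(dim):
                    if local_walk:
                        # Small random step around the best solution so far.
                        x = best[j][d] + 0.01 * (hi[d] - lo[d]) * rng.gauss(0, 1)
                    else:
                        # Frequency-scaled velocity pulls the bat toward the best.
                        vel[i][j][d] += (best[j][d] - bats[i][j][d]) * freq
                        x = bats[i][j][d] + vel[i][j][d]
                    centre.append(min(max(x, lo[d]), hi[d]))  # stay in data bounds
                cand.append(centre)
            cand_fit = sse(points, cand)
            # Accept the move only if it is no worse, with probability `loudness`.
            if cand_fit <= fit[i] and rng.random() < loudness:
                bats[i], fit[i] = cand, cand_fit
            if cand_fit < best_fit:
                best, best_fit = [c[:] for c in cand], cand_fit
    return best


def assign(points, centres):
    """Index of the nearest centre for every point."""
    return [min(range(len(centres)), key=lambda j: sq_dist(p, centres[j]))
            for p in points]


def kmeans(points, centres, n_iter=50):
    """Plain Lloyd iterations starting from the given centres."""
    centres = [list(c) for c in centres]
    for _ in range(n_iter):
        labels = assign(points, centres)
        new = []
        for j, old in enumerate(centres):
            members = [p for p, lab in zip(points, labels) if lab == j]
            # Keep the old centre if its cluster is empty.
            new.append([sum(col) / len(members) for col in zip(*members)]
                       if members else old)
        if new == centres:
            break
        centres = new
    return centres, assign(points, centres)


def kmba_sketch(points, k, seed=0):
    """Stage 1: bat search for centres. Stage 2: K-means from those centres."""
    return kmeans(points, bat_seed(points, k, seed=seed))
```

Calling `kmba_sketch(points, k=2)` on a list of coordinate tuples returns the final centres and one cluster label per point. The point of the design is that KM starts from centres already tuned by a global search rather than from a random draw, which is the sensitivity the abstract identifies.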

Keywords

Centroid, Clustering, Metaheuristic, Bat Algorithm.






Authors

G. Komarasamy
Department of Computer Science and Engineering, Bannari Amman Institute of Technology, Sathyamangalam, Tamilnadu, India
Amitabh Wahi
Department of Information Technology, Bannari Amman Institute of Technology, Sathyamangalam, Tamilnadu, India
