Open Access Open Access  Restricted Access Subscription Access

An Optimization-Based Data Search Clustering Approach for Multidimensional Datasets


Affiliations
1 Research Scholar, Department of CS & IT, Rabindranath Tagore University, Bhopal, India
2 Associate Professor, Department of CS & IT, Rabindranath Tagore University, Bhopal, India

Multidimensional data refers to datasets featuring with multiple columns, often referred to as features or attributes. The challenge in multidimensional data analysis is that clusters and outliers are often detected based on the dataset's features, which may not align well with ground truth in real-world scenarios (e.g., gene expression data). Efficiency is a critical consideration as optimized clustering algorithms must handle the growing size of multidimensional datasets. In this research paper, we have proposed a Sinusoidal Chaotic and Information Entropy based Elephant-Herding Optimization for Clustering (SCIE_EOC) to data search in multidimensional datasets. The result shows the proposed method shows around 92-95 per cent accuracy for different datasets which is around 5 per cent better than the earlier methods.

Keywords

Multidimensional dataset, Clustering, Optimization, Data Search Capability
User
Notifications
Font Size

Abstract Views: 18




  • An Optimization-Based Data Search Clustering Approach for Multidimensional Datasets

Abstract Views: 18  | 

Authors

Pradeep Kumar Atulker
Research Scholar, Department of CS & IT, Rabindranath Tagore University, Bhopal, India
Rajendra Gupta
Associate Professor, Department of CS & IT, Rabindranath Tagore University, Bhopal, India

Abstract


Multidimensional data refers to datasets featuring with multiple columns, often referred to as features or attributes. The challenge in multidimensional data analysis is that clusters and outliers are often detected based on the dataset's features, which may not align well with ground truth in real-world scenarios (e.g., gene expression data). Efficiency is a critical consideration as optimized clustering algorithms must handle the growing size of multidimensional datasets. In this research paper, we have proposed a Sinusoidal Chaotic and Information Entropy based Elephant-Herding Optimization for Clustering (SCIE_EOC) to data search in multidimensional datasets. The result shows the proposed method shows around 92-95 per cent accuracy for different datasets which is around 5 per cent better than the earlier methods.

Keywords


Multidimensional dataset, Clustering, Optimization, Data Search Capability