Open Access Open Access  Restricted Access Subscription Access
Open Access Open Access Open Access  Restricted Access Restricted Access Subscription Access

Perturbation Based Technique for Privacy Preserving Clustering of High Dimensional Data


Affiliations
1 Department of Computer Applications, Sri Krishna College of Engineering and Technology, Kuniamuthur, Coimbatore, India
2 Department of Mathematics, Coimbatore Institute of Technology, Peelamedu, Coimbatore, India
     

   Subscribe/Renew Journal


Privacy of personal data is a fundamental human right. The freedom and transparency of data flow due to swift advances in data processing techniques and internet technology has heightened concerns of privacy. Reluctance to provide personal information could impede the success of data mining. Concern about the privacy of data is becoming an important concern in business, academic, defense and health care domains. Privacy-preserving data mining (PPDM) addresses these issues by striking a balance between privacy preservation and knowledge discovery. We propose a novel linear component analysis based transformation technique for Privacy preserving clustering to preserve the privacy of confidential data. We further evaluate the performance of this technique with the classical k-means clustering algorithm. The effectiveness of our new approach is demonstrated by various experiments conducted on synthetic data sets of varying dimensions. The accuracy of clustering has been computed before and after privacy preserving transformation using adjusted rand Index. Based on our results, we conclude that our method is an effective and feasible technique to build data mining models from perturbed data.

Keywords

Adjusted Rand Index, K-Means, Linear Components Analysis, Transformation Matrix.
User
Subscription Login to verify subscription
Notifications
Font Size

Abstract Views: 229

PDF Views: 1




  • Perturbation Based Technique for Privacy Preserving Clustering of High Dimensional Data

Abstract Views: 229  |  PDF Views: 1

Authors

R. VidyaBanu
Department of Computer Applications, Sri Krishna College of Engineering and Technology, Kuniamuthur, Coimbatore, India
N. Nagaveni
Department of Mathematics, Coimbatore Institute of Technology, Peelamedu, Coimbatore, India

Abstract


Privacy of personal data is a fundamental human right. The freedom and transparency of data flow due to swift advances in data processing techniques and internet technology has heightened concerns of privacy. Reluctance to provide personal information could impede the success of data mining. Concern about the privacy of data is becoming an important concern in business, academic, defense and health care domains. Privacy-preserving data mining (PPDM) addresses these issues by striking a balance between privacy preservation and knowledge discovery. We propose a novel linear component analysis based transformation technique for Privacy preserving clustering to preserve the privacy of confidential data. We further evaluate the performance of this technique with the classical k-means clustering algorithm. The effectiveness of our new approach is demonstrated by various experiments conducted on synthetic data sets of varying dimensions. The accuracy of clustering has been computed before and after privacy preserving transformation using adjusted rand Index. Based on our results, we conclude that our method is an effective and feasible technique to build data mining models from perturbed data.

Keywords


Adjusted Rand Index, K-Means, Linear Components Analysis, Transformation Matrix.