Open Access Open Access  Restricted Access Subscription Access
Open Access Open Access Open Access  Restricted Access Restricted Access Subscription Access

Deriving Private Information from Randomized Dataset Using Data Reorganization Techniques


Affiliations
1 Department of Computer Science and Engineering, K.S. Rangasamy College of Technology, Tamilnadu, India
     

   Subscribe/Renew Journal


Publishing data about individuals without revealing sensitive information about them is an important problem. To enforce privacy-preserving paradigms, such as k-anonymity and l-diversity, while minimizing the information loss incurred in the anonymizing process (i.e. maximize data utility). work well for fixed-schema data, with low dimensionality. Nevertheless, certain applications require privacy-preserving publishing of transaction data (or basket data), which involves hundreds or even thousands of dimensions, rendering existing methods unusable. A novel anonymization method for sparse high dimensional data is achieved. Two categories of novel anonymization method for sparse high-dimensional data. The first category is based on approximate nearest-neighbor (NN) search in high-dimensional spaces, which is efficiently performed through locality-sensitive hashing (LSH). In the second category, a data transformation that capture the correlation in the underlying data is Gray encoding-based sorting. These representations facilitate the formation of anonymized groups with low information loss, through an efficient linear-time heuristic.

Keywords

Privacy, Transactional Data, Anonymization.
User
Subscription Login to verify subscription
Notifications
Font Size

Abstract Views: 276

PDF Views: 1




  • Deriving Private Information from Randomized Dataset Using Data Reorganization Techniques

Abstract Views: 276  |  PDF Views: 1

Authors

S. Shobana
Department of Computer Science and Engineering, K.S. Rangasamy College of Technology, Tamilnadu, India
P. Nagajothi
Department of Computer Science and Engineering, K.S. Rangasamy College of Technology, Tamilnadu, India

Abstract


Publishing data about individuals without revealing sensitive information about them is an important problem. To enforce privacy-preserving paradigms, such as k-anonymity and l-diversity, while minimizing the information loss incurred in the anonymizing process (i.e. maximize data utility). work well for fixed-schema data, with low dimensionality. Nevertheless, certain applications require privacy-preserving publishing of transaction data (or basket data), which involves hundreds or even thousands of dimensions, rendering existing methods unusable. A novel anonymization method for sparse high dimensional data is achieved. Two categories of novel anonymization method for sparse high-dimensional data. The first category is based on approximate nearest-neighbor (NN) search in high-dimensional spaces, which is efficiently performed through locality-sensitive hashing (LSH). In the second category, a data transformation that capture the correlation in the underlying data is Gray encoding-based sorting. These representations facilitate the formation of anonymized groups with low information loss, through an efficient linear-time heuristic.

Keywords


Privacy, Transactional Data, Anonymization.