Open Access Open Access  Restricted Access Subscription Access
Open Access Open Access Open Access  Restricted Access Restricted Access Subscription Access

Parallel Mining of Frequent Maximal Itemsets Using Order Preserving Generators


Affiliations
1 Department of Information Technology, PSG College of Technology, Tamil Nadu, India
2 Department of Computer Science and Engineering, Alpha Engineering College, Tamil Nadu, India
     

   Subscribe/Renew Journal


In this paper, we propose a parallel algorithm for mining maximal itemsets. We propose POP-MAX (Parallel Order Preserving MAXimal itemset algorithm), a fast and memory efficient parallel algorithm which enumerates all the maximal patterns concurrently and independently across several nodes. Also, POP-MAX uses an efficient maximality checking technique which determines the maximality of an itemset using less number of items. To enhance the load sharing among different nodes, we have used round robin strategy which achieves load balancing as high as 90%. We have also incorporated bit-vectors and numerous optimizations to reduce the memory consumption and overall running time of the algorithm. Our comprehensive experimental analyses involving both real and synthetic datasets show that our algorithm takes less memory and less running time than other maximal itemset mining algorithms.

Keywords

Data Mining, Closed Itemsets, Maximal Itemsets, Mining Methods.
Subscription Login to verify subscription
User
Notifications
Font Size

Abstract Views: 222

PDF Views: 0




  • Parallel Mining of Frequent Maximal Itemsets Using Order Preserving Generators

Abstract Views: 222  |  PDF Views: 0

Authors

R. V. Nataraj
Department of Information Technology, PSG College of Technology, Tamil Nadu, India
S. Selvan
Department of Computer Science and Engineering, Alpha Engineering College, Tamil Nadu, India

Abstract


In this paper, we propose a parallel algorithm for mining maximal itemsets. We propose POP-MAX (Parallel Order Preserving MAXimal itemset algorithm), a fast and memory efficient parallel algorithm which enumerates all the maximal patterns concurrently and independently across several nodes. Also, POP-MAX uses an efficient maximality checking technique which determines the maximality of an itemset using less number of items. To enhance the load sharing among different nodes, we have used round robin strategy which achieves load balancing as high as 90%. We have also incorporated bit-vectors and numerous optimizations to reduce the memory consumption and overall running time of the algorithm. Our comprehensive experimental analyses involving both real and synthetic datasets show that our algorithm takes less memory and less running time than other maximal itemset mining algorithms.

Keywords


Data Mining, Closed Itemsets, Maximal Itemsets, Mining Methods.