Open Access Open Access  Restricted Access Subscription Access
Open Access Open Access Open Access  Restricted Access Restricted Access Subscription Access

Constraint-Based Multidimensional Frequent Sequential Pattern in Web Usage Mining


Affiliations
1 Department of Computer Applications, Thiagarajar College of Engineering, Madurai, Tamil Nadu, India
2 Department of Mathematics, Thiagarajar College of Engineering, Madurai, Tamil Nadu, India
3 Department of Computer Applications, K.L.N College of Engineering, Madurai, Tamil Nadu, India
     

   Subscribe/Renew Journal


Sequential Pattern Mining is one of the important approaches, which extracts frequent subsequences as pattern in a Sequence Database. Basic formulation of the frequent sequential pattern discovery problem assumes that the only constraint to be satisfied by discovered patterns is the minimum support threshold. Data mining systems should be able to exploit such constraints to speed-up the mining process. Though much work has been done in this area on one and two-dimensional database, mining sequential patterns from multidimensional database is yet on progress. In this paper we introduce an efficient strategy for discovering Web usage mining is the application of data mining techniques to discover usage patterns from Web data, in order to understand and better serve the needs of Web-based applications. Web usage mining consists of three phases, namely preprocessing, pattern discovery, and pattern analysis. This paper describes each of these phases in detail. 
The main objective of multidimensional sequential pattern mining is to provide the end user with more useful and interesting patterns. To mine such kind of sequence data, we have used an extended version of the prefixspan (EXT-Prefixspan) algorithm to extract the Constraint-based multidimensional frequent sequential patterns in web usage mining. A web access pattern is a sequential pattern that is pursued frequently by users. Using these sequences as prefixes a projected database is constructed which is then recursively mined to find the frequent sequential patterns. The EXT-Prefixspan mines the complete set of patterns but greatly reduces the efforts of candidate subsequence generation. Moreover, prefix –projection substantially reduces the size of projected database and leads to efficient processing. We show that the EXT-Prefixspan algorithm is more flexible at capturing desired knowledge than previous Algorithm.

Keywords

Data Mining, Frequent Pattern Mining, Sequence Pattern Mining, Web Usage Mining.
User
Subscription Login to verify subscription
Notifications
Font Size

Abstract Views: 263

PDF Views: 2




  • Constraint-Based Multidimensional Frequent Sequential Pattern in Web Usage Mining

Abstract Views: 263  |  PDF Views: 2

Authors

S. Vijayalakshmi
Department of Computer Applications, Thiagarajar College of Engineering, Madurai, Tamil Nadu, India
V. Mohan
Department of Mathematics, Thiagarajar College of Engineering, Madurai, Tamil Nadu, India
S. Suresh Raja
Department of Computer Applications, K.L.N College of Engineering, Madurai, Tamil Nadu, India

Abstract


Sequential Pattern Mining is one of the important approaches, which extracts frequent subsequences as pattern in a Sequence Database. Basic formulation of the frequent sequential pattern discovery problem assumes that the only constraint to be satisfied by discovered patterns is the minimum support threshold. Data mining systems should be able to exploit such constraints to speed-up the mining process. Though much work has been done in this area on one and two-dimensional database, mining sequential patterns from multidimensional database is yet on progress. In this paper we introduce an efficient strategy for discovering Web usage mining is the application of data mining techniques to discover usage patterns from Web data, in order to understand and better serve the needs of Web-based applications. Web usage mining consists of three phases, namely preprocessing, pattern discovery, and pattern analysis. This paper describes each of these phases in detail. 
The main objective of multidimensional sequential pattern mining is to provide the end user with more useful and interesting patterns. To mine such kind of sequence data, we have used an extended version of the prefixspan (EXT-Prefixspan) algorithm to extract the Constraint-based multidimensional frequent sequential patterns in web usage mining. A web access pattern is a sequential pattern that is pursued frequently by users. Using these sequences as prefixes a projected database is constructed which is then recursively mined to find the frequent sequential patterns. The EXT-Prefixspan mines the complete set of patterns but greatly reduces the efforts of candidate subsequence generation. Moreover, prefix –projection substantially reduces the size of projected database and leads to efficient processing. We show that the EXT-Prefixspan algorithm is more flexible at capturing desired knowledge than previous Algorithm.

Keywords


Data Mining, Frequent Pattern Mining, Sequence Pattern Mining, Web Usage Mining.