Open Access Open Access  Restricted Access Subscription Access
Open Access Open Access Open Access  Restricted Access Restricted Access Subscription Access

Feature Selection Techniques with Distributed Data Mining Models


Affiliations
1 Department of Computer Science, D J Academy for Managerial Excellence, Coimbatore-32, India
     

   Subscribe/Renew Journal


Data mediated knowledge discovery is essential for any end users for value added decision making. Discerning vital, accurate and precise knowledge in the classification, various feature subsets are necessary. Apart from feature selection, processing and representation of data is also indispensable for analysis and implementation of any knowledge. Principal Component Analysis is the used for data pre-processing and representation of data. Eigen vectors, co variance matrix are estimated for distributed environment where local and global set are computed and evaluated. It reduces the dimensionality of data. RELIEF, CMIM and other feature selection methods are discussed here in this paper. On selecting the features may increase the classification accuracy and enhance classification and prediction.

Keywords

Feature Selection, Models, Distributed Data Mining, PCA, Classification, CMIM, mRMR, RELIEF.
User
Subscription Login to verify subscription
Notifications
Font Size

Abstract Views: 236

PDF Views: 1




  • Feature Selection Techniques with Distributed Data Mining Models

Abstract Views: 236  |  PDF Views: 1

Authors

E. Chandra
Department of Computer Science, D J Academy for Managerial Excellence, Coimbatore-32, India
P. Ajitha
Department of Computer Science, D J Academy for Managerial Excellence, Coimbatore-32, India

Abstract


Data mediated knowledge discovery is essential for any end users for value added decision making. Discerning vital, accurate and precise knowledge in the classification, various feature subsets are necessary. Apart from feature selection, processing and representation of data is also indispensable for analysis and implementation of any knowledge. Principal Component Analysis is the used for data pre-processing and representation of data. Eigen vectors, co variance matrix are estimated for distributed environment where local and global set are computed and evaluated. It reduces the dimensionality of data. RELIEF, CMIM and other feature selection methods are discussed here in this paper. On selecting the features may increase the classification accuracy and enhance classification and prediction.

Keywords


Feature Selection, Models, Distributed Data Mining, PCA, Classification, CMIM, mRMR, RELIEF.